Illinois Data Bank

Dataset for studying transitive closure in citations

This dataset contains the first-generation (1st-gen) and second-generation (2nd-gen) citation relationships to a set of focal papers. A 1st-gen citation relationship is an instance of a paper citing a focal paper; the citing paper is called a "1st-gen citation." A 2nd-gen citation relationship is an instance of a paper citing a 1st-gen citation; the citing paper is called a "2nd-gen citation." When a 2nd-gen citation is also a 1st-gen citation, that is, when it cites both the focal paper and another paper that cites the focal paper, it creates a transitive closure with the focal paper.
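The definition above can be illustrated with a minimal Python sketch. It runs on toy data with made-up identifiers, not on the dataset files. The function name and the input shapes (a list of citing-paper IDs and a list of (citing, cited) pairs) are assumptions for illustration only, and they do not reflect the column layout of the CSV files or the scripts in the linked repository.

```python
def find_transitive_closures(first_gen, second_gen):
    """Return the 2nd-gen relationships whose citing paper is also a 1st-gen citation.

    first_gen:  iterable of paper IDs that cite the focal paper (hypothetical shape)
    second_gen: iterable of (citing, cited) pairs, where `cited` is a
                1st-gen citation (hypothetical shape)
    """
    first_gen_set = set(first_gen)
    return sorted(
        (citing, cited)
        for citing, cited in second_gen
        # Closure: the citing paper cites the focal paper directly (it is in
        # the 1st-gen set) and also cites another 1st-gen citation.
        if citing in first_gen_set and cited in first_gen_set
    )


# Toy data with made-up identifiers: A, B, and C all cite the focal paper.
first_gen = ["A", "B", "C"]
second_gen = [("B", "A"), ("D", "A"), ("C", "B"), ("E", "C")]

closures = find_transitive_closures(first_gen, second_gen)
print(closures)
```

In this toy example, B cites both A and the focal paper, and C cites both B and the focal paper, so those two relationships form transitive closures. D and E do not cite the focal paper directly and are excluded.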

Each focal paper has an abbreviation, which can be found below. The 1st-gen and 2nd-gen citation relationships were extracted from the Curated Open Citation Dataset (Korobskiy & Chacko, 2023), which is derived from a copy of COCI, the OpenCitations Index of Crossref Open DOI-to-DOI Citations, downloaded on May 6, 2023. Scripts used to collect this dataset can be found at https://github.com/yuanxiesa/transitive_closure_study. Each focal paper currently has two files: {abbreviation}_1st.csv contains the 1st-gen citation relationships; {abbreviation}_2nd.csv contains the 2nd-gen citation relationships.

Focal paper abbreviation == "louvain": Blondel, V. D., Guillaume, J.-L., Lambiotte, R., & Lefebvre, E. (2008). Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment, 2008(10), P10008. https://doi.org/10.1088/1742-5468/2008/10/P10008

Focal paper abbreviation == "lp": Raghavan, U. N., Albert, R., & Kumara, S. (2007). Near linear time algorithm to detect community structures in large-scale networks. Physical Review E, 76(3), 036106. https://doi.org/10.1103/PhysRevE.76.036106

Focal paper abbreviation == "gn": Newman, M. E. J., & Girvan, M. (2004). Finding and evaluating community structure in networks. Physical Review E, 69(2), 026113. https://doi.org/10.1103/PhysRevE.69.026113

Subject: Social Sciences
Keywords: transitive closure; citations; community detection algorithms; OpenCitations; method papers
License: CC BY
Yuanxi Fu
Version: 1
DOI: 10.13012/B2IDB-3971668_V1
Publication Date: 2025-05-02

4.58 KB File
1.39 MB File
38.8 MB File
1.93 MB File
28.9 MB File
360 KB File
7.59 MB File
