Displaying 76 - 100 of 801 in total
Subject Area
Funder
Publication Year
License
Illinois Data Bank Dataset Search Results

Dataset Search Results

published: 2025-05-05
 
The dataset includes responses from approximately 550 participants to survey questions about trust in images labeled with AI-related tags, compared to other images found online. The questions also explore how the type of label influences their trust.
keywords: Artificial intelligence (AI); Trust in AI; Al labeling; AI ethics
published: 2025-06-05
 
There are two files in this dataset. File1: AffiNorm AffiNorm contains 1,001 rows, including one header row, randomly sampled from MapAffil 2018 Dataset ([**https://doi.org/10.13012/B2IDB-2556310_V1**](https://databank.illinois.edu/datasets/IDB-2556310)). Each row in the file corresponds to a particular author on a particular PubMed record, and contains the following 26 columns, comma-delimited. All columns are ASCII, except city which contains Latin-1. COLUMN DESCRIPTION 1. PMID: the PubMed identifier. int. 2. ORDER: the position of the author. int. 3. YEAR - The year of publication. int(4), eg: 1975. 4. affiliation - affiliation string of the author. eg: Department of Pathology, University of Chicago, Illinois 60637. 5. annotation_type: the number of institutions annotated, denoted by S, M, O, or Z, where "S" (single) indicates 1 institution was annotated; "M" (Multiple) indicates more than one institutions were annotated; "O" (Out of Vocabulary or None) indicates no institution was annotated, but an institution was apparently mentioned; "Z" indicates no institution was mentioned. 6. Institution: the standard name(s) of the annotated institution(s), according to ROR. if "S" (single institution), it is saved as a string, eg: University of Chicago; if "M", it is saved as a string that looks like a python list, eg: ['Public Health Laboratory Service'; 'Centre for Applied Microbiology and Research']; if "O" or "Z", then blank. 7. inst_type: the type of institution, according to ROR. the potential values are: education, funder, healthcare, company, archive, nonprofit, government, facility, other. An institution may have more than one type, eg: ['Education', 'Funder'] 8. type_edu: TRUE if the inst_type contains "Education"; FALSE otherwise. 9. RORid: ROR identifier(s), eg: https://ror.org/05hs6h993. when multiple, the order corresponds to institution (column 6) 10. RORid_label. the standard name(s) of the annotated institution(s) according to ROR.same as institution (column 6) 11. GRIDid: GRID identifier(s). eg: grid.170205.1 12. GRIDid_label: the standard name(s) of the annotated institution(s) according to GRID. eg: University of Chicago. 13. WikiDataid: WikiData identifier(s). eg: Q131252 14. WikiDataid_label: the standard name(s) of the annotated institution(s) according to WikiData. eg: University of Chicago 15. synonyms: a comma separated list of variant names from InsVar (file 2) . format of string. eg: University of Chicago, Chicago University, U of C, UChicago, uchicago.edu, U Chicago, ... 16. MapAffil-grid: GRID from the MapAffil 2018 Dataset. 17. MapAffil-grid_label: The standard name of institution from MapAffil 2018 Dataset. 18. judge_mapA: TRUE if GRIDid (column 11) contains MapAffil-grid (column 16); FALSE otherwise. 19. MapAffiltemporal-grid: GRID from the temporal version of MapAffil, http://abel.ischool.illinois.edu/data/MapAffilTempo2018.tsv.gz 20. MapAffiltemporal-grid_label: The standard name of institution from MapAffilTemporal 2018 Dataset. 21. judge_mapT: TRUE if GRIDid (column 11) contains MapAffiltemporal-grid (column 19); FALSE otherwise. 22. RORapi_query_id: ROR from ROR api tool (query endpoint) 23. RORapi_query_id_label: The standard name of institution from ROR api tool (query endpoint). format in string. 24. judge_rorapi_affiliation: TRUE if RORid (column 9) contains RORapi_query_id (column 22); FALSE otherwise. 25. rorapi_affiliation_id: ROR from ROR api tool (affiliation endpoint). 26. judge_rorapi_affiliation: TRUE if RORid (column 9) contains RORapi_affiliation (column 25); FALSE otherwise. File 2: insVar.json InsVar is a supplementary dataset for AffiNorm, which includes the institution ID and its redirected aliases from wikidata. The institution ID list is from GRID, the redirected aliases are from wiki api, for example: https://en.wikipedia.org/wiki/Special:WhatLinksHere?target=University+of+Illinois+Urbana-Champaign&namespace=&hidetrans=1&hidelinks=1&limit=100 In InsVar, the data is saved in a python dictionary format. the key is the GRID identifier, for example: "grid.1001.0" (Australian National University), and the value is a list of redirected aliases strings. {"grid.1001.0": ["ANU", "ANU College", "ANU College of Arts and Social Sciences", "ANU College of Asia and the Pacific", "ANU Union", "ANUSA", "Asia Pacific Week",    "Australia National University", "Australian Forestry School", "the Australian National University", ...], "grid.1002.3": ...}
keywords: PubMed; MEDLINE; Digital Libraries; Bibliographic Databases; Institution Names; Author Affiliations; Institution Name Ambiguity; Authority files
published: 2021-08-27
 
The dataset shows all poison frogs (superfamily Dendrobatoidea) in private U.S. collections during 1990–2020. For each species and color morph, there is a date of arrival, the way it arrived in U.S. collections, and detailed notes related to its presence in the pet trade.
keywords: pet trade; amphibians; Dendrobatidae
published: 2025-06-06
 
The materials used to provide Continuing Medical Education on ticks and tick-borne diseases in Illinois on February 1, 2023 at Carle Hospital, along with the pre- and post-quiz and deidentified data of the quiz takers. Files: "Ticks and Tick-borne Diseases of Illinois_Final_w_speaker_notes.pptx": Presentation slides used for CME course, with notes to indicate verbal commentary "CME assessment_final.docx": Pre- and post-CME quiz questions and answers, annotated to indicate correct answers and reasoning for incorrect answers "CME_prequiz_data_for_sharing.csv": De-identified data from pre-CME quiz "CME_postquiz_data_for_sharing.csv": De-identified data from post-CME quiz, including demographics "DataCleaning_forSharing.R": R file used to clean the raw data and calculate the scores "ReadMe.txt":
keywords: tick-borne disease; CME
published: 2025-06-03
 
GIS data and geoprocessing tools associated with White and Lambert (2025) modeling paper that assesses the potential impact of development on the archaeological resources of Illinois.
keywords: development; archaeology; climate change; GIS
published: 2025-06-04
 
These datasets contain the complete output from a Monte Carlo simulation of the number of wild cervids to test for chronic wasting disease (CWD) depending on true prevalence. Five CSVs of the simulation results are provided, split due to limitations in file size. The R code used to run the simulation and process the data is included. The data to replicated Table 1 and the data used to compare the simulation results to the CWD surveillance efforts of the Illinois Department of Natural Resources (IDNR) are also provided.
keywords: chronic wasting disease; cwd; cervid; test; sample size; diagnostic testing; surveillance
published: 2025-06-03
 
This is a peptide imaging data obtained by mtarix assisted laser desoption ionization trapped ion mobility datasets from the central nervous sytem and select ganglion of aplysia Californica.
keywords: Neuropeptides, Iosmerization, D-amino acids, MALDI-TIMS
published: 2019-06-13
 
This lexicon is the expanded/enhanced version of the Moral Foundation Dictionary created by Graham and colleagues (Graham et al., 2013). Our Enhanced Morality Lexicon (EML) contains a list of 4,636 morality related words. This lexicon was used in the following paper - please cite this paper if you use this resource in your work. Rezapour, R., Shah, S., & Diesner, J. (2019). Enhancing the measurement of social effects by capturing morality. Proceedings of the 10th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA). Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Minneapolis, MN. In addition, please consider citing the original MFD paper: <a href="https://doi.org/10.1016/B978-0-12-407236-7.00002-4">Graham, J., Haidt, J., Koleva, S., Motyl, M., Iyer, R., Wojcik, S. P., & Ditto, P. H. (2013). Moral foundations theory: The pragmatic validity of moral pluralism. In Advances in experimental social psychology (Vol. 47, pp. 55-130)</a>.
keywords: lexicon; morality
published: 2025-02-14
 
This dataset includes the original data (including photographs as .jpg files and sound recordings as .wav files) and detailed descriptions of workflows for analyses of acoustic and morphometric data for the Neoaliturus tenellus (beet leafhopper) species complex. Files needed for different parts of the two analytical workflows are included in the "Acoustics.zip" and "PCA.zip" archives. The "Folder Structure.png" file contains a diagram of the folder structure of the two archives. Each archive contains a "ReadMe" file with instructions for repeating the analyses. File and folder names including the two-letter abbreviations TB, TD, TN and TP refer to four different putative species (operational taxonomic units, or OTUs, of the Neoaliturus tenellus complex.
keywords: Hemiptera; Cicadellidae; integrative taxonomy; courtship; morphology
published: 2025-04-23
 
These data files were used for phylogenomic analyses of Darnini and related Membracidae (Hemiptera: Auchenorrhyncha) in the referenced article by Gonzalez-Mozo et al. - The "mem_50p_alignment.fas" file contains the aligned, concatenated nucleotide sequence data for 51 species and 492 genetic loci included in the phylogenetic analyses ("N" indicates missing data and "-" indicates an alignment gap). - The file "Table1.rtf" lists the included species, country of origin and genbank accession number. Species newly sequenced for this study have a Sample ID with prefix "DAR"; previously sequenced species for which data were downloaded from genbank have "NCBI" indicated in the same column of the table. - The file "partition_def.txt" lists the 492 genetic loci included in the alignment with their exact positions indicated by the range of numbers given at the end of each line (e.g., locus "uce-1" occupies positions 1-280 in the alignment). - The substitution model file "mem_50p.model" contains information on the substitution models used in the partitioned maximum likelihood analysis, including the models used for different data partitions and parameter values, as output by the phylogenetic software IQ-TREE. - Individual tree files in Newick format (plain text) are provided for the phylogeny from concatenated analysis with the best likelihood score ("mem_50p_bestLikelihoodScore"), concatenated likelihood analysis with gene concordance factors ("mem_50p_gcf") and site concordance factors ("mem_50p_scf"). - The tree file from the ASTRAL analysis is "mem_50p_astral". - The zip archive entitled “IQ-TREE analysis results.zip” includes output from the maximum likelihood analysis of the concatenated nucleotide sequence data, including the following: (1) main output file “mem_50p.iqtree” summarizing model selection, partitioning schemes, likelihood scores, and run parameters; (2) “mem_50p.mldist” including pairwise ML distances between taxa; (3) “mem_50p.best_scheme.nex” with the best partitioning scheme identified by ModelFinder in NEXUS format and (4) “mem_50p.best_scheme” the RAxM-compatible version of the same file. - The “Ultrafast bootstrap results.zip” zip archive contains: (1) “mem_50p.ufboot” with the bootstrap replicate trees; (2) “mem_50p.contree” with the majority-rule consensus tree with support values; (3) “mem_50p.splits.nex”, with split support values across the replicates; (4) “mem_50p.log” is the log file. - The “gene_trees.zip” zip archive contains the individual gene trees as input for subsequent coalescent gene tree analysis in the phylogenetic program ASTRAL. - The file "DarniniAHE_Character Matrix.csv" contains the data for 6 morphological characters for which the ancestral states were reconstructed using the phylogenetic results from analysis of anchored-hybrid data (see article text for details). - The file "scriptACRDarnini.txt" contains the commands used to reconstruct ancestral morphological characters states using the corHMM 2.8 R package. See the Methods section of the article for more details.
keywords: Insecta; Hemiptera; anchored-hybrid enrichment; phylogeny; treehopper
published: 2025-06-03
 
This data comprises image files used in the analysis of Analysis of Nematode Ventral Nerve Cords Suggests Multiple Instances of Evolutionary Addition and Loss of Neurons by Han et al. (bioRxiv, 2025: doi: https://doi.org/10.1101/2025.03.20.644414). It is separated into two folders. The first comprise data using DAPI staining to quantify the number of VNC nuclei in diverse nematodes. The second includes dye-filling data of Mononchus aquaticus.
keywords: C. elegans; Mononchus; neuroanatomy; nematode nervous system; ventral nerve cord; secondary simplification
published: 2025-03-13
 
ALMA Band 4 and 7 observations of the dust continuum in the Class 0 protostellar system L1448 IRS3B. We include the selfcal script, imaging scripts, fits files, and the python scripts for the figures in the paper.
keywords: ALMA; Band 4; Band 6; polarization; L1448 IRS3B
published: 2025-05-29
 
These data support Ruess et al (2025) "Drought impacts to water footprints and virtual water transfers of counties of the United States", Water Resources Research, 61, e2024WR037715, https://doi.org/10.1029/2024WR037715. The dataset contains estimates for Virtual Water Content (VWC) and Virtual Water Trade (VWT) for nine unique combinations of three crop categories (cereal grains, produce, and animal feed) and three water sources (surface water withdrawals, groundwater withdrawals, and groundwater depletion) for the years 2012 and 2017 within the Continental United States. The VWC is calculated by dividing irrigation withdrawal estimates (m3) by the production (tons) at the county resolution. The VWT is calculated by multiplying the VWC by the estimated county level food flows (tons) from Karakoc et al. (2022). All VWC estimates are provided at the county resolution according to county GEOID and are given in units of m3/ton. All VWT estimates are given in pairs of origin and destination GEOID’s and provided in units of m3. When using, please cite as: Ruess, P.J., Hanley, J., and Konar, M. (2025) "Drought impacts to water footprints and virtual water transfers of counties of the United States", Water Resources Research, 61, e2024WR037715, doi: 10.1029/2024WR037715.
keywords: irrigation; water footprints; supply chains
published: 2025-05-28
 
This dataset captures ‘Hype’ and 'Diversity', including article-level (pmid) and author-level (auid) data within biomedical abstracts sourced from PubMed. The selection chosen is ‘journal articles’ written in English, published between 1991 and 2014, totaling 421,580 (merged_df). The classification of hype relies on the presence of specific candidate ‘hype words’ and their abstract location. Therefore, each article (PMID) might have multiple instances in the dataset due to the presence of multiple hype words in different abstract sentences. Diversity is classified for ethnicity, gender, academic age, and topical expertise for authors based on the Rao-Sterling Diversity index. File1: merged_auids.csv (Important columns defined) • AUID: a unique ID for each author • Genni: gender prediction • Ethnea: ethnicity prediction ################################################# File2: merged_df.csv (Important columns defined) - pmid: unique paper - auid: all unique auids (author-name unique identification) - year: Year of paper publication - no_authors: Author count - journal: Journal name - years: first year of publication for every author - Country-temporal: Country of affiliation for every author - h_index: Journal h-index - TimeNovelty: Paper Time novelty - nih_funded: Binary variable indicating funding for any author - prior_cites_mean: Mean of all authors’ prior citation rate - insti_impact: All unique institutions’ citation rate - mesh_vals: Top MeSH values for every author of that paper - hype_word: Candidate hype word, such as ‘novel' - hype_value: Propensity of hype based on the hype word, the sentence, and the abstract location - hype_percentile: Abstract relative position of hype word - relative_citation_ratio: RCR
keywords: Hype; Diversity: PubMed; Abstracts; Scientometrics; Biomedicine
published: 2025-05-21
 
Raw data of Auchenorrhyncha (Hemiptera) species presence and abundance from samples collected as part of Morgan Brown's M.S. thesis entitled "Investigating changes in Auchenorrhyncha (Hemiptera) communities in Illinois prairies over 25 years." Collection_Events_MBrown.pdf contains information that corresponds to each collection event code listed in the raw data files, including coordinates, date of collection, collection method, and name of collector. Each CSV file contains Auchenorrhyncha species presence and abundance data from each sampling area in Illinois: Route 45 Railroad Prairie, Richardson Wildlife Foundation, Mason County nature preserves, and Twelve Mile Prairie. Variables included in the CSV files include: Family: Taxonomic family to which each species belongs Subfamily: Taxonomic subfamily to which each species belongs Tribe: Taxonomic tribe to which each species belongs Species: Lowest taxonomic level to which individuals were identified The first row of column 5 to the end are collection event codes which correspond to each code listed in the PDF * New in V2: The CSV files originally uploaded in V1 contained outdated species names. V2 provides updated CSV files with the corrected names.
keywords: Biodiversity; Entomology; Conservation
published: 2025-05-21
 
___________________________________SUMMARY This dataset contains derivative data from concurrent fMRI and scalp EEG recordings used in: Mostame Parham, Wirsich Jonathan, Alderson Thomas H, Ridley Ben, Giraud Anne-Lise, Carmichael David W, Vulliemoz Serge, Guye Maxime, Lemieux Louis, Sadaghiani Sepideh (2024) A multiplex of connectome trajectories enables several connectivity patterns in parallel eLife 13:RP98777. doi: https://doi.org/10.7554/eLife.98777.3 ___________________________________RAW DATA The data has been originally published and described as part of other studies (Morillon et al., 2010; Sadaghiani et al., 2012). Briefly, 10 minutes of eyes-closed resting state were analyzed from 26 healthy subjects (average age = 24.39 years; range: 18-31 years; 8 females) with no history of psychiatric or neurological disorders. Informed consent was given by each participant and the study was approved by the local Research Ethics Committee (CPP Ile de France III). FMRI was acquired using a 3T Siemens Tim Trio scanner with a GE-EPI pulse sequence (TR = 2 s; TE = 50 ms; 40 slices; 300 volumes; field of view: 192×192; voxel size: 3×3×3 mm3). Structural T1-weighted scan were acquired using the MPRAGE pulse sequence (176 slices; field of view: 256×256; voxel size: 1×1×1 mm3). 62-channel scalp EEG (Easycap, with an additional EOG and an ECG channel) was recorded using an MR-compatible amplifier (BrainAmp MR, Brain Products) at 5Hz sampling rate. ___________________________________PREPROCESSING fMRI and EEG data were preprocessed with standard preprocessing steps as explained in detail elsewhere (Wirsich et al., 2020). In brief, fMRI underwent standard slice-time correction, spatial realignment (SPM12, http://www.fil.ion.ucl.ac.uk/spm/software/spm12). Structural T1-weighted images were processed using Freesurfer (recon-all, v6.0.0, https://surfer.nmr.mgh.harvard.edu/) in order to perform non-uniformity and intensity correction, skull stripping and gray/white matter segmentation. The cortex was parcellated into 68 regions of the Desikan-Kiliany atlas (Desikan et al., 2006). This atlas was chosen because —as an anatomical parcellation— avoids biases towards one or the other functional data modality. The T1 images of each subject and the Desikan-Killiany were co-registered to the fMRI images (FSL-FLIRT 6.0.2, https://fsl.fmrib.ox.ac.uk/fsl/fslwiki). We extracted signals of no interest such as the average signals of cerebrospinal fluid (CSF) and white matter from manually defined regions of interest (ROI, 5 mm sphere, Marsbar Toolbox 0.44, http://marsbar.sourceforge.net) and regressed out of the BOLD timeseries along with 6 rotation, translation motion parameters and global gray matter signal (Wirsich et al., 2017a). Then we bandpass-filtered the timeseries at 0.009–0.08 Hz. Average timeseries of each region was then used to calculate connectivity. EEG underwent gradient and cardio-ballistic artifact removal using Brain Vision Analyzer software (Allen et al., 1998, 2000) and was down-sampled to 250 Hz. EEG was projected into source space using the Tikhonov-regularized minimum norm in Brainstorm software (Baillet et al., 2001; Tadel et al., 2011). Source activity was then averaged to the 68 regions of the Desikan-Killiany atlas. Band-limited EEG signals in each canonical frequency band and every atlas region were then used to calculate frequency-specific connectome dynamics. Note that the MEG-ROI-nets toolbox in the OHBA Software Library (OSL; https://ohba-analysis.github.io/osl-docs/) was used to minimize source leakage in the band-limited source-localized EEG data (Colclough et al., 2015). ___________________________________FOLDER STRUCTURE The dataset includes five separate folders as described below: 1) EEGfMRI_dFC folder: connectome dynamics of scalp data This folder contains 26 single MATLAB (.mat) files for each subject. Inside each `.mat` is a structure with fields `A`, `B`, and `C`, corresponding to fMRI, amplitude-coupling, and phase-coupling connectome dynamics, respectively. The fMRI data are 3-dimensional (ROI × ROI × timepoints). The EEG data are stored in a 1×5 cell array (Delta, Theta, Alpha, Beta, Gamma), each cell containing a 3-D ROI × ROI × timepoints matrix. 2) EEGfMRI_dFC_SourceOrtho foldeR: connectome dynamics of source-orthogonalized scalp data Same format as above, except that EEG connectome dynamics are derived from source-orthogonalized signals. The MEG-ROI-nets toolbox in the OHBA Software Library (OSL; https://ohba-analysis.github.io/osl-docs/) was used to minimize source leakage in the band-limited, source-localized EEG data (Colclough et al., 2015). 3-5) Cross-modal Recurrence Plot (CRP) data Each subject has an Excel file with five sheets (Delta through Gamma), corresponding to the five frequency bands. Each sheet contains a 2-D CRP matrix (rows = fMRI timepoints, columns = band-limited EEG timepoints). - Scalp EEG–fMRI CRPs (CRP_EEGfMRI and CRP_EEGfMRI_SourceOrtho folder): two versions (with and without source-orthogonalization), each has 52 Excel files, including amplitude- and phase-coupling CRPs. - Intracranial EEG–fMRI CRPs (CRP_iEEGfMRI folder): one version, 27 Excel files, containing three cases: amplitude coupling, HRF-convolved amplitude coupling, and phase coupling.
keywords: Connectome; fMRI-EEG; Intracranial; Multiplex
published: 2018-04-19
 
MapAffil 2016 dataset -- PubMed author affiliations mapped to cities and their geocodes worldwide. Prepared by Vetle Torvik 2018-04-05 The dataset comes as a single tab-delimited Latin-1 encoded file (only the City column uses non-ASCII characters), and should be about 3.5GB uncompressed. &bull; How was the dataset created? The dataset is based on a snapshot of PubMed (which includes Medline and PubMed-not-Medline records) taken in the first week of October, 2016. Check here for information to get PubMed/MEDLINE, and NLMs data <a href ="https://www.nlm.nih.gov/databases/download/pubmed_medline.html">Terms and Conditions</a> &bull; Affiliations are linked to a particular author on a particular article. Prior to 2014, NLM recorded the affiliation of the first author only. However, MapAffil 2016 covers some PubMed records lacking affiliations that were harvested elsewhere, from PMC (e.g., PMID 22427989), NIH grants (e.g., 1838378), and Microsoft Academic Graph and ADS (e.g. 5833220). &bull; Affiliations are pre-processed (e.g., transliterated into ASCII from UTF-8 and html) so they may differ (sometimes a lot; see PMID 27487542) from PubMed records. &bull; All affiliation strings where processed using the MapAffil procedure, to identify and disambiguate the most specific place-name, as described in: <i>Torvik VI. MapAffil: A bibliographic tool for mapping author affiliation strings to cities and their geocodes worldwide. D-Lib Magazine 2015; 21 (11/12). 10p</i> &bull; Look for <a href="https://doi.org/10.1186/s41182-017-0073-6">Fig. 4</a> in the following article for coverage statistics over time: <i>Palmblad M, Torvik VI. Spatiotemporal analysis of tropical disease research combining Europe PMC and affiliation mapping web services. Tropical medicine and health. 2017 Dec;45(1):33.</i> Expect to see big upticks in coverage of PMIDs around 1988 and for non-first authors in 2014. &bull; The code and back-end data is periodically updated and made available for query by PMID at <a href="http://abel.ischool.illinois.edu/">Torvik Research Group</a> &bull; What is the format of the dataset? The dataset contains 37,406,692 rows. Each row (line) in the file has a unique PMID and author postition (e.g., 10786286_3 is the third author name on PMID 10786286), and the following thirteen columns, tab-delimited. All columns are ASCII, except city which contains Latin-1. 1. PMID: positive non-zero integer; int(10) unsigned 2. au_order: positive non-zero integer; smallint(4) 3. lastname: varchar(80) 4. firstname: varchar(80); NLM started including these in 2002 but many have been harvested from outside PubMed 5. year of publication: 6. type: EDU, HOS, EDU-HOS, ORG, COM, GOV, MIL, UNK 7. city: varchar(200); typically 'city, state, country' but could inlude further subvisions; unresolved ambiguities are concatenated by '|' 8. state: Australia, Canada and USA (which includes territories like PR, GU, AS, and post-codes like AE and AA) 9. country 10. journal 11. lat: at most 3 decimals (only available when city is not a country or state) 12. lon: at most 3 decimals (only available when city is not a country or state) 13. fips: varchar(5); for USA only retrieved by lat-lon query to https://geo.fcc.gov/api/census/block/find
keywords: PubMed, MEDLINE, Digital Libraries, Bibliographic Databases; Author Affiliations; Geographic Indexing; Place Name Ambiguity; Geoparsing; Geocoding; Toponym Extraction; Toponym Resolution
published: 2025-04-24
 
Includes two files (.csv) behind all analyses and results in the paper published with the same title. <b>1) 'sites.species.counts'</b> is the raw 2018-2022 data from Angella Moorehouse (Illinois Nature Preserves Commission) including her 456 identified pollinator species and her raw counts per site (there may be a few errors of identification or naming, and there will always be name changes over time). Headers in columns F through Q correspond to the remnant-site labels in Figure 1 and Table 1 of the paper. Columns R to AB are the “nonremnant” sites, which have not been uniquely labelled since the specific sites aren't referenced anywhere in the manuscript. <b>2) 'C.scores'</b> has the 265 species assigned empirical C values (empirical.C) along with the four sets of expert C values and their confidence ranks (low, medium, high), and the Illinois/Indiana conservation ranks (S-ranks), following the methods described in the paper. Other headers in these files: - taxa.code: four-letter abbreviation for genus and specific name - genus: genus name - species: specific epithet - common.name: English name - group: general pollinator taxa group - empirical.C: empirically estimated conservatism score - expert#.C: conservatism score assigned by each of four experts - expert#.conf: expert's confidence in their conservatism score Blank cells in the site-species abundance matrix indicates species absence (or non-detection) Blank cells in C.scores.csv indicates missing S-ranks and unassigned C-scores (with associated missing confidence ranks) where experts lacked knowledge or confidence
keywords: ecological conservatism; indicator values; pollinator conservation; prairie ecosystems; protected areas; remnant communities
published: 2025-05-15
 
Coagulation testing (VCM Vet™) was performed on 57 horses with acute abdominal pain at admission to the University of Illinois Veterinary Teaching Hospital. Additional clinical data were recorded retrospectively. ROC analysis was performed to determine the optimal number of abnormal coagulation parameters for coagulopathy diagnosis based on survival. General linear regression (GLM) and random forest (RF) classification models were developed to predict short-term survival. A training cohort of 40 horses was used for model development, and model performance was determined using the remaining 17 horses.
keywords: horse; coagulation; colic; abdominal pain; survival; machine learning; blood clotting; viscoelastic testing
published: 2025-05-14
 
1228 egg hyperspectral images, the wavelength from 400 nm to 900 nm.
published: 2025-04-14
 
This dataset builds on an existing dataset which captures artists’ demographics who are represented by top tier galleries in the 2016–2017 New York art season (Case-Leal, 2017, https://web.archive.org/web/20170617002654/http://www.havenforthedispossessed.org/) with a census of reviews and catalogs about those exhibitions to assess proportionality of media coverage across race and gender. The readme file explains variables, collection, relationship between the datasets, and an example of how the Case-Leal dataset was transformed. The ArticleDataset.csv provides all articles with citation information as well as artist, artistic identity characteristic, and gallery. The ExhibitionCatalog.csv provides exhibition catalog citation information for each identified artist. New in this V2: - In V1, ArticleDataset.csv had both data on the articles published and all of the exhibitions, which was misleading. In V2 I separated out so that ArticleDataset only has articles, and AllSoloShows has all shows, including those that had no articles written about them in the publications reviewed. - Upon closer review I noticed approximately 10 out of the 133 articles had incorrect information in variable "Publication content type: art or general" and/or "Publication Carrier type: web or library?" so I updated V2. - Upon closer review I noticed there was 3 instances of artists who had two solo shows apiece: in addition to Meleko Mokgosi and Carrie Mae Weems which I had already noted in V1, there was also Roxy Paine. I had not noticed this because only one of two of Paine's shows had been written about. This brings the total number of shows to 117 (which was 116 in V1). -Upon closer review I removed one row from ExhibitionCatalogs.csv, as the item i had listed did not meet the parameters.
keywords: diversity and inclusion; diversity audit; contemporary art; art exhibitions; art exhibition reviews; exhibition catalogs; magazines; newspapers; demographics
published: 2025-05-07
 
Data collected at 71 study sites from 2023 to 2024 for Reves, Olivia P. (2025): Using Environmental DNA Metabarcoding to Inform Biodiversity Conservation in Agricultural Landscapes. Master's thesis, University of Illinois Urbana-Champaign. Files include study site information, taxa by site matrices for vertebrates from environmental DNA metabarcoding using multiple mitochondrial DNA primers (COI, 12S), and bird species audibly detected by a phone app at study sites.
keywords: agricultural conservation; biodiversity; eDNA; environmental DNA; Illinois; metabarcoding; riparian buffers; stream flow; vertebrates
published: 2025-05-07
 
Data was generated from a field and lab study to determine the influence of thermal variation in livewells on largemouth bass recovery from angling tournaments. The field study data includes temperature data from inside angler livewells during a tournament day. This was collected across four tournaments. The laboratory data was generated from a series of experiments simulating tournament practices and trends in thermal variation within livewells observed in the field proponent. Following simulation, blood was collected to determine levels of analytes consistent with primary, secondary, and tertiary stress response in largemouth bass. Reflex impairment was also included.
published: 2025-05-02
 
This dataset contains the first-generation (1st-gen) and second-generation (2nd-gen) citation relationships to a set of focal papers. The 1st-gen citation relationships are the instances of one paper citing a focal paper. These citing papers are called "1st-gen citations." The 2nd-gen citation relationships are the instances that a paper cites a 1st-gen citation. The citing paper in the 2nd-gen citation relationship is a second-generation (2nd-gen) citation. When a 2nd-gen citation is also a 1st-gen citation, it creates a transitive closure with the focal paper. Each focal paper has an abbreviation, which can be found below. The 1st-gen and 2nd-gen citation relationships were extracted from the Curated Open Citation Dataset (Korobskiy & Chacko, 2023), which is derived from a copy of COCI, the OpenCitations Index of Crossref Open DOI-to-DOI Citations, downloaded on May 6, 2023. Scripts used to collect this dataset can be found at https://github.com/yuanxiesa/transitive_closure_study. Each focal paper currently has two files: {abbreviation}_1st.csv contains the 1st-gen citation relationships; {abbreviation}_2nd.csv contains the 2nd-gen citation relationships. Focal paper abbreviation == "louvain": Blondel, V. D., Guillaume, J.-L., Lambiotte, R., & Lefebvre, E. (2008). Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment, 2008(10), P10008. https://doi.org/10.1088/1742-5468/2008/10/P10008 Focal paper abbreviation == "lp": Raghavan, U. N., Albert, R., & Kumara, S. (2007). Near linear time algorithm to detect community structures in large-scale networks. Physical Review E, 76(3), 036106. https://doi.org/10.1103/PhysRevE.76.036106 Focal paper abbreviation == "gn": Newman, M. E. J., & Girvan, M. (2004). Finding and evaluating community structure in networks. Physical Review E, 69(2), 026113. https://doi.org/10.1103/PhysRevE.69.026113
keywords: transitive closure; citations; community detection algorithms; OpenCitations; method papers
published: 2021-05-17
 
Please cite as: Wuebbles, D., J. Angel, K. Petersen, and A.M. Lemke, (Eds.), 2021: An Assessment of the Impacts of Climate Change in Illinois. The Nature Conservancy, Illinois, USA. https://doi.org/10.13012/B2IDB-1260194_V1 Climate change is a major environmental challenge that is likely to affect many aspects of life in Illinois, ranging from human and environmental health to the economy. Illinois is already experiencing impacts from the changing climate and, as climate change progresses and temperatures continue to rise, these impacts are expected to increase over time. This assessment takes an in-depth look at how the climate is changing now in Illinois, and how it is projected to change in the future, to provide greater clarity on how climate change could affect urban and rural communities in the state. Beyond providing an overview of anticipated climate changes, the report explores predicted effects on hydrology, agriculture, human health, and native ecosystems.
keywords: Climate change; Illinois; Public health; Agriculture; Environment; Water; Hydrology; Ecosystems