Illinois Data Bank Dataset Search Results
Results
published:
2023-12-06
Starbuck, Clarissa; DeSchepper, Logan; Hoggatt, Meredith; O'Keefe, Joy
(2023)
This dataset accompanies an article published in the journal Bioacoustics: "Tradeoffs in sound quality and cost for passive acoustic devices", https://doi.org/10.1080/09524622.2023.2290715. The dataset contains measurements for acoustic call files for free-flying bats simultaneously recorded on both Audiomoth and Anabat Swift passive acoustic recording devices in a conservation area in northeastern Missouri, USA. We paired calls from the two devices and compared indicators of recording quality measured in a proprietary program (Bat Call Identification Software). The dataset also contains a file enumerating the proportions of calls classified as low frequency, mid frequency, or Myotis (three phonic groups) for each type of recording device. The data were used to compare the quality and sensitivity of the two devices. The scripts for modeling procedures and figures are included in the dataset.
keywords:
Bats; echolocation; passive acoustic monitoring; sensors
published:
2023-12-08
Preza Fontes, Giovani; Greer, Kristin; Pittelkow, Cameron
(2023)
A two-year field study was conducted to test the hypothesis that biochar application increases inorganic soil N availability during maize growth, leading to higher grain yields and N recovery efficiency while reducing the risk of N leaching following harvest. Four N fertilizer rates (0, 90, 179, and 269 kg ha-1 as urea ammonium nitrate solution) were applied with or without biochar (10 Mg ha-1) before maize planting each year. This dataset contains selected summary statistics (average and standard deviation) on soil and plant measurements. This file package also includes a readme.txt file that describes the data in detail, including attribute descriptions.
keywords:
biochar; nitrogen fertilizer; nitrogen use efficiency; corn yield, soil inorganic nitrogen; nitrate leaching
published:
2024-05-13
Hohoff, Tara; Rogness, Brittany; Davis, Mark
(2024)
Survey questions and data collected from Illinois land managers on practices and knowledge relating to impacts to wildlife. 0s indicated non-selection, 1s indicate selection of answer.
keywords:
forestry management; online survey; wildlife
published:
2023-03-24
This datasets provide basis of our analysis in the paper - Potential Impacts on Ozone and Climate from a Proposed Fleet of Supersonic Aircraft. All datasets here can be categorized into emission data and model output data (WACCM). All the model simulations (background and perturbation) were run to steady-state and only the datasets used in analysis are archived here.
keywords:
NetCDF; Supersonic aircraft; Stratospheric ozone; Climate
published:
2023-07-06
Schneider, Amy; Suski, Cory; Esbaugh, Andrew
(2023)
published:
2024-02-25
Coshic, Kush; Maffeo, Christopher; Winogradoff, David; Aksimentiev, Aleksei
(2024)
Simulation trajectory data and scripts for Nature manuscript "The structure and physical properties of a packaged bacteriophage particle" that reports the all-atom structure of a complete HK97 virion, including its entire 39,732 base pair genome, obtained through multi-resolution simulations.
keywords:
Virus capsid; Bacteriophage packaging; Multiresolution simulations; all-atom MD simulation
published:
2024-07-01
Chen, Henry; Ang, Claire; Crowder, Molly; Brieher, William; Blanke, Steven
(2024)
This page contains the data for the publication "Revisiting bacterial cytolethal distending toxin structure and function" published in Frontiers in Cellular and Infection Microbiology in 2023.
keywords:
AB toxin; cytolethal distending toxin; protein-protein interactions; Campylobacter jejuni; DNA damage; holotoxin structure
published:
2025-06-05
Guan, Yingjun; Fang, Liri
(2025)
There are two files in this dataset.
File1: AffiNorm
AffiNorm contains 1,001 rows, including one header row, randomly sampled from MapAffil 2018 Dataset ([**https://doi.org/10.13012/B2IDB-2556310_V1**](https://databank.illinois.edu/datasets/IDB-2556310)). Each row in the file corresponds to a particular author on a particular PubMed record, and contains the following 26 columns, comma-delimited. All columns are ASCII, except city which contains Latin-1.
COLUMN DESCRIPTION
1. PMID: the PubMed identifier. int.
2. ORDER: the position of the author. int.
3. YEAR - The year of publication. int(4), eg: 1975.
4. affiliation - affiliation string of the author. eg: Department of Pathology, University of Chicago, Illinois 60637.
5. annotation_type: the number of institutions annotated, denoted by S, M, O, or Z, where "S" (single) indicates 1 institution was annotated; "M" (Multiple) indicates more than one institutions were annotated; "O" (Out of Vocabulary or None) indicates no institution was annotated, but an institution was apparently mentioned; "Z" indicates no institution was mentioned.
6. Institution: the standard name(s) of the annotated institution(s), according to ROR. if "S" (single institution), it is saved as a string, eg: University of Chicago; if "M", it is saved as a string that looks like a python list, eg: ['Public Health Laboratory Service'; 'Centre for Applied Microbiology and Research']; if "O" or "Z", then blank.
7. inst_type: the type of institution, according to ROR. the potential values are: education, funder, healthcare, company, archive, nonprofit, government, facility, other. An institution may have more than one type, eg: ['Education', 'Funder']
8. type_edu: TRUE if the inst_type contains "Education"; FALSE otherwise.
9. RORid: ROR identifier(s), eg: https://ror.org/05hs6h993. when multiple, the order corresponds to institution (column 6)
10. RORid_label. the standard name(s) of the annotated institution(s) according to ROR.same as institution (column 6)
11. GRIDid: GRID identifier(s). eg: grid.170205.1
12. GRIDid_label: the standard name(s) of the annotated institution(s) according to GRID. eg: University of Chicago.
13. WikiDataid: WikiData identifier(s). eg: Q131252
14. WikiDataid_label: the standard name(s) of the annotated institution(s) according to WikiData. eg: University of Chicago
15. synonyms: a comma separated list of variant names from InsVar (file 2) . format of string. eg: University of Chicago, Chicago University, U of C, UChicago, uchicago.edu, U Chicago, ...
16. MapAffil-grid: GRID from the MapAffil 2018 Dataset.
17. MapAffil-grid_label: The standard name of institution from MapAffil 2018 Dataset.
18. judge_mapA: TRUE if GRIDid (column 11) contains MapAffil-grid (column 16); FALSE otherwise.
19. MapAffiltemporal-grid: GRID from the temporal version of MapAffil, http://abel.ischool.illinois.edu/data/MapAffilTempo2018.tsv.gz
20. MapAffiltemporal-grid_label: The standard name of institution from MapAffilTemporal 2018 Dataset.
21. judge_mapT: TRUE if GRIDid (column 11) contains MapAffiltemporal-grid (column 19); FALSE otherwise.
22. RORapi_query_id: ROR from ROR api tool (query endpoint)
23. RORapi_query_id_label: The standard name of institution from ROR api tool (query endpoint). format in string.
24. judge_rorapi_affiliation: TRUE if RORid (column 9) contains RORapi_query_id (column 22); FALSE otherwise.
25. rorapi_affiliation_id: ROR from ROR api tool (affiliation endpoint).
26. judge_rorapi_affiliation: TRUE if RORid (column 9) contains RORapi_affiliation (column 25); FALSE otherwise.
File 2: insVar.json
InsVar is a supplementary dataset for AffiNorm, which includes the institution ID and its redirected aliases from wikidata. The institution ID list is from GRID, the redirected aliases are from wiki api, for example: https://en.wikipedia.org/wiki/Special:WhatLinksHere?target=University+of+Illinois+Urbana-Champaign&namespace=&hidetrans=1&hidelinks=1&limit=100
In InsVar, the data is saved in a python dictionary format. the key is the GRID identifier, for example: "grid.1001.0" (Australian National University), and the value is a list of redirected aliases strings.
{"grid.1001.0": ["ANU", "ANU College", "ANU College of Arts and Social Sciences", "ANU College of Asia and the Pacific", "ANU Union", "ANUSA", "Asia Pacific Week", "Australia National University", "Australian Forestry School", "the Australian National University", ...], "grid.1002.3": ...}
keywords:
PubMed; MEDLINE; Digital Libraries; Bibliographic Databases; Institution Names; Author Affiliations; Institution Name Ambiguity; Authority files
published:
2024-07-31
LaBonte, Nicholas R.; Zerpa-Catanho, Dessiree P.; Liu, Siyao; Xiao, Liang; Dong, Hongxu; Clark, Lindsay V.; Sacks, Erik J.
(2024)
This dataset contains all data and supplementary materials from "Improving precision and accuracy of genetic mapping with genotyping-by-sequencing data in outcrossing species". An Excel file a list of all QTLs and linkage group length (in cM) obtained with two different SNP-calling methods (Tassel-Uneak and Tassel-GBS), genetic map-construction method (linkage-only and reference order-corrected) and depth filters (12x, 20x, 30x and 40x) for genetic mapping of 18 biomass yield traits in a biparental Miscanthus sinensis population using RAD-Seq SNPs is provided as "Supplementary file 1". A Perl script with the code for filtering VCF and HapMap-formatted data files is provided as “Supplementary file 2”. Phenotype data used for QTL mapping is provided as “Supplementary File 3”. A Perl script with the code for the simulation study is provided as “Supplementary file 4”.
keywords:
HapMapParser; GenotypingSimulator
published:
2025-04-25
Tassitano, Rafael; Chakraborty, Shreyonti
(2025)
This is an Excel file containing data about the physical environments of four Brazilian schools and the average daily minutes/day of physical activity and sedentary behavior exhibited by schoolchildren during school hours.
The Following Key describes the basic variables:
Subject IDs and Characteristics
Subject_ID: ID of Subject
total_days: Total number of days subject participated in experiment
Gender : Gender of subject
Age: Age of subject
School IDs and Characteristics
ID_School = ID of School
school1 = 1 if ID_School = 1, else = 0
school2 = 1 if ID_School = 2, else = 0
school3 = 1 if ID_School = 3, else = 0
school4 = 1 if ID_School = 4, else = 0
TotalSiteArea: Total Site Area on School Campus
PatioArea: Area of Patio(s)
CourtyardArea: Area of Courtyard(s)
TotalOpenArea: Total Area of Open Spaces on Campus
Class: Number of Sections in the School
Population: Total Number of Students Enrolled in the School
keywords:
school environment; physical activity
published:
2025-09-10
Lu, Yi; Mirts, Evan; Petrik, Igor D.; Hosseinzadeh, Parisa; Nilges, Mark J.
(2025)
Enzymatic reduction of oxyanions such as sulfite (SO32−) requires the delivery of multiple electrons and protons, a feat accomplished by cofactors tailored for catalysis and electron transport. Replicating this strategy in protein scaffolds may expand the range of enzymes that can be designed de novo. Mirts et al. selected a scaffold protein containing a natural heme cofactor and then engineered a cavity suitable for binding a second cofactor—an iron-sulfur cluster (see the Perspective by Lancaster). The resulting designed enzyme was optimized through rational mutation into a catalyst with spectral characteristics and activity similar to that of natural sulfite reductases.
keywords:
Conversion;Catalysis
published:
2025-11-06
Sweedler, Jonathan; Rosado Rosa, Joenisse M.
(2025)
SCiLS MSI data files, images used in the figures and table contents for the tables found in the manuscript. The figures are labeled by figure and by their title on each figure set, including those found in the Supplementary Information. The tables are in an MS Excel sheet with the corresponding contents. The tables list the metabolites found in the images. To reduce the number of images in the manuscript, the tables complete the metabolite information not observed in the images. The images can be found using the SCiLS data files. A software license is needed to open these files. The SCiLS data files contains the processed MSI data for all obtained images. All files in the corresponding SCiLS data file must be present to open the individual data file. The feature list used for MSI analysis should be saved on the attached bookmark inside the SCiLS file so it should be available once the file is opened. SCiLS files can only be opened with the Bruker SCiLS software. If using an outdated version (before Version 13.01.17218), the files may not open or show poor quality.
keywords:
Tendrils; Pyocyanin; Quinolones; Spatiochemical; Metabolomics
published:
2025-06-16
Sarkar, Adwitiya; Looney, Leslie
(2025)
Data for the publication of Magnetic Fields in the Pillars of Creation (Sarkar et al.). Contains the fits files and python scripts.
keywords:
HAWC+; SOFIA; Pillars of Creation; M16; Eagle Nebula; Dust Polarization
published:
2025-10-21
Jia, Yuyao; Maitra, Shraddha; Singh, Vijay
(2025)
Bioenergy crops have potential for being a sustainable and renewable feedstock for biofuels and various value-added bioproducts. The study utilizes recently developed transgenic sugarcane (“oilcane”) bagasse for chemical-free coproduction of high-value bioproducts, i.e., furfurals, HMF, acetic acid, cellulosic sugars, and vegetative lipids. Hydrothermal pretreatment was optimized at 210 °C for 5 min to coproduce 6.91%, 2.67%, 5.07%, 2.42% and 37.82% (w/w) furfurals, HMF, acetic acid, vegetative lipids, and cellulosic sugars, respectively from lignocellulosic biomass. Additionally, nanofiltration system in-series was successfully established to recover sugars, furfurals, HMF, and acetic acid from the pretreatment liquor. 1st nanofiltration with Duracid NF membrane rejected ∼99% sugars. Concentrated sugars with significantly reduced inhibitory products were obtained in retentate for fermentation. 2nd nanofiltration with NF90 membrane used permeate of 1st nanofiltration as feed and rejected ∼ 86% furfurals. The work demonstrates the feasibility of coproducing and recovering multiple biochemicals from lignocellulosic biomass.
keywords:
Conversion;Biomass Analytics;Hydrolysate;Metabolomics
published:
2022-10-22
Madhavan, Vidya; Aishwarya, Anuva
(2022)
This dataset consists of all the files that are part of the manuscript titled "Evidence for a robust sign-changing s-wave order parameter in monolayer films of superconducting Fe(Se,Te)/Bi2Te3". For detailed information on the individual files refer to the readme file.
keywords:
thin film; mbe; topology; superconductivity; topological insulator; stm; spectroscopy; qpi
published:
2023-10-16
Rasoarimanana, Tantely; Edmonds, Devin; Marquis, Olivier
(2023)
This dataset provides microhabitat and environmental variables collected in the habitat of the poison frog Mantella baroni from 155 1-meter square quadrats in Vohimana Reserve along forest valleys, on slopes, and on ridgelines. We also provide data from photographic capture-recapture surveys used for estimating abundance.
keywords:
occupancy; abundance; amphibian; Madagascar; microhabitat; capture-recapture
published:
2025-08-28
Purba, Denissa Sari Darmawi; Pei, Xingrui; Kontou, Eleftheria
(2025)
This dataset contains both processed and raw data that were leveraged to conduct analysis presented fully in the report "Community Vulnerability Assessment for Electric Vehicle Travelers Responsive to Extreme Flooding" and partially in the under review paper "Vulnerability Assessment of Electric Vehicles and their Charging Station Network during Evacuations".
keywords:
electric vehicles; vulnerability assessment; flooding events; evacuation; charging infrastructure
published:
2024-08-29
Li, Shuai; Montes, Christopher; Aspray, Elise; Ainsworth, Elizabeth
(2024)
Over the past 15 years, soybean seed yield response to season-long elevated O3 concentrations [O3] and to year-to-year weather conditions was studied using free-air O3 concentration enrichment (O3-FACE) in the field at the SoyFACE facility in Central Illinois. Elevated [O3] significantly reduced seed yield across cultivars and years. However, our results quantitatively demonstrate that weather conditions, including soil water availability and air temperature, did not alter yield sensitivity to elevated [O3] in soybean.
keywords:
drought, elevated O3, heat, O3-FACE, soybean, yield
published:
2021-06-08
Todd, Jones; Michael, Ward
(2021)
Dataset associated with Jones and Ward JAE-2020-0031.R1 submission: Pre-to post-fledging carryover effects and the adaptive significance of variation in wing development for juvenile songbirds. Excel CSV files with data used in analyses and file with descriptions of each column. The flight ability variable in this dataset was derived from fledgling drop tests, examples of which can be found in the related dataset: Jones, Todd M.; Benson, Thomas J.; Ward, Michael P. (2019): Flight Ability of Juvenile Songbirds at Fledgling: Examples of Fledgling Drop Tests. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2044905_V1.
keywords:
fledgling; wing development; life history; adaptive significance; post-fledging; songbirds
published:
2021-10-15
Perez, Sierra; Dalling, James; Fraterrigo, Jennifer
(2021)
Information on the location, dimensions, time of treefall or death, decay state, wood nutrient, wood pH and wood density data, and soil moisture, slope, distance from forest edge and soil nutrient data associated with the publication "Interspecific wood trait variation predicts decreased carbon residence time in changing forests" authored by Sierra Perez, Jennifer Fraterrigo, and James Dalling.
** <b>Note:</b> Blank cells indicate that no data were collected.
keywords:
wood decay; carbon residence time; coarse woody debris; decomposition, temperate forests
published:
2022-12-05
Ng, Yee Man Margaret ; Taneja, Harsh
(2022)
These are similarity matrices of countries based on dfferent modalities of web use. Alexa website traffic, trending vidoes on Youtube and Twitter trends. Each matrix is a month of data aggregated
keywords:
Global Internet Use
published:
2017-03-02
This data was collected between 2004 and 2010 at White River National Wildlife Refuge (WRNWR) and Saint Francis National Forest (SF). It was collected as part of two master’s and one PhD project at Arkansas State University USA studying Swainson’s Warbler habitat use, survival, and body condition.
keywords:
Swainson’s Warbler; Limnothlypis swainsonii; flooding; natural disturbance; apparent survival; body condition
published:
2023-05-08
Dataset for Food availability influences angling vulnerability in muskellunge
published:
2024-05-13
Gopalakrishnappa, Chandana; Li, Zeqian; Kuehn, Seppe
(2024)
Supplemental data for the paper titled 'Environmental modulators of algae-bacteria interactions at scale'. Each of the excel workbooks corresponding to datasets 1, 2, and 3 contain a README sheet explaining the reported data. Dataset 4 comprising microscopy data contains a README text file describing the image files.
keywords:
Algae-bacteria interactions; high-throughput; microfluidic-droplet platform
published:
2024-07-29
Caetano Machado Lopes, Lorran; Chacko, George
(2024)
This dataset consists of a citation graph. It was constructed by downloading and parsing the Works section of the Open Alex catalog of the global research system. Open Alex (see citation below) contains detailed information about scholarly research, including articles, authors, journals, institutions, and their relationships. The data were downloaded on 2024-07-15.
The dataset comprises two compressed (.xz) files.
1) filename: openalexID_integer_id_hasDOI.parquet.xz. The tabular data within contains three columns: openalex_id, integer_id, and hasDOI. Each row represents a record with the following data types:
• openalex_id: A unique identifier from the Open Alex catalog.
• integer_id: An integer representing the new identifier (assigned by the authors)
• hasDOI: An integer (0 or 1) indicating whether the record has a DOI (0 for no, 1 for yes).
2) filename: citation_table.tsv.xz
This edgelist of citations has two columns (no header) of integer values that represent citing and cited integer_id, respectively.
Summary Features
• Total Nodes (Documents): 256,997,006
• Total Edges (citations): 2,148,871,058
• Documents with DOIs: 163,495,446
• Edges between documents with DOIs: 1,936,722,541 [corrected to 2,148,788,148 edges Nov 13, 2025]
• Count of unique nodes in edgelist 111,453,719 [updated Nov 13, 2025]
Note: Nov 13, 2025. An improved curation process will be applied to a future version of this dataset
Note: Nov 13, 2025.
The code used to generate these files can be found here: https://github.com/illinois-or-research-analytics/lorran_openalex/
keywords:
citation networks; Open Alex