Illinois Data Bank Dataset Search Results
published:
2018-08-06
Hoang, Linh; Cao, Linh; Guan, Yingjun; Cheng, Yi-Yun; Schneider, Jodi
(2018)
This annotation study compared RobotReviewer's data extraction to that of three novice data extractors, using six included articles synthesized in one Cochrane review: Bailey E, Worthington HV, van Wijk A, Yates JM, Coulthard P, Afzal Z. Ibuprofen and/or paracetamol (acetaminophen) for pain relief after surgical removal of lower wisdom teeth. Cochrane Database Syst Rev. 2013; CD004624; doi:10.1002/14651858.CD004624.pub2. The goal was to assess the relative advantage of RobotReviewer's data extraction with respect to quality.
keywords:
RobotReviewer; annotation; information extraction; data extraction; systematic review automation; systematic reviewing;
published:
2019-03-06
Anderson, Nicholas L.; Harmon-Threatt, Alexandra N.
(2019)
Chronic contact exposure to realistic soil concentrations (0, 7.5, 15, and 100 ppb) of the neonicotinoid pesticide imidacloprid had species- and sex-specific effects on bee adult longevity, immature development speed, and mass. This dataset contains a life table tracking the development, mass, and deaths of a single cohort of Osmia lignaria and Megachile rotundata over the course of two summers. Other data files include files created for multi-event survival analysis to analyze the effect on development speed. Detected effects included: decreased adult longevity for female O. lignaria at the highest concentration, a trend for a hormetic effect on female M. rotundata development speed and mass (longest development time and greatest mass in the 15 ppb treatment), and decreased adult longevity and increased development speed at high imidacloprid concentrations as well as a hormetic effect on mass (lowest in the 15 ppb treatment) on male M. rotundata.
keywords:
neonicotinoid; imidacloprid; bee; habitat restoration;
published:
2021-02-24
Bieri, Carolina A.; Dominguez, Francina
(2021)
This dataset contains model output from the Community Earth System Model, Version 2 (CESM2; Danabasoglu et al. 2020). These data were used for analysis in Impacts of Large-Scale Soil Moisture Anomalies in Southeastern South America, published in the Journal of Hydrometeorology (DOI: 10.1175/JHM-D-20-0116.1). See this publication for details of the model simulations that created these data.
Four NetCDF (.nc) files are included in this dataset. Two files correspond to the control simulation (FHIST_SP_control) and two files correspond to a simulation with a dry soil moisture anomaly imposed in southeastern South America (FHIST_SP_dry; see the publication mentioned in the preceding paragraph for details on the spatial extent of the imposed anomaly). For each simulation, one file corresponds to output from the atmospheric model (file names with "cam") of CESM2 and the other to the land model (file names with "clm2"). These files are raw CESM output concatenated into a single file for each simulation.
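As a rough illustration of the naming scheme described above, the following Python sketch sorts file names by simulation and model component. The example file names are hypothetical; only the substrings FHIST_SP_control, FHIST_SP_dry, "cam", and "clm2" are taken from the description.

```python
def describe_output_file(filename: str) -> dict:
    """Classify a file name by simulation and CESM2 component.

    Assumption: the simulation name and the component tag ("cam" or "clm2")
    appear somewhere in the file name, as the dataset description suggests.
    """
    name = filename.lower()

    if "fhist_sp_dry" in name:
        simulation = "dry soil moisture anomaly"
    elif "fhist_sp_control" in name:
        simulation = "control"
    else:
        simulation = "unknown"

    if "clm2" in name:
        component = "land model"
    elif "cam" in name:
        component = "atmospheric model"
    else:
        component = "unknown"

    return {"simulation": simulation, "component": component}


# Hypothetical file names, used only to exercise the function.
for example in ["FHIST_SP_control.cam.nc", "FHIST_SP_dry.clm2.nc"]:
    print(example, describe_output_file(example))
```

The actual file names in the dataset may differ, so the matching rules should be checked against the downloaded files before use.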
All files include data from 1979-01-02 to 2003-12-31 at a daily resolution. The spatial resolution of all files is about 1 degree longitude x 1 degree latitude. Variables included in these files are listed or linked below.
Variables in atmosphere model output:
Vertical velocity (omega)
Convective precipitation
Large-scale precipitation
Surface pressure
Specific humidity
Temperature (atmospheric profile)
Reference temperature (temp. at reference height, 2 meters in this case)
Zonal wind
Meridional wind
Geopotential height
Variables in land model output:
See https://www.cesm.ucar.edu/models/cesm1.2/clm/models/lnd/clm/doc/UsersGuide/history_fields_table_40.xhtml
Note that not all of the variables listed at the above link are included in the land model output files in this dataset.
This material is based upon work supported by the National Science Foundation under Grant No. 1454089.
We acknowledge high-performance computing support from Cheyenne (doi:10.5065/D6RX99HX) provided by NCAR's Computational and Information Systems Laboratory, sponsored by the National Science Foundation. The CESM project is supported primarily by the National Science Foundation. We thank all the scientists, software engineers, and administrators who contributed to the development of CESM2.
References
Danabasoglu, G., and Coauthors, 2020: The Community Earth System Model Version 2 (CESM2). Journal of Advances in Modeling Earth Systems, 12, e2019MS001916, https://doi.org/10.1029/2019MS001916.
keywords:
Climate modeling; atmospheric science; hydrometeorology; hydroclimatology; soil moisture; land-atmosphere interactions
published:
2025-09-15
Kantola, Ilsa; Masters, Michael; DeLucia, Evan
(2025)
Data sets for material included in "A 13-year record indicates differences in the duration and depth of soil carbon accrual among potential bioenergy crops" by Kantola et al., 2025, in Global Change Biology Bioenergy. Data include soil organic carbon (SOC), carbon stable isotope ratios, annual belowground biomass, and annual post-harvest litter for four crops, maize/soybean, miscanthus, switchgrass, and prairie, between 2008 and 2021.
keywords:
bioenergy crops; soil organic carbon; miscanthus; switchgrass; prairie
published:
2025-09-17
Avalos, Jose L; Mantri, Krishi
(2025)
Microbial fermentation provides a sustainable method of producing valuable chemicals. Adding dynamic control to fermentations can significantly improve titers, but most systems rely on transcriptional controls of metabolic enzymes, leaving existing intracellular enzymes unregulated. This limits the ability of transcriptional controls to switch off metabolic pathways, especially when metabolic enzymes have long half-lives. We developed a two-layer transcriptional/post-translational control system for yeast fermentations. Specifically, the system uses blue light to transcriptionally activate the major pyruvate decarboxylase PDC1, required for cell growth and concomitant ethanol production. Switching to darkness transcriptionally inactivates PDC1 and instead activates the anti-Pdc1p nanobody, NbJRI, to act as a genetically encoded inhibitor of Pdc1p accumulated during the growth phase. This dual transcriptional/post-translational control improves the production of 2,3-BDO and citramalate by up to 100 and 92% compared to using transcriptional controls alone in dynamic two-phase fermentations. This study establishes the NbJRI nanobody as an effective genetically encoded inhibitor of Pdc1p that can enhance the production of pyruvate-derived chemicals.
keywords:
metabolic engineering
published:
2017-09-28
Price, Edward P. F.; Spyreas, Greg; Matthews, Jeffrey
(2017)
This is the dataset used in the Journal of Ecology publication of the same name. It is a site by species matrix of species relative abundances.
The file BH.veg.data.csv contains a site by species matrix of species relative abundance (percent cover across all sampling quadrats within site). Data under the heading Year refers to sampling periods. Year 1 refers to the first set of samples taken between 1997 and 2000, Year 2 refers to the second set taken between 2002 and 2005, Year 3 refers to the third set taken between 2007 and 2010, and Year 4 refers to the fourth set taken between 2012 and 2015. All sites met Critical Trends Assessment Program (CTAP) size criteria of being at least 2 ha in size with a minimum of 500 m2 of suitable sampling area.
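The Year coding described above can be captured in a small lookup table. This is a minimal Python sketch that assumes the Year column stores the integer codes 1 to 4; the function name is hypothetical and is not part of the dataset.

```python
# Sampling periods as stated in the dataset description.
SAMPLING_PERIODS = {
    1: (1997, 2000),
    2: (2002, 2005),
    3: (2007, 2010),
    4: (2012, 2015),
}


def sampling_period(year_code: int) -> str:
    """Return the calendar range for a Year code (assumed to be 1-4)."""
    if year_code not in SAMPLING_PERIODS:
        raise ValueError(f"Unexpected Year code: {year_code}")
    start, end = SAMPLING_PERIODS[year_code]
    return f"{start}-{end}"


for code in sorted(SAMPLING_PERIODS):
    print(f"Year {code}: {sampling_period(code)}")
```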
The file BH.site.location.csv contains the Public Land Survey System ranges and townships in which specific sites were located. All sites were located within the U.S. state of Illinois.
More information about this dataset: Interested parties can request data from the Critical Trends Assessment Program, which was the source for the data on the wetlands in this study. More information on the program and data requests can be obtained by visiting the program webpage.
Critical Trends Assessment Program, Illinois Natural History Survey. http://wwx.inhs.illinois.edu/research/ctap/
keywords:
biodiversity; biotic homogenization; invasive species; Phalaris arundinacea; plant population and community dynamics; similarity index; wetlands
published:
2020-07-15
This repository includes scripts and datasets for Chapter 6 of my PhD dissertation, "Supertree-like methods for genome-scale species tree estimation," that had not been published previously. This chapter is based on the article: Molloy, E.K. and Warnow, T. "FastMulRFS: Fast and accurate species tree estimation under generic gene duplication and loss models." Bioinformatics, In press. https://doi.org/10.1093/bioinformatics/btaa444.
The results presented in my PhD dissertation differ from those in the Bioinformatics article, because I re-estimated species trees using FastMulRFS and MulRF on the same datasets in the original repository (https://doi.org/10.13012/B2IDB-5721322_V1). To re-estimate species trees, (1) a seed was specified when running MulRF, and (2) a different script (specifically preprocess_multrees_v3.py from https://github.com/ekmolloy/fastmulrfs/releases/tag/v1.2.0) was used for preprocessing gene trees (which were then given as input to MulRF and FastMulRFS). Note that this preprocessing script is a re-implementation of the original algorithm for improved speed (a bug fix was also implemented).
Finally, it was brought to my attention that the simulation in the Bioinformatics article differs from prior studies, because I scaled the species tree by 10 generations per year (instead of 0.9 years per generation, which is ~1.1 generations per year). I re-simulated datasets (true-trees-with-one-gen-per-year-psize-10000000.tar.gz and true-trees-with-one-gen-per-year-psize-50000000.tar.gz) using 0.9 years per generation to quantify the impact of this parameter change (see my PhD dissertation or the supplementary materials of the Bioinformatics article for discussion).
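The unit conversion in the paragraph above can be checked with a few lines of Python. This is only a worked arithmetic sketch. It assumes that branch lengths in generations are obtained by multiplying lengths in years by the number of generations per year, and the function name is hypothetical.

```python
def generations_per_year(years_per_generation: float) -> float:
    """Convert a generation time (years per generation) to generations per year."""
    return 1.0 / years_per_generation


prior_rate = generations_per_year(0.9)  # roughly 1.1 generations per year
article_rate = 10.0                     # scaling used in the Bioinformatics article

# Under the stated assumption, this is how many times longer the branches
# (measured in generations) are with the article's scaling.
scaling_ratio = article_rate / prior_rate

print(f"0.9 years per generation = {prior_rate:.2f} generations per year")
print(f"Ratio of the two scalings = {scaling_ratio:.1f}")
```

Under that assumption, the article's scaling yields branch lengths about nine times longer in generations than the 0.9 years per generation setting.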
keywords:
Species tree estimation; gene duplication and loss; statistical consistency; MulRF; FastRFS
published:
2020-10-14
Dalling, James W.; Heineman, Katherine D.
(2020)
Data on permanent plots at Fortuna and the Panama Canal Watershed, Republic of Panama, containing counts and percentages of trees with one or more multiple stems >10 cm diameter, with and without palms. Accompanying environmental data include elevation, precipitation, soil type, and soil chemical variables (pH, total N, NO3, NO4, resin P, Mehlich Ca, K, and Mg).
keywords:
multiple stems; resprouting; Panama Canal Watershed; Fortuna Forest Reserve
published:
2020-05-17
Mishra, Sudhanshu; Prasad, Shivangi; Mishra, Shubhanshu
(2020)
Models and predictions for submission to TRAC - 2020 Second Workshop on Trolling, Aggression and Cyberbullying
Our approach is described in our paper titled:
Mishra, Sudhanshu, Shivangi Prasad, and Shubhanshu Mishra. 2020. “Multilingual Joint Fine-Tuning of Transformer Models for Identifying Trolling, Aggression and Cyberbullying at TRAC 2020.” In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying (TRAC-2020).
The source code for training this model and more details can be found on our code repository: https://github.com/socialmediaie/TRAC2020
NOTE: These models are retrained for uploading here after our submission so the evaluation measures may be slightly different from the ones reported in the paper.
keywords:
Social Media; Trolling; Aggression; Cyberbullying; text classification; natural language processing; deep learning; open source;
published:
2020-05-30
Long, Stephen Patrick
(2020)
Original leaf gas exchange and absorptance data used in Collison et al. (2020), "Light, Not Age, Underlies the Maladaptation of Maize and Miscanthus Photosynthesis to Self-Shading," Frontiers in Plant Science, doi: 10.3389/fpls.2020.00783
keywords:
C4 photosynthesis; canopy; bioenergy; food security; quantum yield; shade acclimation; photosynthetic light-use efficiency; leaf aging
published:
2020-07-01
Rykhlevskii, Andrei; Huff, Kathryn D.
(2020)
keywords:
molten salt; fuel cycle; reprocessing; refueling
published:
2020-05-12
The data provided herein are accelerometer and strain data taken from the free vibration response of pre-tensioned, partially submerged steel beam specimens (modulus of elasticity assumed = 29,000 ksi). The specimens were subjected to various levels of pre-tension and various levels of submersion in water. The purpose of the testing was to quantify the effects of partial submersion on the vibrating frequencies of pretensioned beams. Three specimens were tested, each with a different cross section (but identical cross-sectional area). The different cross sections allow investigation of the effects of specimen width as the specimen vibrates through water.
The testing procedure was as follows:
1) Apply a specified level of tension in the beam. Measure tension via 3 strain gages.
2) Submerge the specimens to a specified depth of water.
3) Excite the beams with either a hammer impact or a pull-and-release method (physically pull the middle of the bar and quickly release)
4) Measure the free vibration of the beam with 2 accelerometers.
Schematic drawings of the test setup and the test specimens are provided, as is a picture of the test setup.
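To illustrate how a vibrating frequency might be read from a free vibration record like those described above, the Python sketch below picks the largest peak of a plain discrete Fourier transform. It is not the authors' processing method. The sampling rate, the synthetic damped signal, and the function names are all assumptions made for the example, and the real data files may need different handling.

```python
import cmath
import math


def dominant_frequency(samples, sample_rate_hz):
    """Estimate the dominant frequency (Hz) by picking the largest DFT peak."""
    n = len(samples)
    mean = sum(samples) / n
    centered = [s - mean for s in samples]  # remove any constant offset

    best_bin, best_magnitude = 0, 0.0
    for k in range(1, n // 2 + 1):  # skip the zero-frequency bin
        coefficient = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(centered)
        )
        if abs(coefficient) > best_magnitude:
            best_bin, best_magnitude = k, abs(coefficient)
    return best_bin * sample_rate_hz / n


def synthetic_free_vibration(freq_hz, decay_per_s, sample_rate_hz, duration_s):
    """Build a hypothetical decaying sinusoid standing in for accelerometer data."""
    n = int(sample_rate_hz * duration_s)
    return [
        math.exp(-decay_per_s * i / sample_rate_hz)
        * math.sin(2 * math.pi * freq_hz * i / sample_rate_hz)
        for i in range(n)
    ]


# Hypothetical values: 12 Hz response sampled at 200 Hz for 2 seconds.
record = synthetic_free_vibration(12.0, 0.5, 200.0, 2.0)
estimate = dominant_frequency(record, 200.0)
print(f"Estimated dominant frequency: {estimate:.1f} Hz")
```

The frequency resolution of this estimate is the sampling rate divided by the number of samples, so longer records give finer resolution. For long records, a fast Fourier transform routine would be a more practical choice than this direct sum.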
keywords:
free vibration; beam; partially-submerged; prestressed;
published:
2025-11-03
Blanc-Betes, Elena; Gomez-Casanovas, Nuria; Hartman, Melannie D.; Hudiburg, Tara W.; Khanna, Madhu; Parton, William; DeLucia, Evan H.
(2025)
Bioenergy with carbon capture and storage (BECCS) sits at the nexus of climate and energy security. We evaluated trade-offs between scenarios that support climate stabilization (negative emissions and net climate benefit) or energy security (ethanol production). Our spatially explicit model indicates that the foregone climate benefit from abandoned cropland (opportunity cost) increased carbon emissions per unit of energy produced by 14–36%, making geologic carbon capture and storage necessary to achieve negative emissions from any given energy crop. The toll of opportunity costs on the climate benefit of BECCS from set-aside land was offset through the spatial allocation of crops based on their individual biophysical constraints. Dedicated energy crops consistently outperformed mixed grasslands. We estimate that BECCS allocation to land enrolled in the Conservation Reserve Program (CRP) could capture up to 9 Tg C year⁻¹ from the atmosphere, deliver up to 16 Tg CE year⁻¹ in emissions savings, and meet up to 10% of the US energy statutory targets, but contributions varied substantially as the priority shifted from climate stabilization to energy provision. Our results indicate a significant potential to integrate energy security targets into sustainable pathways to climate stabilization but underpin the trade-offs of divergent policy-driven agendas.
keywords:
Sustainability; Field Data; Modeling
published:
2020-02-12
XSEDE-Extreme Science and Engineering Discovery Environment
(2020)
The XSEDE program manages the database of allocation awards for the portfolio of advanced research computing resources funded by the National Science Foundation (NSF). The database holds data for allocation awards dating from the start of the TeraGrid program in 2004 to the present, with awards continuing through the end of the second XSEDE award in 2021. The project data include lead researcher and affiliation, title and abstract, field of science, and the start and end dates. Along with the project information, the data set includes resource allocation and usage data for each award associated with the project. The data show the transition of resources over a fifteen-year span along with the evolution of researchers, fields of science, and institutional representation.
keywords:
allocations; cyberinfrastructure; XSEDE
published:
2025-04-14
This dataset builds on an existing dataset that captures the demographics of artists represented by top-tier galleries in the 2016–2017 New York art season (Case-Leal, 2017, https://web.archive.org/web/20170617002654/http://www.havenforthedispossessed.org/), adding a census of reviews and catalogs about those exhibitions to assess the proportionality of media coverage across race and gender. The readme file explains variables, collection, the relationship between the datasets, and an example of how the Case-Leal dataset was transformed. The ArticleDataset.csv provides all articles with citation information as well as artist, artistic identity characteristic, and gallery. The ExhibitionCatalog.csv provides exhibition catalog citation information for each identified artist.
New in this V2:
- In V1, ArticleDataset.csv had both data on the articles published and all of the exhibitions, which was misleading. In V2 I separated out so that ArticleDataset only has articles, and AllSoloShows has all shows, including those that had no articles written about them in the publications reviewed.
- Upon closer review I noticed approximately 10 out of the 133 articles had incorrect information in variable "Publication content type: art or general" and/or "Publication Carrier type: web or library?" so I updated V2.
- Upon closer review I noticed there were 3 instances of artists who had two solo shows apiece: in addition to Meleko Mokgosi and Carrie Mae Weems, which I had already noted in V1, there was also Roxy Paine. I had not noticed this because only one of Paine's two shows had been written about. This brings the total number of shows to 117 (which was 116 in V1).
- Upon closer review I removed one row from ExhibitionCatalogs.csv, as the item I had listed did not meet the parameters.
keywords:
diversity and inclusion; diversity audit; contemporary art; art exhibitions; art exhibition reviews; exhibition catalogs; magazines; newspapers; demographics
published:
2019-01-27
Le, Thien; Sy, Aaron; Molloy, Erin K.; Zhang, Qiuyi; Rao, Satish; Warnow, Tandy
(2019)
This repository includes datasets that are studied with INC/INC-ML/INC-NJ in the paper "Using INC within Divide-and-Conquer Phylogeny Estimation" that was submitted to AICoB 2019. Each dataset has its own readme.txt that further describes the creation process and other parameters and software used in making these datasets. The latest implementation of INC/INC-ML/INC-NJ can be found at https://github.com/steven-le-thien/constraint_inc. Note: there may be files with DS_STORE as extension in the datasets; please ignore these files.
keywords:
phylogenetics; gene tree estimation; divide-and-conquer; absolute fast converging
published:
2019-12-22
Zachwieja, Alexandra
(2019)
Dataset providing calculation of a Competition Index (CI) for Late Pleistocene carnivore guilds in Laos and Vietnam and their relationship to humans. Prey mass spectra, prey focus masses, and prey class raw data can be used to calculate the CI following Hemmer (2004). Mass estimates were calculated for each species following Van Valkenburgh (1990). Full citations to methodological papers are included as relationships with other resources.
keywords:
competition; Southeast Asia; carnivores; humans
published:
2020-05-15
Mishra, Shubhanshu; Agarwal, Sneha; Guo, Jinlong; Phelps, Kirstin; Picco, Johna; Diesner, Jana
(2020)
This dataset contains the tweets collected for the paper: Shubhanshu Mishra, Sneha Agarwal, Jinlong Guo, Kirstin Phelps, Johna Picco, and Jana Diesner. 2014. Enthusiasm and support: alternative sentiment classification for social movements on social media. In Proceedings of the 2014 ACM conference on Web science (WebSci '14). ACM, New York, NY, USA, 261-262. DOI: https://doi.org/10.1145/2615569.2615667
The data only contains tweet IDs and the corresponding enthusiasm and support labels by two different annotators.
keywords:
Twitter; text classification; enthusiasm; support; social causes; LGBT; Cyberbullying; NFL
published:
2019-09-25
Wong, Tony; Hughes, A; Tokuda, K; Indebetouw, R; Onishi, T; Bandurski, J. B.; Chen, C. H. R.; Fukui, Y; Glover, S. C. O.; Klessen, R. S.; Pineda, J. L.; Roman-Duval, J.; Sewilo, M.; Wojciechowski, E.; Zahorecz, S.
(2019)
¹²CO and ¹³CO maps for six molecular clouds in the Large Magellanic Cloud, obtained with the Atacama Large Millimeter/submillimeter Array (ALMA). See the associated article in the Astrophysical Journal, and README files within each ZIP archive. Please cite the article if you use these data.
keywords:
Radio astronomy
published:
2020-12-12
Jones, Todd M.; Benson, Thomas J.; Ward, Michael P.
(2020)
Dataset associated with Jones et al. FE-2019-01175 submission: Does the size and developmental stage of traits at fledging reflect juvenile flight ability among songbirds? Excel CSV files with all of the data used in analyses and a file with descriptions of each column. The flight ability variable in this dataset was derived from fledgling drop tests, examples of which can be found in the related dataset: Jones, Todd M.; Benson, Thomas J.; Ward, Michael P. (2019): Flight Ability of Juvenile Songbirds at Fledgling: Examples of Fledgling Drop Tests. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2044905_V1.
keywords:
body condition; fledgling; flight ability; locomotor ability; post-fledging; songbirds; wing development; wing emergence
published:
2020-02-27
Clem, Scott; Sparbanie, Taylor; Luro, Alec; Harmon-Threatt, Alexandra
(2020)
These data were collected for an experiment examining effects of neonicotinoid (clothianidin) presence on hover fly (Diptera: Syrphidae) behavior. Hover flies of two species (Eristalis arbustorum and Toxomerus marginatus) were offered a choice to feed on artificial flowers laced with sucrose solution that was either contaminated (CLO) or not contaminated (CON) with clothianidin. Two different concentrations of clothianidin in 0.5 M sucrose solution were tested: 2.5 ppb and 150 ppb. We conducted four sets of 10 trials, each trial set examining a different combination of species and clothianidin dose. Across 6 hours of video for each trial we recorded 1) number of visits to each flower that resulted in feeding, and 2) amount of time spent feeding during each visit.
We found that while neither species fed significantly longer on either of the solutions, E. arbustorum appeared to avoid flowers with clothianidin particularly at high rates. In the paper, we attribute this avoidance response, partially, to hover fly-visible spectral differences between the two flower choices and discuss potential implications for field and lab-based studies.
In the enclosed zip file we have included all data for this project and code scripts from R.
* Note: Data folder contains 4 files (instead of 6 as mentioned in Readme): e.tenax_photoreceptors.csv; hoverfly_data_UPDATE.csv; number_visits_UPDATE.csv; and Original 2018 hover fly choice test data_Clem2020.xlsx
keywords:
Syrphidae; hoverfly; Eristalis; Toxomerus; Choice Experiment; Neonicotinoid; Clothianidin
published:
2022-03-11
Kantola, Ilsa; Masters, Michael; Blanc-Betes, Elena; Gomez-Casanovas, Nuria; DeLucia, Evan
(2022)
Data sets relating to the manuscript “Long-term yields in annual and perennial bioenergy crops in the Midwestern USA” published in Global Change Biology Bioenergy. Field data, including annual peak biomass and harvest yields from maize/soy, miscanthus, switchgrass, and prairie field trials from 2008-2018 are included. Peak and harvest biomass for fertilized and unfertilized miscanthus are included from 2014-2018.
keywords:
miscanthus; switchgrass; yield; drought; crop; perennial; bioenergy
published:
2021-04-16
Xia, Yushu; Wander, Michelle; Kwon, Hoyoung
(2021)
This dataset includes five files developed using the procedures described in the article 'Developing County-level Data of Nitrogen Fertilizer and Manure Inputs for Corn Production in the United States' and Supplemental Information published in the Journal of Cleaner Production in 2021.
Citation: Xia, Yushu, Hoyoung Kwon, and Michelle Wander. "Developing county-level data of nitrogen fertilizer and manure inputs for corn production in the United States." Journal of Cleaner Production 309 (2021): e126957.
Brief method: The fertilizer and manure inputs for corn were generated with a top-down approach by assigning county-level total N inputs reported by USGS to different crops using state- and county-level survey data. The corn N needs were estimated using empirical extension-based equations coupled with soil and environmental covariates. The estimates of fertilizer N inputs were further refined for corn grain and silage production at the county level and gap-filling (using state-level averages) was carried out to generate final files for U.S. county-level N inputs.
The dataset is provided in an alternative format in Google Earth Engine: https://code.earthengine.google.com/13a0078e7ee727bc001e045ad0e8c6fc
keywords:
Corn; Nitrogen Fertilizer; Manure; Conterminous U.S.
published:
2022-02-10
Sharma, Bijay P.; Zhang, Na; DoKyoung, Lee; Heaton, Emily; Delucia, Evan H.; Sacks, Erik J.; Kantola, Ilsa B.; Boersma, Nicholas N.; Long, Stephen P.; Voigt, Thomas B.; Khanna, Madhu
(2022)
The compiled datasets include plot-level observations of energy crops (miscanthus and switchgrass) from recent experimental field trials in the US, including dry biomass yield, location, state, region, harvest year, growing season degree days (GDD), winter season heating degree days (HDD), growing season cumulative precipitation, annual nitrogen application rate, age of the plant when harvested, National Commodity Crop Productivity Index (NCCPI) values, and cultivar type (switchgrass) from various published and unpublished sources.
The Stata code includes estimation procedures for four different specifications: Model A includes deterministic effect without interaction terms; Model B includes deterministic effect with interaction terms (N², age², N × age, GDD², precip², N × NCCPI); Model C includes deterministic effect with interaction terms, study, and location random effect; Model D includes deterministic effect with interaction terms, harvest year augmented study, and location random effect.
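To make the Model B specification more concrete, one possible way to write it is sketched below. This is an illustration under assumptions, not the authors' published equation. It assumes a yield model that is linear in its parameters, that the main-effect covariates are those listed in the dataset description, and that the coefficient symbols and the error term are hypothetical notation. Only the squared and product terms are taken from the description above.

```latex
\begin{aligned}
\text{yield}_i ={}& \beta_0 + \beta_1 N_i + \beta_2\,\text{age}_i + \beta_3\,\text{GDD}_i
  + \beta_4\,\text{HDD}_i + \beta_5\,\text{precip}_i + \beta_6\,\text{NCCPI}_i \\
 &+ \beta_7 N_i^2 + \beta_8\,\text{age}_i^2 + \beta_9\,(N_i \times \text{age}_i)
  + \beta_{10}\,\text{GDD}_i^2 + \beta_{11}\,\text{precip}_i^2
  + \beta_{12}\,(N_i \times \text{NCCPI}_i) + \varepsilon_i
\end{aligned}
```

Under the same assumptions, Model A would drop the second line except for the error term, and Models C and D would add the random effects named in the description.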
keywords:
Age; Miscanthus; Nitrogen; Switchgrass; Yield; Center for Advanced Bioenergy and Bioproducts Innovation
published:
2025-04-02
Pastrana-Otero, Isamar; Godbole, Apurva R.; Kraft, Mary L.
(2025)
This dataset contains Raman spectra, each acquired from an individual living cell entrapped within a soft or stiff gelatin methacrylate hydrogel or from a cell-free region of the hydrogel sample. Spectra were acquired from the following cell types: Madin-Darby Canine Kidney cell (MDCK); Chinese hamster ovary cell (CHO-K1); transfected CHO-K1 cell that expressed the SNAP-tag and HaloTag reporter proteins fused to an organelle-specific protein (CHO-T); human monocyte-like cell (THP-1); inactive macrophage-like (M0-like); active anti-inflammatory macrophage-like (M2-like); and pro/anti-inflammatory macrophage-like (M1/M2-like). These spectra are useful for identifying whether the hydrogel matrix obscures the Raman spectral signatures that are characteristic of each of these cell types.
keywords:
Raman spectroscopy; 3D cell culture; single-cell spectrum; hydrogel scaffold; collagen scaffold; macrophage spectra; macrophage differentiation; THP-1 line; noninvasive phenotype identification; vibrational spectroscopy