Dataset Search

Displaying 101 - 125 of 1004 in total

Illinois Data Bank Dataset Search Results

Results

published: 2025-11-12

Data for Operation of a high frequency, phase slip qubit

Purmessur, Cheeranjeev; Chow, Kaicheung; van Heck, Bernard; Kou, Angela (2025)

This dataset contains all the raw and processed data used to generate the figures presented in the main text and the supplementary information of the paper "Operation of a high frequency, phase slip qubit." It also includes code for data analysis and code for generating the figures. Note: V2 includes time domain analysis that also accounts for the thermal dephasing from the f state (see readme in Time domain Device A).

keywords: phase slip qubit; superconducting qubit; quantum information; disordered superconductors

published: 2021-04-29

Data for "Crop-specific exposure to extreme temperature and moisture for the globe for the last half century"

Jackson, Nicole; Konar, Megan; Debaere, Peter; Sheffield, Justin (2021)

Global assessments of climate extremes typically do not account for the unique characteristics of individual crops. A consistent definition of the exposure of specific crops to extreme weather would enable agriculturally-relevant hazard quantification. We introduce the Agriculturally-Relevant Exposure to Shocks (ARES) model, a novel database of both the temperature and moisture extremes facing individual crops by explicitly accounting for crop characteristics. Specifically, we estimate crop-specific temperature and moisture shocks during the growing season for a 0.25-degree spatial grid and daily time scale from 1961-2014 globally for 17 crops. The resulting database presented here provides annual crop- and event-specific exposure rates. Both gridded and country-level exposure rates are provided for each of the 17 crops. Our results provide new insights into the changes in the magnitude as well as spatial and temporal distribution of extreme events that impact crops over the past half-century. For additional information, please see the related paper by Jackson et al. (2021) in Environmental Research Letters.

keywords: Crop-specific; weather extremes; temperature; moisture; global; gridded; time series

published: 2022-04-15

Data on "Evaluation of CO2 sealing potential of heterogeneous Eau Claire shale"

Kim, Hyunbin; Makhnenko, Roman (2022)

This dataset is provided to support the statements in Kim, H., and R.Y. Makhnenko. 2022. "Evaluation of CO2 sealing potential of heterogeneous Eau Claire shale". Journal of the Geological Society. In geologic carbon dioxide (CO2) storage in deep saline aquifers, buoyant CO2 tends to float upwards in the reservoirs overlaid by low permeable formations called caprocks. Caprocks should serve as barriers to potential CO2 leakage that can happen through a diffusion loss and permeation through faults, fractures, or pore spaces. The leakage through intact caprock would mainly depend on its permeability and CO2 breakthrough pressure, and is affected by the heterogeneities in the material. Here, we study the sealing potential of a caprock from Illinois Basin - Eau Claire shale, with sandy and shaly fractions distinguished via electron microscopy and grain/pore size and surface area characterization. The direct measurements of permeability of sandy shale provide values of ~10^-15 m^2, while clayey specimens are three orders of magnitude less permeable. The CO2 breakthrough pressure under in-situ stress conditions is 0.1 MPa for the sandy shale and 0.4 MPa for the clayey counterpart – these values are higher than those predicted by the porosimetry methods performed on the unconfined specimens. Sandy Eau Claire shale would allow penetration of large CO2 volumes at low overpressures, while the clayey formation can serve as a caprock in the absence of faults and fractures in it.

keywords: Geologic carbon storage; Caprock; Shale; CO2 breakthrough pressure; Porosimetry.

published: 2025-11-12

Data for Rapid and Efficient in planta Genome Editing in Sorghum Using Foxtail Mosaic Virus-mediated sgRNA Delivery

Baysal, Can; Kausch, Albert P.; Cody, Jon P.; Altpeter, Fredy; Voytas, Daniel (2025)

The requirement of in vitro tissue culture for the delivery of gene editing reagents limits the application of gene editing to commercially relevant varieties of many crop species. To overcome this bottleneck, plant RNA viruses have been deployed as versatile tools for in planta delivery of recombinant RNA. Viral delivery of single-guide RNAs (sgRNAs) to transgenic plants that stably express CRISPR-associated (Cas) endonuclease has been successfully used for targeted mutagenesis in several dicotyledonous and a few monocotyledonous plants. Progress with this approach in monocotyledonous plants is limited so far by the availability of effective viral vectors. We engineered a set of foxtail mosaic virus (FoMV) and barley stripe mosaic virus (BSMV) vectors to deliver the fluorescent protein AmCyan to track viral infection and movement in Sorghum bicolor. We further used these viruses to deliver and express sgRNAs to Cas9 and Green Fluorescent Protein (GFP) expressing transgenic sorghum lines, targeting Phytoene desaturase (PDS), Magnesium-chelatase subunit I (MgCh), 4-hydroxy-3-methylbut-2-enyl diphosphate reductase, orthologs of maize Lemon white1 (Lw1) or GFP. The recombinant BSMV neither infected sorghum nor delivered or expressed AmCyan and sgRNAs. In contrast, the recombinant FoMV systemically spread throughout sorghum plants and induced somatic mutations with frequencies reaching up to 60%. This mutagenesis led to visible phenotypic changes, demonstrating the potential of FoMV for in planta gene editing and functional genomics studies in sorghum.

keywords: Feedstock Production;Genome Engineering;Genomics

published: 2019-12-10

Land productivity and land availability for growing bioenergy crop in the Contiguous US. Center for Advanced Bioenergy and Bioproducts Innovation (CABBI)

Yang, Pan; Zhao, Qiankun; Cai, Ximing (2019)

The dataset consists of two types of data: the estimate of land productivity (the maximum productivity, MP) and the estimate of land that has low productivity for any major crops planted in the Contiguous United States and therefore may be available for growing bioenergy crops (the marginal land, ML). All data items are in GeoTiff format, under the World Geodetic System (WGS) 84 projection, with a resolution of 0.0020810045 degree (~250 m). The MP values are calculated based on machine learning model estimated yields of major crops in the CONUS, and are provided as an expected value (MP_mean.tif) and an associated uncertainty (MP_IDP.tif). The ML availability data have two versions: a deterministic version and a version with uncertainty. The deterministic MLs are determined as the land pixels with expected MP values falling in the range defined in the following criteria, and the MLs with uncertainty are determined as the probability that the MP value of a land pixel falls in the range defined in the following criteria:

Criteria: Description
S1: Current crop and pasture land with MP <= P50
S2: Current crop and pasture land with MP <= P25
S3: S1 + current grass and shrub land with P25 < MP < P50
S4: S2 + current grass and shrub land with P10 < MP < P25
Economic: Current crop and pasture land with potential profitability < 0

Here P10, P25 and P50 are the 10th, 25th and 50th percentiles of crop MP values.

keywords: Land productivity;marginal land;land use
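
The deterministic S1 to S4 criteria in the entry above can be read as simple threshold tests on each pixel. The following is a minimal sketch of that reading, not the authors' code. It assumes that the percentiles are taken over the MP values of crop and pasture pixels, that a linear-interpolation percentile is acceptable, and that land cover is available as a label per pixel. The function names and the labels "crop_pasture" and "grass_shrub" are hypothetical.

```python
def percentile(values, q):
    """Linear-interpolation percentile (q in 0..100) of a non-empty list."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def classify_marginal_land(mp, land_cover):
    """Return a dict mapping S1..S4 to one boolean per pixel.

    mp: expected MP value per pixel.
    land_cover: hypothetical labels per pixel, "crop_pasture",
        "grass_shrub", or anything else for other land.
    """
    # Assumption: P10, P25, P50 come from crop/pasture pixels only.
    crop_mp = [m for m, c in zip(mp, land_cover) if c == "crop_pasture"]
    p10, p25, p50 = (percentile(crop_mp, q) for q in (10, 25, 50))

    result = {"S1": [], "S2": [], "S3": [], "S4": []}
    for m, c in zip(mp, land_cover):
        crop = c == "crop_pasture"
        grass = c == "grass_shrub"
        s1 = crop and m <= p50
        s2 = crop and m <= p25
        result["S1"].append(s1)
        result["S2"].append(s2)
        # S3 and S4 extend S1 and S2 with grass/shrub land in a band.
        result["S3"].append(s1 or (grass and p25 < m < p50))
        result["S4"].append(s2 or (grass and p10 < m < p25))
    return result


if __name__ == "__main__":
    mp = [1.0, 2.0, 3.0, 4.0, 2.0, 1.5, 0.5]
    cover = ["crop_pasture"] * 4 + ["grass_shrub"] * 3
    for name, mask in classify_marginal_land(mp, cover).items():
        print(name, mask)
```

The Economic criterion is left out because it depends on profitability data that the description does not detail.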

published: 2021-10-24

Data for Design modifications affect bat box temperatures and suitability as maternity habitat

Tillman, Francis E.; Bakken, George S.; O'Keefe, Joy M. (2021)

This dataset contains daily and hourly temperature measurements in twenty different bat box designs deployed in central Indiana, USA from May to September 2018. Daily and hourly environmental data (temperature, solar radiation, wind speed and direction) are also included for days and hours sampled. Bat box temperature data were reclassified to cool (≤ 30°C), permissive (30.1–39.9°C), and stressful (≥ 40°C) categories according to known temperature tolerances of temperate-zone bats.

keywords: bat box; design; environmental variables; microclimate; temperature
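
The temperature reclassification described in the entry above can be sketched as a small function. This is an illustrative reading of the stated thresholds, not the authors' code. It assumes temperatures in degrees Celsius and treats any value strictly between 30 and 40 as permissive, which matches the stated 30.1–39.9°C band for data recorded to one decimal place. The function names are hypothetical.

```python
from collections import Counter

COOL_MAX_C = 30.0        # cool: at or below 30 degrees C
STRESSFUL_MIN_C = 40.0   # stressful: at or above 40 degrees C


def classify_box_temperature(temp_c):
    """Map one bat box temperature (degrees C) to a category name."""
    if temp_c <= COOL_MAX_C:
        return "cool"
    if temp_c >= STRESSFUL_MIN_C:
        return "stressful"
    return "permissive"


def summarize(temps_c):
    """Count how many readings fall in each category."""
    return Counter(classify_box_temperature(t) for t in temps_c)


if __name__ == "__main__":
    readings = [25.0, 30.0, 30.1, 39.9, 40.0, 45.2]
    print(summarize(readings))
```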

published: 2024-04-11

The missing phosphorus legacy of the Anthropocene: quantifying residual phosphorus in the biosphere

Margenot, Andrew; Zhou, Shengnan; Xu, Suwei; Condron, Leo; Metson, Geneviève; Haygarth, Philip; Wade, Jordon; Agyeman, Price Chapman (2024)

A defining feature of the Anthropocene is the distortion of the biosphere phosphorus (P) cycle. A relatively sudden acceleration of input fluxes without a concomitant increase in output fluxes has led to net accumulation of P in the terrestrial-aquatic continuum. Over the past century, P has been mined from geological deposits to produce crop fertilizers. When P inputs are not fully removed with harvest of crop biomass, the remaining P accumulates in soils. This residual P is a uniquely anthropogenic pool of P, and its management is critical for agronomic and environmental sustainability. This dataset includes data used to quantify residual P from different long-term managed systems. The following is the description of the dataset. There are 7 sheets in total.

1. P_balance: From Morrow Plots maize-maize rotation (1888-2021). L: low estimation; M: medium estimation; H: high estimation.
2. M3P: From Morrow Plots selected plots (selected years). M3P_sur: Mehlich III P concentration in surface 17cm soils; M3P_sub: Mehlich III P concentration in 17-34cm subsoils; P_balance: the difference between P inputs and P outputs; TP_sur: total P stocks in surface 17cm soils; TP_sub: total P stocks in 17-34cm subsoils.
3. Morrow_Plot_P_pool_all: Group: a - labile P; b - Fe/Al-P; c - Ca-P; d - total organic P; e - non-extractable P; Fertilized: P stocks in the fertilized plot; Unfertilized: P stocks in the unfertilized plot; F-U: difference between P stocks in the fertilized and unfertilized plots; dif%: percent difference in total P.
4. Rothamsted_P_pool_all: Treatment: Unfertilized: no fertilization; FYM: farmyard manure; PK: synthetic P and K fertilizer; Group: a - labile P; b - Fe/Al-P; c - Ca-P; d - total organic P; e - non-extractable P; P_change: difference in P stocks over time; dif%: percent difference in total P.
5. L'Acadie_P_pool_all: Treatment: MP_LowP: moldboard plow with low rate of P fertilizer; MP_HighP: moldboard plow with high rate of P fertilizer; NT_LowP: no till with low rate of P fertilizer; NT_HighP: no till with high rate of P fertilizer; Group: a - labile P; b - Fe/Al-P; c - Ca-P; d - total organic P; e - non-extractable P; P_change: difference in P stocks over time; dif%: percent difference in total P.
6. Rothamsted_P_pool_duration: Treatment: Unfertilized: no fertilization; FYM: farmyard manure; PK: synthetic P and K fertilizer; Duration: from a year to another year; Group: a - labile P; b - Fe/Al-P; c - Ca-P; d - total organic P; e - non-extractable P; P_change: difference in P stocks over time; dif%: percent difference in total P.
7. L'Acadie_P_pool_duration: Treatment: MP_LowP: moldboard plow with low rate of P fertilizer; MP_HighP: moldboard plow with high rate of P fertilizer; NT_LowP: no till with low rate of P fertilizer; NT_HighP: no till with high rate of P fertilizer; Duration: from a year to another year; Group: a - labile P; b - Fe/Al-P; c - Ca-P; d - total organic P; e - non-extractable P; P_change: difference in P stocks over time; dif%: percent difference in total P.

keywords: phosphate rock; biosphere; balances; soil test P; long-term experiment

published: 2025-12-02

Data for The Effects of Sequential Hydrothermal-Mechanical Refining Pretreatment on Cellulose Structure Changes and Sugar Recoveries

Cheng, Ming-Hsun; Maitra, Shraddha; Carr Clennon, Aidan N.; Appell, Michael; Dien, Bruce; Singh, Vijay (2025)

The recalcitrance of lignocellulosic biomass necessitates an efficient pretreatment protocol for operating a successful cellulosic biorefinery. It is critical to improve cellulose accessibility for hydrolysis and fermentation by altering the plant cell wall’s physical structure and chemical composition. Sequential hydrothermal-mechanical refining pretreatment (HMR) allows efficient recovery of cellulosic sugars without utilizing any hazardous chemicals. HMR has been successfully applied to Liberty switchgrass, a bioenergy cultivar released by the USDA, and now it is being applied to oilcane, a recently developed transgenic sugarcane variety engineered to accumulate lipids in its vegetative tissues. Sugar yields of oilcane bagasse (OCB) and switchgrass (SG) treated with HMR are 96.4% and 75.4%, respectively. This study sought to correlate cellulosic sugar yields with structural changes within the cell wall caused by HMR on two distinct bioenergy crops. Simon’s staining technique for the specific surface area analysis showed that HMR increased the specific surface area of pretreated biomass residues by 80-112%. In addition, ATR-FTIR was performed to determine the effects of HMR on physical structures based on the total crystallinity index (TCI) and hydrogen bonding intensity (HBI). Irrespective of biomass type, HMR decreased the initial crystalline cellulose contents of untreated biomass residues by 3.5% and reduced TCI and HBI by 7-13%. The study found that sugar yields were negatively correlated to reducing values of hydrogen bonding intensity, crystalline cellulose content, and total crystallinity index.

keywords: Conversion;Biomass Analytics;Economics;Hydrolysate

published: 2025-12-18

Data for "Discovery, Characterization, and Application of Chromosomal Integration Sites in the Hyperthermophilic Archaeon Sulfolobus islandicus"

Boob, Aashutosh; Zhang, Changyi; Pan, Yuwei; Zaidi, Airah; Whitaker, Rachel; Zhao, Huimin (2025)

Sulfolobus islandicus, an emerging archaeal model organism, offers unique advantages for metabolic engineering and synthetic biology applications owing to its ability to thrive in extreme environments. Although several genetic tools have been established for this organism, the lack of well-characterized chromosomal integration sites has limited its potential as a cellular factory. Here, we systematically identified and characterized 13 artificial CRISPR RNAs targeting eight integration sites in S. islandicus using the CRISPR-COPIES pipeline and a multi-omics-informed computational workflow. We leveraged the endogenous CRISPR-Cas system to integrate the reporter gene lacS and validated heterologous expression through a β-galactosidase assay, revealing significant positional effects. As a proof of concept, we utilized these sites to genetically manipulate lipid ether composition by overexpressing glycerol dibiphytanyl glycerol tetraether (GDGT) ring synthase B (GrsB). This study expands the genetic toolbox for S. islandicus and advances its potential as a robust platform for archaeal synthetic biology and industrial biotechnology.

keywords: AI/ML; gene editing; genome engineering; metabolic engineering

published: 2019-03-22

Flight Ability of Juvenile Songbirds at Fledgling: Examples of Fledgling Drop Tests

Jones, Todd M.; Benson, Thomas J.; Ward, Michael P. (2019)

This data publication provides example video clips related to research on associations between the flight ability of juvenile songbirds at fledging and juvenile morphological traits (wing emergence, wing length, body condition, mass, and tarsus length). File names reflect the species dropped in each video. These videos are supplemental material for scientific publications by the authors and reflect an example subset of all videos collected from 2017-2018 as part of a larger study on the post-fledging ecology of grassland and shrubland birds in east-central Illinois, USA. No birds were harmed/injured in the production of these videos and procedures were approved by the Illinois Institutional Animal Care and Use Committee (IACUC), protocol no. 18221. Individuals depicted in the videos have given consent for the videos to be shared (talent/model release form; https://publicaffairs.illinois.edu/resources/release/).

keywords: songbirds; flight ability; wing development; wing length; wing emergence; nestling development; post-fledging

published: 2019-03-19

Meltwater Meandering Channels on Ice: Centerlines and Images

Fernandez, Roberto; Parker, Gary; Stark, Colin P. (2019)

This dataset includes images and extracted centerlines from experiments looking at the formation and evolution of meltwater meandering channels on ice. The laboratory data includes centimeter- and millimeter-scale rivulets. The dataset also includes an image and corresponding centerlines from the Peterman Ice Island. All centerlines were manually digitized in Matlab but no distributable code was developed for the process. Once digitized, centerlines were smoothed and standardized following methods and routines developed by other authors (Zolezzi and Guneralp, 2016; Guneralp and Rhoads, 2008). Details about the preparation of the centerlines and processing with these methods are included in the dissertation by Fernández (2018) linked to this dataset.

"Millimeter scale and Peterman Ice Island centerlines.pdf": This file includes the images of two mm-scale experiments and the Peterman Ice Island image. Seventeen centerlines were digitized from the former and seven were digitized from the latter. Those centerlines are shown above the images themselves.

"Centimeter scale rivulet images.pdf": This file includes images corresponding to all cm-scale centerlines used for the analysis presented in the dissertation by Fernandez (2018). Each image has a short caption indicating the run ID and the time at which it was captured. The images were used to extract centerlines to look at the planform evolution of cm-scale meltwater meandering rivulets on ice. Images include 26 centerlines from four different runs.

"Meltwater meandering channel centerlines.xlsx": This spreadsheet contains the centerline data for all fifty centerlines. The workbook includes 51 sheets. The first 50 are related to each one of the channels. The mm-scale and Peterman Ice Island ones are identified using the same IDs shown in "Millimeter scale and Peterman Ice Island centerlines.pdf". The cm-scale centerlines are identified by run ID and a number indicating the time in minutes (with t = 0 min being the time at which water started flowing over the ice block). The naming convention is also associated with the images in "Centimeter scale rivulet images.pdf". The last sheet in the workbook includes a summary of the channel widths measured from every image for each centerline. The 50 sheets with the centerline information have four columns each. The titles of the columns are X, Y, S, and C. X and Y are dimensionless coordinates of the centerline. S is the dimensionless streamwise coordinate (location along the centerline). C is the dimensionless curvature value. All these values were non-dimensionalized with the channel width. See Fernandez (2018), Zolezzi and Guneralp (2016), and Guneralp and Rhoads (2008) for more details regarding the process of smoothing, standardizing and non-dimensionalization of the centerline coordinates.

keywords: Meltwater, Meandering, Ice, Supraglacial, Experiments
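
To illustrate how the X, Y, S, and C columns in the entry above relate to one another, the sketch below derives a streamwise coordinate and a curvature estimate from centerline coordinates. It uses a generic discrete approximation (cumulative chord length, and turning angle divided by local segment length). It is not the smoothing and standardization routine of Zolezzi and Guneralp (2016) or Guneralp and Rhoads (2008), so values will not reproduce the spreadsheet exactly. The function names are hypothetical.

```python
import math


def streamwise_coordinate(xs, ys):
    """Cumulative chord length along the centerline, starting at 0."""
    s = [0.0]
    for i in range(1, len(xs)):
        s.append(s[-1] + math.hypot(xs[i] - xs[i - 1], ys[i] - ys[i - 1]))
    return s


def discrete_curvature(xs, ys):
    """Curvature at interior points: signed turning angle between the
    two adjacent segments divided by their mean length.
    Positive values mean the centerline turns counterclockwise."""
    curv = []
    for i in range(1, len(xs) - 1):
        ax, ay = xs[i] - xs[i - 1], ys[i] - ys[i - 1]
        bx, by = xs[i + 1] - xs[i], ys[i + 1] - ys[i]
        turn = math.atan2(ax * by - ay * bx, ax * bx + ay * by)
        ds = 0.5 * (math.hypot(ax, ay) + math.hypot(bx, by))
        curv.append(turn / ds)
    return curv


if __name__ == "__main__":
    # Points on a circle of radius 2, so curvature should be near 0.5.
    angles = [0.01 * k for k in range(100)]
    xs = [2.0 * math.cos(a) for a in angles]
    ys = [2.0 * math.sin(a) for a in angles]
    print(streamwise_coordinate(xs, ys)[-1])
    print(discrete_curvature(xs, ys)[0])
```

If the inputs are already divided by channel width, as the X and Y columns are, the resulting S and C are dimensionless as well.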

published: 2025-08-20

Comparative economic analysis between bioenergy and forage types of switchgrass for sustainable biofuel feedstock production: A DEA and cost-benefit analysis approach

Arshad, Muhammad Umer; Archer, David; Wasonga, Daniel; Namoi, Nictor; Boe, Arvid; Rob, Mitchell; Heaton, Emily; Khanna, Madhu; Lee, DoKyoung (2025)

The compiled datasets include detailed costs for switchgrass production, categorized into establishment, maintenance, and harvesting expenses, along with revenue calculations. Costs were gathered from multiple sources and adjusted for inflation, focusing on farm-gate profitability, excluding fixed costs and transportation. All financial data is provided per hectare. The dataset was used to evaluate the economic performance of forage- and bioenergy-type switchgrass cultivars and their response to nitrogen fertilization across diverse marginal environments in the U.S. Midwest. Data Envelopment Analysis (DEA) and cost-benefit analysis were employed to assess the efficiency and profitability of 23 different cultivar and fertilization rate combinations over five years.

published: 2025-11-25

Diel Activity of Pumas at Kills

Allen, Maximilian (2025)

The diel activity of study animals while feeding at their kills in the Santa Cruz Mountains of California.

keywords: Santa Cruz

published: 2026-01-08

Convolutional Neural Network-based Sequence-to-Expression Prediction Tool (CoNSEPT)

Dibaeinia, Payam; Sinha, Saurabh (2026)

CoNSEPT is a tool to predict gene expression in various cis and trans contexts. Inputs to CoNSEPT are enhancer sequence, transcription factor levels in one or many trans conditions, TF motifs (PWMs), and any prior knowledge of TF-TF interactions.

keywords: software; gene expression

published: 2020-08-18

Replication Data for "No buzz for bees: Media coverage of pollinator decline"

Althaus, Scott; Berenbaum, May; Jordan, Jenna; Shalmon, Dan (2020)

These data and code enable replication of the findings and robustness checks in "No buzz for bees: Media coverage of pollinator decline," published in Proceedings of the National Academy of Sciences of the United States of America (2020). In this paper, we find that although widespread declines in insect biomass and diversity are increasing concern within the scientific community, it remains unclear whether attention to pollinator declines has also increased within information sources serving the general public. Examining patterns of journalistic attention to the pollinator population crisis can also inform efforts to raise awareness about the importance of declines of insect species providing ecosystem services beyond pollination. We used the Global News Index developed by the Cline Center for Advanced Social Research at the University of Illinois at Urbana-Champaign to track news attention to pollinator topics in nearly 25 million news items published by two American national newspapers and four international wire services over the past four decades. We provide a link to documentation of the Global News Index in the "relationships with articles, code, o. We found vanishingly low levels of attention to pollinator population topics relative to coverage of climate change, which we use as a comparison topic. In the most recent subset of ~10 million stories published from 2007 to 2019, 1.39% (137,086 stories) refer to climate change/global warming, while only 0.02% (1,780) refer to pollinator populations in all contexts and just 0.007% (679) refer to pollinator declines. Substantial increases in news attention were detectable only in U.S. national newspapers. We also find that while climate change stories appear primarily in newspaper “front sections”, pollinator population stories remain largely marginalized in “science” and “back section” reports. 
At the same time, news reports about pollinator populations increasingly link the issue to climate change, which might ultimately help raise public awareness to effect needed policy changes.

keywords: News Coverage; Text Analytics; Insects; Pollinator; Cline Center; Cline Center for Advanced Social Research; political; social; political science; Global News Index; Archer; news; mass communication; journalism

published: 2022-06-20

A Prototype Gutenberg-HathiTrust Sentence-level Parallel Corpus

Jiang, Ming; Dubnicek, Ryan; Worthey, Glen; Underwood, Ted; Downie, J. Stephen (2022)

This is a sentence-level parallel corpus in support of research on OCR quality. The source data comes from: (1) Project Gutenberg for human-proofread "clean" sentences; and, (2) HathiTrust Digital Library for the paired sentences with OCR errors. In total, this corpus contains 167,079 sentence pairs from 189 sampled books in four domains (i.e., agriculture, fiction, social science, world war history) published from 1793 to 1984. There are 36,337 sentences that have two OCR views paired with each clean version. In addition to sentence texts, this corpus also provides the location (i.e., sentence and chapter index) of each sentence in the Gutenberg volume it belongs to.

keywords: sentence-level parallel corpus; optical character recognition; OCR errors; Project Gutenberg; HathiTrust Digital Library; digital libraries; digital humanities

published: 2022-04-29

Biological and Simulated datasets for testing the SCAMPP framework for phylogenetic placement methods

Wedell, Eleanor; Warnow, Tandy (2022)

Thank you for using these datasets! These files contain trees and reference alignments, as well as the selected query sequences for testing phylogenetic placement methods against and within the SCAMPP framework. There are four datasets from three different sources, each containing their source alignment and "true" tree, any estimated trees that may have been generated, and any re-estimated branch lengths that were created to be used with their requisite phylogenetic placement method. Three biological datasets (16S.B.ALL, PEWO/LTP_s128_SSU, and PEWO/green85) and one simulated dataset (nt78) are included. See README.txt in each file for more information.

keywords: Phylogenetic Placement; Phylogenetics; Maximum Likelihood; pplacer; EPA-ng

published: 2024-03-01

Data for The location of large-scale soil moisture anomalies affects moisture transport and precipitation over southeastern South America

Chen, Chu-Chun; Dominguez, Francina (2024)

This dataset contains model output from the Community Earth System Model, Version 1 (CESM1; Hurrell et al., 2013) and variables from the European Centre for Medium-Range Weather Forecasts (ECMWF) Reanalysis v5 (ERA5; Hersbach et al., 2020). These data were used for analysis in “The location of large-scale soil moisture anomalies affects moisture transport and precipitation over southeastern South America”, published in Geophysical Research Letters. Acknowledgments: This work was supported by NSF Award AGS-1852709. We acknowledge high-performance computing support from Cheyenne (doi:10.5065/D6RX99HX) provided by NCAR's Computational and Information Systems Laboratory, sponsored by the NSF. We thank Dr. Haiyan Teng for providing guidance on setting up the CESM experiments and offering valuable advice. References: Hersbach H, Bell B, Berrisford P, et al. The ERA5 global reanalysis. Q J R Meteorol Soc. 2020; 146: 1999–2049. https://doi.org/10.1002/qj.3803 Hurrell, J. W., and Coauthors, 2013: The Community Earth System Model: A Framework for Collaborative Research. Bull. Amer. Meteor. Soc., 94, 1339–1360, https://doi.org/10.1175/BAMS-D-12-00121.1

keywords: atmospheric sciences; climate modeling; land-atmosphere interactions; soil moisture; regional atmospheric circulation; southeastern South America

published: 2025-08-27

Data for Identifying the best high-biomass sorghum hybrids based on biomass yield potential and feedstock quality affected by nitrogen fertility management under various environments

Jang, Chunhwa; Namoi, Nictor; Lee, Jung Woo; Becker, Talon; Rooney, William; Lee, DoKyoung (2025)

Data were collected from agronomy fields in Urbana and Ewing, IL, during the 2022 and 2023 growing seasons. The dataset includes dry biomass yield, nitrogen, phosphorus, and potassium concentrations and removals, and chemical composition elements (cellulose, hemicellulose, lignin, and soluble fractions) for 13 high-biomass sorghum hybrids. data_sharing.xlsx contains 20 columns and 104 rows. Below is the explanation of all variables in the file:

Year: 2022; 2023
Location: Urbana, IL; Ewing, IL
N rate (kg-N/ha): 0; 112
Hybrid #: H1-H13
Pedigree: Pedigree for 13 hybrids
Dry biomass yield (Mg/ha): Aboveground dry biomass yield
N (g/kg): Nitrogen concentration in plant tissue
P (g/kg): Phosphorus concentration in plant tissue
K (g/kg): Potassium concentration in plant tissue
N (kg/ha): Nitrogen removal by aboveground biomass
P (kg/ha): Phosphorus removal by aboveground biomass
K (kg/ha): Potassium removal by aboveground biomass
Cellulose (g/kg): Cellulose concentration in plant tissue
Hemicellulose (g/kg): Hemicellulose concentration in plant tissue
Lignin (g/kg): Lignin concentration in plant tissue
Soluble (g/kg): Soluble concentration in plant tissue
Cellulose (Mg/ha): Cellulose content in aboveground biomass
Hemicellulose (Mg/ha): Hemicellulose content in aboveground biomass
Lignin (Mg/ha): Lignin content in aboveground biomass
Soluble (Mg/ha): Soluble content in aboveground biomass

keywords: high-biomass sorghum hybrids; yield potential; environmental adaptability; feedstock quality; nutrient removal; N fertilization
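
The stated size of data_sharing.xlsx in the entry above (104 rows) is consistent with one row per combination of year, location, N rate, and hybrid: 2 × 2 × 2 × 13 = 104. The sketch below enumerates that grid. The one-row-per-combination layout is an assumption inferred from the counts, not something the description states, and the function name is hypothetical.

```python
from itertools import product

YEARS = [2022, 2023]
LOCATIONS = ["Urbana, IL", "Ewing, IL"]
N_RATES_KG_PER_HA = [0, 112]
HYBRIDS = [f"H{i}" for i in range(1, 14)]  # H1 through H13


def expected_design_rows():
    """Every (year, location, N rate, hybrid) combination, assuming a
    full factorial layout with one row per combination."""
    return list(product(YEARS, LOCATIONS, N_RATES_KG_PER_HA, HYBRIDS))


if __name__ == "__main__":
    rows = expected_design_rows()
    print(len(rows))
    print(rows[0])
```

A list like this can serve as a completeness check after loading the spreadsheet: any combination missing from the file, or present more than once, would show that the assumed layout does not hold.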

published: 2025-09-26

Data from Biodiesel Production from Engineered Sugarcane Lipids under Uncertain Feedstock Compositions: Process Design and Techno-Economic Analysis

Arora, Amit; Singh, Vijay (2025)

In this study, different process schemes were designed and evaluated for biodiesel production from engineered cane lipids with uncertain fatty acid compositions. Four different process schemes were compared under (i) thermal glycerolysis and (ii) enzymatic glycerolysis approaches. The comparison was based on the biodiesel yield and economic indicators such as the net present value (NPV) and the minimum selling price (MSP) of biodiesel. A scheme with polar lipid separation under thermal glycerolysis resulted in the maximum NPV ($96.5 million) and minimum MSP ($1107/ton biodiesel). Through local sensitivity analysis, it was concluded that the cane lipid percentage is the most significant factor influencing process economics. A conjoint analysis of the lipid procurement price and cane lipid percent suggested that 15% cane lipids with a low lipid procurement price ($0.536/kg) results in a positive NPV. When the cane lipid price is higher (>$0.80/kg), a 20% lipid content should be considered to achieve a positive NPV. At 20% cane lipids, the worst-case and best-case scenarios were evaluated by analyzing the interplay of the three most important parameters. The best-case scenario revealed that the minimum NPV under any process scheme could yield more than $100 million (or MSP: $0.80/L), and the worst-case analysis showed that losses incurred by the plant could be as high as $80 million (MSP: $1.36/L). A Monte Carlo simulation indicated that there is a 70% chance of the plant being profitable (NPV > 0).

keywords: Conversion;Economics;Feedstock Bioprocessing;Modeling

published: 2025-10-21

Data for Transformation and Gene Editing in the Bioenergy Grass Miscanthus

Trieu, Anthony; Belaffif, Mohammad B.; Hirannaiah, Pradeepa; Manjunatha, Shilpa; Wood, Rebekah; Bathula, Yokshitha; Billingsley, Rebecca L.; Arpan, Anjali; Sacks, Erik; Clemente, Tom; Moose, Stephen; Reichert, Nancy A.; Swaminathan, Kankshita (2025)

Miscanthus, a C4 member of the family Poaceae, is a promising perennial crop for bioenergy, renewable bioproducts, and carbon sequestration. Species of interest include nothospecies Miscanthus x giganteus and its parental species M. sacchariflorus and M. sinensis. Use of biotechnology-based procedures to genetically improve miscanthus has, to date, only included plant transformation procedures for introduction of exogenous genes into the host genome at random, non-targeted sites.

keywords: Feedstock Production;Biomass Analytics;Genomics

published: 2023-07-14

Pollen of Podocarpus (Podocarpaceae): Airyscan confocal superresolution images

Punyasena, Surangi W.; Urban, Michael A.; Adaime, Marc-Elie; Romero, Ingrid; Jaramillo, Carlos (2023)

This dataset includes a total of 300 images of 45 extant species of Podocarpus (Podocarpaceae) and nine images of fossil specimens of the morphogenus Podocarpidites. The goal of this dataset is to capture the diversity of morphology within the genus and create an image database for training machine learning models. The images were taken using Airyscan confocal superresolution microscopy at 630x magnification (63x/NA 1.4 oil DIC). The images are in the CZI file format. They can be opened using Zeiss proprietary software (Zen, Zen lite) or open microscopy software, such as ImageJ. More information on how to open CZI files can be found here: [https://www.zeiss.com/microscopy/us/products/software/zeiss-zen/czi-image-file-format.html] Please cite this dataset and the listed publications when using these images.

keywords: optical superresolution microscopy; Zeiss Airyscan; CZI images; conifer; saccate pollen; Podocarpus; Podocarpidites; Smithsonian Tropical Research Institute

published: 2019-05-31

Frequent pattern subject transactions from the University of Illinois Library (2016 - 2018)

Hahn, Jim (2019)

The data are provided to illustrate methods for evaluating systematic transactional data reuse in machine learning. A library account-based recommender system was developed by applying machine learning to transactional data comprising 383,828 transactions (or check-outs) sourced from a large multi-unit research library. The machine learning process utilized the FP-growth algorithm over the subject metadata associated with physical items that were checked out together in the library. The purpose of this research is to evaluate the results of systematic transactional data reuse in machine learning. The analysis herein contains a large-scale network visualization of 180,441 subject association rules and corresponding node metrics.
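To make the idea of a subject association rule concrete, here is a minimal sketch that computes support and confidence for pairwise rules by brute-force counting. It is not FP-growth and not the author's pipeline (the keywords indicate WEKA was used); the function name, thresholds, and sample transactions are hypothetical and chosen only for illustration.

```python
from collections import Counter
from itertools import combinations


def pairwise_rules(transactions, min_support=0.5, min_confidence=0.6):
    """Return (antecedent, consequent, support, confidence) tuples.

    Brute-force pair counting, shown only to illustrate what an
    association rule is; FP-growth finds the same frequent itemsets
    far more efficiently and for itemsets of any size.
    """
    n = len(transactions)
    item_counts = Counter()
    pair_counts = Counter()
    for transaction in transactions:
        subjects = sorted(set(transaction))  # ignore duplicates within a check-out
        item_counts.update(subjects)
        pair_counts.update(combinations(subjects, 2))

    rules = []
    for (a, b), together in pair_counts.items():
        support = together / n
        if support < min_support:
            continue
        # Each frequent pair can yield a rule in both directions.
        for antecedent, consequent in ((a, b), (b, a)):
            confidence = together / item_counts[antecedent]
            if confidence >= min_confidence:
                rules.append((antecedent, consequent, support, confidence))
    return sorted(rules)


# Hypothetical check-out transactions (subject headings borrowed together).
sample = [
    ["Chemistry", "Physics"],
    ["Chemistry", "Physics", "Mathematics"],
    ["Physics", "Mathematics"],
    ["Chemistry"],
]
for rule in pairwise_rules(sample):
    print(rule)
```

Each resulting rule can be read as a directed edge between two subject nodes, which is the kind of structure a network visualization of association rules would display.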

keywords: evaluating machine learning; network science; FP-growth; WEKA; Gephi; personalization; recommender systems

published: 2020-06-26

Data from: Quantifying Errors in the Aerosol Mixing-State Index Based on Limited Particle Sample Size

Gasparik, Jessica T.; Ye, Qing; Curtis, Jeffrey H.; Presto, Albert A.; Donahue, Neil M.; Sullivan, Ryan C.; West, Matthew; Riemer, Nicole (2020)

This dataset contains the PartMC-MOSAIC simulations used in the article "Quantifying Errors in the Aerosol Mixing-State Index Based on Limited Particle Sample Size". The output data from the 1000 simulations are organized into a series of archived folders, each containing 100 scenarios. Within each scenario directory are 25 NetCDF files, which are the hourly output of a PartMC-MOSAIC simulation containing all information regarding the environment, particle, and gas state. This dataset was used to investigate the impact of sample size on determining aerosol mixing state. These data may also be useful for testing different types of estimators.
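After extracting an archive, a quick integrity check can confirm that each scenario directory holds the 25 NetCDF files described above. The following is a minimal sketch under stated assumptions: the `.nc` file extension, the flat layout (scenario directories directly under one root), and the function name are hypothetical and not taken from the dataset documentation.

```python
from pathlib import Path

# Stated in the dataset description: 25 hourly NetCDF files per scenario.
EXPECTED_FILES_PER_SCENARIO = 25


def find_incomplete_scenarios(root, pattern="*.nc"):
    """Map scenario directory name -> file count for directories that do
    not contain the expected number of NetCDF files.

    Assumes scenario directories sit directly under `root` and that the
    files use the ".nc" extension (both are assumptions).
    """
    problems = {}
    for scenario in sorted(p for p in Path(root).iterdir() if p.is_dir()):
        count = sum(1 for _ in scenario.glob(pattern))
        if count != EXPECTED_FILES_PER_SCENARIO:
            problems[scenario.name] = count
    return problems
```

An empty dictionary means every scenario directory under the chosen root has the expected 25 files.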

keywords: Atmospheric aerosols; single-particle measurements; sampling uncertainty; NetCDF

published: 2022-07-19

Effect of Micro-patterned Mucin on Quinolone and Rhamnolipid Profiles of Mucoid Pseudomonas aeruginosa under Antibiotic Stress

Parmar, Dharmeshkumar; Jia, Jin; Shrout, Joshua; Sweedler, Jonathan; Bohn, Paul (2022)

#### Details of Pseudomonas aeruginosa biofilm dataset ####

----------------*Folder Structure*-------------------------------------
This dataset contains peak intensity tables extracted from mass spectrometry imaging (MSI) data using the tools SCiLS and MSI reader. There are 2 folders in "MSI-Data-Paeruginosa-biofilms-UIUC-DP-JVS-July2022.zip"; each folder contains 3 sub-folders as listed below.
1. PellicleBiofilms-and-Supernatant [Pellicle biofilms collected from the air-liquid interface and spent supernatant medium after a 96 h incubation period]:
(1) Full-Scan-Data-96h
(2) MSMS-data-from-C7-Quinolones-96h
(3) MSMS-data-from-C9-Quinolones-96h
2. StaticBiofilms [Static biofilms grown on mucin surface]:
(1) Full-Scan-Data
(2) MSMS-data-from-C7-Quinolones
(3) MSMS-data-from-C9-Quinolones

----------------*File name*----------------------------------------------
Sample information is included in the file names for easy identification and processing. Attributes covered in file names are explained in the example below.
*Example file name "Rep1-Stat-FRD1-mPat-48-FS"*
~ Each unit of information is separated by "-"
~ Unit 1 - "Rep1" - Biological replicate (Rep1, Rep2, and Rep3)
~ Unit 2 - "Stat" - Sample type (Stat = Static biofilm, Pel = Pellicle biofilm, Sup = Supernatant)
~ Unit 3 - "FRD1" - Strain (FRD1 = Mucoid strain, PAO1C = Non-mucoid strain)
~ Unit 4 - "mPat" - Type of mucin surface used (mPat = patterned mucin surface, mUni = uniform mucin surface)
~ Unit 5 - "48" - Sample time point (hours = 48, 72, 96)
~ Unit 6 - "FS" - Scan type used in MSI (FS = high resolution full-scan, 260 = targeted MS/MS of C7 quinolones (m/z 260), 288 = targeted MS/MS of C9 quinolones (m/z 288))

----------------*File structure*------------------------------------------
All MSI data have been exported to CSV format. Each CSV file contains information about scan number, coordinates (x, y, z), m/z values, extraction window (absolute), and corresponding intensities in the form of a matrix.

----------------*End of Information*--------------------------------------
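The file-name convention described above lends itself to automated parsing. The following is a minimal sketch, assuming every file name carries all six units in the documented order and may end in ".csv"; the function and class names are hypothetical and not part of the dataset.

```python
from dataclasses import dataclass

# Code tables copied from the file-name description above.
SAMPLE_TYPES = {"Stat": "Static biofilm", "Pel": "Pellicle biofilm", "Sup": "Supernatant"}
STRAINS = {"FRD1": "Mucoid strain", "PAO1C": "Non-mucoid strain"}
SURFACES = {"mPat": "patterned mucin surface", "mUni": "uniform mucin surface"}
SCAN_TYPES = {
    "FS": "high resolution full-scan",
    "260": "targeted MS/MS of C7 quinolones (m/z 260)",
    "288": "targeted MS/MS of C9 quinolones (m/z 288)",
}
TIME_POINTS_H = (48, 72, 96)


@dataclass(frozen=True)
class SampleInfo:
    replicate: int
    sample_type: str
    strain: str
    surface: str
    time_h: int
    scan_type: str


def _lookup(table, code, label):
    """Translate a code via its table, raising ValueError on unknown codes."""
    if code not in table:
        raise ValueError(f"unknown {label} code: {code!r}")
    return table[code]


def parse_sample_name(name):
    """Parse a name such as 'Rep1-Stat-FRD1-mPat-48-FS' into a SampleInfo."""
    stem = name[:-4] if name.lower().endswith(".csv") else name
    parts = stem.split("-")
    if len(parts) != 6:
        raise ValueError(f"expected 6 units separated by '-', got {len(parts)}: {name!r}")
    rep, sample_type, strain, surface, hours, scan = parts
    if not (rep.startswith("Rep") and rep[3:].isdigit()):
        raise ValueError(f"unrecognized replicate unit: {rep!r}")
    if not hours.isdigit() or int(hours) not in TIME_POINTS_H:
        raise ValueError(f"unexpected time point: {hours!r}")
    return SampleInfo(
        replicate=int(rep[3:]),
        sample_type=_lookup(SAMPLE_TYPES, sample_type, "sample type"),
        strain=_lookup(STRAINS, strain, "strain"),
        surface=_lookup(SURFACES, surface, "mucin surface"),
        time_h=int(hours),
        scan_type=_lookup(SCAN_TYPES, scan, "scan type"),
    )


print(parse_sample_name("Rep1-Stat-FRD1-mPat-48-FS"))
```

Raising an error on unknown codes, rather than passing them through, makes deviations from the documented convention visible early when processing many files in batch.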

keywords: mass spectrometry imaging (MSI); biofilm; antibiotic resistance; Pseudomonas aeruginosa; quorum sensing; rhamnolipids