Illinois Data Bank Dataset Search Results
Results
published:
2025-10-31
Lopes, Daiane; Dien, Bruce; Hector, Ronald; Singh, Vijay; Thompson, Stephanie R.; Slininger, Patricia J.; Boundy-Mills, Kyria; Jagtap, Sujit; Rao, Christopher V.
(2025)
Rhodotorula toruloides is being developed for the use in industrial biotechnology processes because of its favorable physiology. This includes its ability to produce and store large amounts of lipids in the form of intracellular lipid bodies. Nineteen strains were characterized for mating type, ploidy, robustness for growth, and accumulation of lipids on inhibitory switchgrass hydrolysate (SGH). Mating type was determined using a novel polymerase chain reaction (PCR)-based assay, which was validated using the classical microscopic test. Three of the strains were heterozygous for mating type (A1/A2). Ploidy analysis revealed a complex pattern. Two strains were triploid, eight haploid, and eight either diploid or aneuploid. Two of the A1/A2 strains were compared to their parents for growth on 75%v/v concentrated SGH. The A1/A2 strains were much more robust than the parental strains, which either did not grow or had extended lag times. The entire set was evaluated in 60%v/v SGH batch cultures for growth kinetics and biomass and lipid production. Lipid titers were 2.33–9.40 g/L with a median of 6.12 g/L, excluding the two strains that did not grow. Lipid yields were 0.032–0.131 (g/g) and lipid contents were 13.5–53.7% (g/g). Four strains had significantly higher lipid yields and contents. One of these strains, which had among the highest lipid yield in this study (0.131 ± 0.007 g/g), has not been previously described in the literature.
keywords:
Conversion;Hydrolysate;Lipidomics
published:
2025-09-22
The files in this dataset include the now-public domain full raw text and illustrations for the novel Gentlemen Prefer Blondes (GBP) by Anita Loos, and files comparing the two published versions of the novel in 1925, one in Harper's Bazar magazine and the other in book format by Boni & Liveright. These files comprise the underlying data for the scholarly digital edition of the novel edited by Daniel G. Tracy. The full citation for the publication, including the DOI link for those wishing access the text, is: Loos, Anita. Gentlemen Prefer Blondes. Edited by Daniel G. Tracy, Critical Edition. Windsor & Downs Press, 2025. https://doi.org/10.21900/wd.13
keywords:
literature; textual collation; digital editions; American Literature
published:
2017-02-28
Leesburg, VA to Indianapolis, Indiana:
Sampling Rate: 0.1 Hz
Total Travel Time: 31100007 ms or 518 minutes or 8.6 hours
Distance Traveled: 570 miles via I-70
Number of Data Points: 3112
Device used: Samsung Galaxy S4
Date Recorded: 2017-01-15
Parameters Recorded:
* ACCELEROMETER X (m/s²)
* ACCELEROMETER Y (m/s²)
* ACCELEROMETER Z (m/s²)
* GRAVITY X (m/s²)
* GRAVITY Y (m/s²)
* GRAVITY Z (m/s²)
* LINEAR ACCELERATION X (m/s²)
* LINEAR ACCELERATION Y (m/s²)
* LINEAR ACCELERATION Z (m/s²)
* GYROSCOPE X (rad/s)
* GYROSCOPE Y (rad/s)
* GYROSCOPE Z (rad/s)
* LIGHT (lux)
* MAGNETIC FIELD X (microT)
* MAGNETIC FIELD Y (microT)
* MAGNETIC FIELD Z (microT)
* ORIENTATION Z (azimuth °)
* ORIENTATION X (pitch °)
* ORIENTATION Y (roll °)
* PROXIMITY (i)
* ATMOSPHERIC PRESSURE (hPa)
* Relative Humidity (%)
* Temperature (F)
* SOUND LEVEL (dB)
* LOCATION Latitude
* LOCATION Longitude
* LOCATION Altitude (m)
* LOCATION Altitude-google (m)
* LOCATION Altitude-atmospheric pressure (m)
* LOCATION Speed (kph)
* LOCATION Accuracy (m)
* LOCATION ORIENTATION (°)
* Satellites in range
* GPS NMEA
* Time since start in ms
* Current time in YYYY-MO-DD HH-MI-SS_SSS format
Quality Notes:
There are some things to note about the quality of this data set that you may want to consider while doing preprocessing. This dataset was taken continuously but had multiple stops to refuel (without the data recording ceasing). This can be removed by parsing out all data that has a speed of 0. The mount for this dataset was fairly stable (as can be seen by the consistent orientation angle throughout the dataset). It was mounted tightly between two seats in the back of the vehicle. Unfortunately, the frequency for this dataset was set fairly low at one per ten seconds.
keywords:
smartphone; sensor; driving; accelerometer; gyroscope; magnetometer; gps; nmea; barometer; satellite; temperature; humidity
published:
2025-09-19
Xue, Pu; Si, Tong; Mishra, Shekhar; Zhang, Linzixuan; Choe, Kisurb; Sweedler, Jonathan; Zhao, Huimin
(2025)
Microbial cell factories have been extensively engineered to produce free fatty acids (FFAs) as key components of crucial nutrients, soaps, industrial chemicals, and fuels. However, our ability to control the composition of microbially synthesized FFAs is still limited, particularly, for producing medium‐chain fatty acids (MCFAs). This is mainly due to the lack of high‐throughput approaches for FFA analysis to engineer enzymes with desirable product specificity. Here we report a mass spectrometry (MS)‐based method for rapid profiling of MCFAs in Saccharomyces cerevisiae by using membrane lipids as a proxy. In particular, matrix‐assisted laser desorption/ionization time‐of‐flight (MALDI‐ToF) MS was used to detect shorter acyl chain phosphatidylcholines from membrane lipids and a higher m/z peak ratio at 730 and 758 was used as an indication for improved MCFA production. This colony‐based method can be performed at a rate of ~2 s per sample, representing a substantial improvement over gas chromatography‐MS (typically >30 min per sample) as the gold standard method for FFA detection. To demonstrate the power of this method, we performed site‐saturation mutagenesis of the yeast fatty acid synthase and identified nine missense mutations that resulted in improved MCFA production relative to the wild‐type strain. Colony‐based MALDI‐ToF MS screening provides an effective approach for engineering microbial fatty acid compositions in a high‐throughput manner.
keywords:
Conversion;Lipidomics;Metabolomics
published:
2025-12-02
Zhou, Yu; Mirts, Evan N.; Yook, Sangdo; Waugh, Matthew; Martini, Rachel; Jin, Yong-Su; Lu, Yi
(2025)
Engineering enzymes with novel reactivity and applying them in metabolic pathways to produce valuable products are quite challenging due to the intrinsic complexity of metabolic networks and the need for high in vivo catalytic efficiency. Triacetic acid lactone (TAL), naturally generated by 2-pyrone synthase (2PS), is a platform molecule that can be produced via microbial fermentation and further converted into value-added products. However, these conversions require extra synthetic steps under harsh conditions. We herein report a biocatalytic system for direct generation of TAL derivatives under mild conditions with controlled chemoselectivity by rationally engineering the 2PS active site and then rewiring the biocatalytic pathway in the metabolic network of E. coli to produce high-value products, such as kavalactone precursors, with yields up to 17 mg/L culture. Computer modeling indicates sterics and hydrogen-bond interactions play key roles in tuning the selectivity, efficiency, and yield.
keywords:
Conversion;Metabolomics
published:
2017-05-01
Indianapolis Int'l Airport to Urbana:
Sampling Rate: 2 Hz
Total Travel Time: 5901534 ms or 98.4 minutes
Number of Data Points: 11805
Distance Traveled: 124 miles via I-74
Device used: Samsung Galaxy S6
Date Recorded: 2016-11-27
Parameters Recorded:
* ACCELEROMETER X (m/s²)
* ACCELEROMETER Y (m/s²)
* ACCELEROMETER Z (m/s²)
* GRAVITY X (m/s²)
* GRAVITY Y (m/s²)
* GRAVITY Z (m/s²)
* LINEAR ACCELERATION X (m/s²)
* LINEAR ACCELERATION Y (m/s²)
* LINEAR ACCELERATION Z (m/s²)
* GYROSCOPE X (rad/s)
* GYROSCOPE Y (rad/s)
* GYROSCOPE Z (rad/s)
* LIGHT (lux)
* MAGNETIC FIELD X (microT)
* MAGNETIC FIELD Y (microT)
* MAGNETIC FIELD Z (microT)
* ORIENTATION Z (azimuth °)
* ORIENTATION X (pitch °)
* ORIENTATION Y (roll °)
* PROXIMITY (i)
* ATMOSPHERIC PRESSURE (hPa)
* SOUND LEVEL (dB)
* LOCATION Latitude
* LOCATION Longitude
* LOCATION Altitude (m)
* LOCATION Altitude-google (m)
* LOCATION Altitude-atmospheric pressure (m)
* LOCATION Speed (kph)
* LOCATION Accuracy (m)
* LOCATION ORIENTATION (°)
* Satellites in range
* GPS NMEA
* Time since start in ms
* Current time in YYYY-MO-DD HH-MI-SS_SSS format
Quality Notes:
There are some things to note about the quality of this data set that you may want to consider while doing preprocessing. This dataset was taken continuously as a single trip, no stop was made for gas along the way making this a very long continuous dataset. It starts in the parking lot of the Indianapolis International Airport and continues directly towards a gas station on Lincoln Avenue in Urbana, IL. There are a couple parts of the trip where the phones orientation had to be changed because my navigation cut out. These times are easy to account for based on Orientation X/Y/Z change. I would also advise cutting out the first couple hundred points or the points leading up to highway speed. The phone was mounted in the cupholder in the front seat of the car.
keywords:
smartphone; sensor; driving; accelerometer; gyroscope; magnetometer; gps; nmea; barometer; satellite
published:
2025-08-08
Bhatnagar, Nikita; Chung, Sarah S.; Hodge, John; Kim, Sang Yeol; Sands, Mia; Leakey, Andrew D. B.; Ort, Donald R.; Burgess, Steven J.
(2025)
Rubisco activase is an ATP-dependent chaperone that facilitates dissociation of inhibitory sugar phosphates from the catalytic sites of Rubisco during photosynthesis. In Arabidopsis, Rubisco activase is negatively regulated by dark-dependent phosphorylation of Thr78. The prevalence of Thr78 in Rubisco activase was investigated across sequences from 91 plant species, finding that 29 (∼32%) species shared a threonine in the same position. Analysis of seven C3 species with an antibody raised against a Thr78 phospho-peptide demonstrated that this position is phosphorylated in multiple genera. However, light-dependent dephosphorylation of Thr78 was observed only in Arabidopsis. Further, phosphorylation of Thr78 could not be detected in any of the four C4 grass species examined. The results suggest that despite conservation of Thr78 in Rubisco activase from a wide range of species, a regulatory role for phosphorylation at this site is more limited. This provides a case study for how variation in post-translational regulation can amplify functional divergence across the phylogeny of plants beyond what is explained by sequence variation in a metabolically important protein.
keywords:
photosynthesis; sorghum
published:
2025-10-16
Trinklein, Timothy; Lam, Fan; Sweedler, Jonathan
(2025)
This dataset contains MALDI imaging and fluorescence imaging data of 5xFAD mice and control animals. Processed data are provided at either MATLAB arrays or Bruker .slx / .sbd files compatible with SCiLS Lab.
- Animal_1_5xFAD_s1 and s2 : A MATLAB file of 50 micron spatial resolution imaging of whole brain slice from a 5xFAD animal.
- Slide28_Animal1_stitch_channels__Thioflavin S : A PNG file of the corresponding Thioflavin S- stained fluorescence image obtained post-MSI from the same section.
- Slide28_Animal1_stitch_merged : A PNG file of the corresponding merged imaged including brightfield, Thioflavin S (GFP channel) and Hoechst staining (DAPI channel) used for image registration
- mz_bins_use_neg.mat : A MATLAB array of the m/z channels all MSI images (whole brain slice, 50 micron spatial resolution) were binned to in order to enable comparison
- Animal3_S18_HR.mat : A MATLAB array of high-spatial-resolution (5 micron) imaging of a 5xFAD mouse hippocampus and cortex. Due to the large dataset, 22 m/z channels are included.
- Animal5_S17_HR.mat : A MATLAB array of high-spatial-resolution (5 micron) imaging of a wildtype mouse hippocampus and cortex
- mz_features_22.mat : A MATLAB array of the 22 m/z channels included in the high spatial resolution imaging data
- MALDI_Animal03_5xFAD_10um_neg.zip : A folder containing .slx and .sbd MALDI imaging data of a 5xFAD mouse (whole-brain section) at a 10 micron pitch (used in Figure 5)
- MALDI_animal3_5xFAD_5um_neg.zip : A folder containing .slx and .sbd MALDI imaging data of a 5xFAD mouse (region of interest) at a 5
micron pitch (used in Figure 4)
- TIMS_animal1_20um_5xfad_neg.zip : A folder containing .slx and .sbd MALDI TIMS imaging data of a 5xFAD mouse (region of interest) at a 20 micron pitch (used in Figure 7)
- TIMS_animal5_s13_20um_wt_neg.zip : A folder containing .slx and .sbd MALDI TIMS imaging data of a wild typemouse (region of interest) at a 20 micron pitch (used in Figure 7)
- Region Masks BMC.zip : A folder containing masks from brain regions of spatial transcriptomics data from the BMC Genomics data set
- Region Masks Cell.zip : A folder containing masks from brain regions of spatial transcriptomics data from the Cell data set
keywords:
amyloid beta; 5xfad, lipids; maldi;
published:
2025-12-23
Crawford, Reed; Dodd, Luke; O'Keefe, Joy
(2025)
This dataset contains the raw skin temperature data recorded from female Indiana bats (Myotis sodalis) recorded in Indiana and Kentucky from April through August of 2021. This dataset also contains the raw daily heterothermic response variable data that were used in this analysis. This dataset also includes the raw ambient temperature weather data recorded at our Indiana and Kentucky field sites. Lastly, this dataset contains the R script needed to analyze the above dataset.
keywords:
Artificial roost; bat box; conservation; physiology; thermoregulation; torpor
published:
2025-12-29
Wu, Yulun; Kudeki, Erhan
(2025)
Arecibo ISR CLP ion-line spectra obtained from RI receiver with 500 kHz bandwidth and 120-640 km altitude range, experiment dates September 23-26, 2016. Used for Mitigation of ion-temperature/composition ambiguity in the inversion of F-region ion-line spectra measured at Arecibo using coded long pulses.
keywords:
Remote sensing; Incoherent scatter radar; Arecibo Observatory
published:
2024-11-13
Tang, Zhichu; Chen, Wenxiang; Yin, Kaijun; Busch, Robert; Hou, Hanyu; Lin, Oliver; Lyu, Zhiheng; Zhang, Cheng; Yang, Hong; Zuo, Jian-Min ; Chen, Qian
(2024)
These datasets are for the four-dimensional scanning transmission electron microscopy (4D-STEM) and electron energy loss spectroscopy (EELS) experiments for cathode nanoparticles at different states. The raw 4D-STEM experiment datasets were collected by TEM image & analysis software (FEI) and were saved as SER files. The raw 4D-STEM datasets of SER files can be opened and viewed in MATLAB using our analysis software package of imToolBox available at https://github.com/flysteven/imToolBox. The raw EELS datasets were collected by DigitalMicrograph software and were saved as DM4 files. The raw EELS datasets can be opened and viewed in DigitalMicrograph software or using our analysis codes available at https://github.com/chenlabUIUC/OrientedPhaseDomain. All the datasets are from the work "Nanoscale Stacking Fault Engineering and Mapping in Spinel Oxides for Reversible Multivalent Ion Insertion" (2024).
The 4D-STEM experiment data include four example datasets for cathode nanoparticles collected at pristine and discharged states. Each dataset contains a stack of diffraction patterns collected at different probe positions scanned across the cathode nanoparticle.
1. Pristine untreated nanoparticle: "Pristine U-NP.ser"
2. Pristine 200ºC heated nanoparticle: "Pristine H200-NP.ser"
3. Untreated nanoparticle after first discharge in Zn-ion batteries: "Discharged U-NP.ser"
4. 200ºC heated nanoparticle after first discharge in Zn-ion batteries: "Discharged H200-NP.ser"
The EELS experiment data includes six example datasets for cathode nanoparticles collected at different states (in "EELS datasets.zip") as described below. Each EELS dataset contains the zero-loss and core-loss EELS spectra collected at different probe positions scanned across the cathode nanoparticle.
1. Pristine untreated nanoparticle: "Pristine U-NP EELS.zip"
2. Pristine 200ºC heated nanoparticle: "Prisitne H200-NP EELS.zip"
3. Untreated nanoparticle after first discharge in Zn-ion batteries: "Discharged U-NP EELS.zip"
4. Untreated nanoparticle after first charge in Zn-ion batteries: "Charged U-NP EELS.zip"
5. 200ºC heated nanoparticle after first discharge in Zn-ion batteries: "Discharged H200-NP EELS.zip"
6. 200ºC heated nanoparticle after first charge in Zn-ion batteries: "Charged H200-NP EELS.zip"
The details of the software package and codes that can be used to analyze the 4D-STEM datasets and EELS datasets are available at: https://github.com/chenlabUIUC/OrientedPhaseDomain. Once our paper is formally published, we will update the relationship of these datasets with our paper.
keywords:
4D-STEM; EELS; defects; strain; cathode; nanoparticle; energy storage
published:
2025-06-26
Zhang, Ruolin; Kontou, Eleftheria
(2025)
This dataset supports the analysis presented in the study on curbside electric vehicle (EV) charging infrastructure planning in San Francisco and the published paper titled "Urban electric vehicle infrastructure: Strategic planning for curbside charging." It includes spatial data layers and tabular data used to evaluate location suitability under multiple criteria, such as demand, accessibility, and environmental benefits. This dataset can be used to replicate the multi-criteria decision-making framework, perform additional spatial analyses, or inform policy decisions related to EV infrastructure siting in urban environments. The paper's DOI is https://doi.org/10.1016/j.jtrangeo.2025.104328.
keywords:
Electric Vehicles; Curbside Charging Stations; Multi-Criteria Decision-Making; Suitability Analysis; Urban Infrastructure
published:
2025-09-08
Lee, DoKyoung; Heaton, Emily; Umar, Muhammad; Jang, Chunhwa; Namoi, Nictor
(2025)
Purpose-grown perennial herbaceous species are nonfood crops specifically cultivated for bioenergy production and have the potential to secure bioenergy feedstock resources while enhancing ecosystem services. This study assessed soil greenhouse gas emissions (CO2 and N2O), nitrate (NO3-N) leaching reduction potential, evapotranspiration (ET), and water-use efficiency (WUE) of bioenergy switchgrass (Panicum virgatum L.) in comparison to corn (Zea mays L.). The study was conducted on field-scale plots in Urbana, IL, during the 2020–2022 growing seasons. Switchgrass was established in 2020 and urea-fertilized at 56 kg N ha−1 year−1. Corn management followed best management practices for the US Midwest, including no-till and 202 kg N ha−1 year−1 fertilization, applied as urea–ammonium nitrate (32%). Our results showed lower direct N2O emissions in switchgrass compared to corn. Although soil CO2 emissions did not differ significantly during the establishment year, emissions in subsequent years were over 50% higher in switchgrass than in corn, likely due to increased belowground biomass, which was over five times higher in switchgrass. Nitrate-N leaching decreased as the switchgrass stand matured, reaching 80% lower than in corn by the third year. Differences in ET and WUE between corn and switchgrass were not significant; however, results indicate a trend toward reduced WUE in switchgrass under drought, driven by lower aboveground biomass production. Our study demonstrates that switchgrass can be implemented at a commercial scale without negatively impacting the hydrological cycle, while potentially reducing N losses through nitrate-N leaching and soil N2O emissions, and enhancing belowground C storage.
keywords:
field data; perennial bioenergy grasses; soil; switchgrass
published:
2020-11-18
This is the dataset that accompanies the paper titled "A Dual-Frequency Radar Retrieval of Snowfall Properties Using a Neural Network", submitted for peer review in August 2020. Please see the github for the most up-to-date data after the revision process: https://github.com/dopplerchase/Chase_et_al_2021_NN
Authors: Randy J. Chase, Stephen W. Nesbitt and Greg M. McFarquhar Corresponding author: Randy J. Chase (randyjc2@illinois.edu)
Here we have the data used in the manuscript. Please email me if you have specific questions about units etc.
1) DDA/GMM database of scattering properties: base_df_DDA.csv
This is the combined dataset from the following papers: Leinonen & Moisseev, 2015; Leinonen & Szyrmer, 2015; Lu et al., 2016; Kuo et al., 2016; Eriksson et al., 2018. The column names are D: Maximum dimension in meters, M: particle mass in grams kg, sigma_ku: backscatter cross-section at ku in m^2, sigma_ka: backscatter cross-section at ka in m^2, sigma_w: backscatter cross-section at w in m^2. The first column is just an index column.
2) Synthetic Data used to train and test the neural network: Unrimed_simulation_wholespecturm_train_V2.nc, Unrimed_simulation_wholespecturm_test_V2.nc
This was the result of combining the PSDs and DDA/GMM particles randomly to build the training and test dataset.
3) Notebook for training the network using the synthetic database and Google Colab (tensorflow): Train_Neural_Network_Chase2020.ipynb
This is the notebook used to train the neural network.
4)Trained tensorflow neural network: NN_6by8.h5 This is the hdf5 tensorflow model that resulted from the training. You will need this to run the retrieval.
5) Scalers needed to apply the neural network: scaler_X_V2.pkl, scaler_y_V2.pkl These are the sklearn scalers used in training the neural network. You will need these to scale your data if you wish to run the retrieval.
6) <b>New in this version</b> - Example notebook of how to run the trained neural network on Ku- Ka- band observations. We showed this with the 3rd case in the paper: Run_Chase2021_NN.ipynb
7) <b>New in this version</b> - APR data used to show how to run the neural network retrieval: Chase_2021_NN_APR03Dec2015.nc
The data for the analysis on the observations are not provided here because of the size of the radar data. Please see the GHRC website (<a href="https://ghrc.nsstc.nasa.gov/home/">https://ghrc.nsstc.nasa.gov/home/</a>) if you wish to download the radar and in-situ data or contact me. We can coordinate transferring the exact datafiles used.
The GPM-DPR data are avail. here: <a href="http://dx.doi.org/10.5067/GPM/DPR/GPM/2A/05">http://dx.doi.org/10.5067/GPM/DPR/GPM/2A/05</a>
published:
2021-03-14
Kang, Jeon-Young; Michels, Alexander; Lyu, Fangzheng; Wang, Shaohua; Agbodo, Nelson; Freeman, Vincent L; Wang, Shaowen; Anand, Padmanabhan
(2021)
This dataset contains all the code, notebooks, datasets used in the study conducted to measure the spatial accessibility of COVID-19 healthcare resources with a particular focus on Illinois, USA. Specifically, the dataset measures spatial access for people to hospitals and ICU beds in Illinois. The spatial accessibility is measured by the use of an enhanced two-step floating catchment area (E2FCA) method (Luo & Qi, 2009), which is an outcome of interactions between demands (i.e, # of potential patients; people) and supply (i.e., # of beds or physicians). The result is a map of spatial accessibility to hospital beds. It identifies which regions need more healthcare resources, such as the number of ICU beds and ventilators. This notebook serves as a guideline of which areas need more beds in the fight against COVID-19.
## What's Inside
A quick explanation of the components of the zip file
* `COVID-19Acc.ipynb` is a notebook for calculating spatial accessibility and `COVID-19Acc.html` is an export of the notebook as HTML.
* `Data` contains all of the data necessary for calculations:
* `Chicago_Network.graphml`/`Illinois_Network.graphml` are GraphML files of the OSMNX street networks for Chicago and Illinois respectively.
* `GridFile/` has hexagonal gridfiles for Chicago and Illinois
* `HospitalData/` has shapefiles for the hospitals in Chicago and Illinois
* `IL_zip_covid19/COVIDZip.json` has JSON file which contains COVID cases by zip code from IDPH
* `PopData/` contains population data for Chicago and Illinois by census tract and zip code.
* `Result/` is where we write out the results of the spatial accessibility measures
* `SVI/`contains data about the Social Vulnerability Index (SVI)
* `img/` contains some images and HTML maps of the hospitals (the notebook generates the maps)
* `README.md` is the document you're currently reading!
* `requirements.txt` is a list of Python packages necessary to use the notebook (besides Jupyter/IPython). You can install the packages with `python3 -m pip install -r requirements.txt`
keywords:
COVID-19; spatial accessibility; CyberGISX
published:
2023-01-12
Mischo, William; Schlembach, Mary C.
(2023)
These processing and Pearson correlational scripts were developed to support the study that examined the correlational relationships between local journal authorship, local and external citation counts, full-text downloads, link-resolver clicks, and four global journal impact factor indices within an all-disciplines journal collection of 12,200 titles and six subject subsets at the University of Illinois at Urbana-Champaign (UIUC) Library. This study shows strong correlations in the all-disciplines set and most subject subsets. Special processing scripts and web site dashboards were created, including Pearson correlational analysis scripts for reading values from relational databases and displaying tabular results.
The raw data used in this analysis, in the form of relational database tables with multiple columns, is available at <a href="https://doi.org/10.13012/B2IDB-6810203_V1">https://doi.org/10.13012/B2IDB-6810203_V1</a>.
keywords:
Pearson Correlation Analysis Scripts; Journal Publication; Citation and Usage Data; University of Illinois at Urbana-Champaign Scholarly Communication
published:
2025-08-07
Keiser, Ashley D.; Heaton, Emily; VanLoocke, Andrew; Studt, Jacob; McDaniel, Marshall D.
(2025)
Bioenergy and bioproduct markets are expanding to meet demand for climate friendly goods and services. Perennial biomass crops are particularly well suited for this goal because of their high yields, low input requirements, and potential to increase soil carbon (C). However, it is unclear how much C is allocated into belowground pools by perennial bioenergy crops and whether the belowground benefits vary with nitrogen (N) fertilizer inputs. Using in situ 13C pulse-chase labeling, we tested whether the sterile perennial grass Miscanthus × giganteus (miscanthus) or annual maize transfers more photosynthetic C to belowground pools. The experiment took place at two sites in Central and Northwest (NW) Iowa with different management histories and two nitrogen (N) fertilizer rates (0 and 224 kg N ha-1 yr-1) to determine if the fate of plant-derived soil C depends on soil fertility and crop type (perennial or annual). Maize allocated a greater percentage of total new 13C to roots than miscanthus, but miscanthus had greater new 13C in total and belowground plant biomass. We found strong interactions between site and most soil measurements – including new 13C in mineral and particulate soil organic matter (SOM) pools –which appear to be driven by differences in historical fertilizer management. The NW Iowa site, with a history of manure inputs, had greater plant-available nutrients (phosphorus, potassium, and ammonium) in soils, and resulted in less 13C from miscanthus in SOM pools compared to maize (approximately 64% less in POM and 70% less in MAOM). In more nutrient-limited soils (Central site), miscanthus transferred 4.5 times more 13C than maize to the more stable mineral-associated SOM pool. Our results suggest that past management, including historical manure inputs that affect a site’s soil fertility, can influence the net C benefits of bioenergy crops.
Dataset includes tables/figures from article and supplementary info. Dryad contains raw data.
keywords:
land management; carbon; miscanthus; maize
published:
2025-10-07
Jagtap, Sujit Sadashiv; Bedekar, Ashwini Ashok; Singh, Vijay; Jin, Yong-Su; Rao, Christopher V.
(2025)
Yarrowia lipolytica was found natively to produce erythritol, mannitol, and arabitol during growth on glucose, fructose, mannose, and glycerol. Osmotic stress is known to increase sugar alcohol production, and was found to significantly increase erythritol production during growth on glycerol. To better understand erythritol production from glycerol, since it was the most promising sugar alcohol, we measured the expression of key genes and intracellular metabolites. Osmotic stress increased the expression of several key genes in the glycerol catabolic pathway and the pentose phosphate pathway. Analysis of intracellular metabolites revealed that amino acids, sugar alcohols, and polyamines are produced at higher levels in response to osmotic stress. Heterologous overexpression of the sugar alcohol phosphatase increased erythritol production and glycerol utilization in Y. lipolytica. We further increased erythritol production by increasing the expression of native glycerol kinase (GK), and transketolase (TKL). These data show the growth and titers produced.
keywords:
Conversion;Genome Engineering
published:
2025-11-13
Fan, Xinxin; Khanna, Madhu; Hartman, Theodore; VanLoocke, Andy
(2025)
The dataset consists of:
(1) The replication codes and data for the BEPAM model are contained in the "BEPAM_Supplementary Environment Policy Analysis.zip"
(2) Simulation results from the BEPAM model are contained in "ModelOutputs.zip" under the "BEPAM_Supplementary Environment Policy Analysis.zip"
published:
2025-09-08
Singh, Vijay; Raj, Tirath
(2025)
Miscanthus x giganteus (Mxg) is a promising perennial crop for producing natural colorants, renewable fuels, and bioproducts. However, natural recalcitrance and high pretreatment cost are major barriers to their complete conversion. In this study, a green processing method has been investigated for efficient recovery of natural pigments (anthocyanins), fermentable sugars, and pure lignin from Mxg genotypes using choline chloride-based natural deep eutectic solvents (NADES) systems. Interestingly, choline chloride: lactic acid (ChCl: LA) NADES-processed biomass resulted in 67.8 ± 2.1 μg g−1 of anthocyanins from dry biomass. A maximum of 87.4%–94.1% glucose yield was achieved after enzymatic saccharification. The effective extraction of lignin with high purity with higher β-aryl ether (βO4) bonds from advanced crops is crucial for lignin valorization. Notably, highly pure lignin (≈93.4% ± 1.4%) is achieved after low-temperature NADES pretreatment while retaining lignin’s native structure. 31P nuclear magnetic resonance demonstrated that total phenolics for ChCl: LA-lignin resulted in 1.20 mmol g−1 hydroxyls. The relative monolignol composition of syringyl (S), guaiacyl (G), and p-hydroxyphenyl (H) is 19.0, 65.7, and 14.3%, respectively, as evidenced by heteronuclear single quantum coherence analysis. This study provides a novel approach for obtaining high-purity lignin for catalytic depolymerization for oligomers and bifunctional monoaromatics production and leverages current cellulosic biorefinery technologies.
keywords:
biomass analytics; feedstock bioprocessing; inter-brc; miscanthus
published:
2017-09-08
Park, Jungsik; Le, Brian; Sklenar, Joseph; Chern, Gia-wei; Watts, Justin; Schiffer, Peter
(2017)
Transport and MFM data of brickwork artificial spin ice composed of permalloy are included, which are reproductions of the data in an article named "Magnetic response of brickwork artificial spin ice". Transport data represent magnetic response of connected brickwork artificial spin ice, and MFM data represent how both connected and disconnected brickwork artificial spin ice react to external magnetic fields. SEM images of typical samples are included, where individual nanowire leg (island) is approximately 660 nm long and 140 nm wide with a 40 nm thickness. For the transport, each sample was measured in a longitudinal and a transverse geometry. Red curves are the 2500 Oe to -2500 Oe sweeps and the blue curves are -2500 Oe to 2500 Oe sweeps. Transport measurements were taken by using a standard 4-wire technique. Each plot was saved in pdf format.
keywords:
Magnetotransport
published:
2023-12-20
Xie, Yuxuan Richard; Castro, Daniel C.; Rubakhin, Stanislav S.; Trinklein, Timothy J.; Sweedler, Jonathan V.; Fan, Lam
(2023)
Important Note: the raw transient files need to be downloaded through this separate link: https://uofi.box.com/s/oagdxhea1wi8tvfij4robj0z0w8wq7j4. Once downloaded, place the file within the within the .d folder in the unzipped 20210930_ShortTransient_S3_5 folder to perform reconstruction step.
The minimal datasets to run the computational pipeline MEISTER introduced in the manuscript titled "Integrative Multiscale Biochemical Mapping of the Brain via Deep-Learning-Enhanced High-Throughput Mass Spectrometry". The key steps of our computational pipeline include (1) tissue mass spectrometry imaging (MSI) reconstruction; (2) multimodal image registration and 3D reconstruction; (3) regional analysis; and (4) single-cell and tissue data integration. Detailed protocols to reproduce our results in the manuscript are provided with an example data set shared for learning the protocols. Our computational processing codes are implemented mostly in Python as well as MATLAB (for image registration).
keywords:
deep learning;mass spectrometry;single cells
published:
2023-08-03
Dalling, James William
(2023)
This file contains the delta 15N values for leaf material collected from Cyathea rojasiana tree ferns before and after fertilization using ammonium -15N chloride solution to determine whether 15N update is possible from senescent leaves.
Details of the experiment are provided in the online supplement to the published paper. Briefly, In February 2022 we selected three mature C. rojasiana individuals 1-1.5m in height that had leaves rooted in the soil and one new developing (but unexpanded) leaf. For each fern, two plastic pots (10 x 10 x 12 cm) were filled with a 50:50 mixture of washed river sand and soil from the Chorro watershed. For each pot, one senescent leaf that was rooted in the soil was carefully excavated and its roots transplanted into the pot. Pots were then fertilized by adding 30 ml of a 0.02 M 15N solution of ammonium-15N chloride (98% 15N; Sigma-Aldrich 299251; St Louis, MO) to yield a target concentration of 2 µg15N cm-3 of soil. After fertilization pots were carefully enclosed within thick plastic bags, and sealed around the senescent leaf rachis to prevent leaching any of 15N from the pot to the surrounding soil.
At the time of N fertilization, pinnae of the youngest fully expanded leaf were collected from each fern. One pinna was collected from the base of the leaf and one from the distal end of the leaf. In March 2022, after 28 days the roots were removed from pots and two additional leaf pinnae sampled from each fern: one from the base and one from the distal end of the youngest (now fully expanded) leaf. Leaf samples were dried for 72 hours at 60 C and then leaf lamina tissue finely ground with a bead beater. The delta 15N for each leaf sample determined at the University of Illinois, Urbana-Champaign using a Thermo Delta V Advantage IRMS run in combination with a Costech 4010 Elemental Analyzer. Samples were run in continuous flow relative to laboratory standards that were calibrated with USGS 40, 41, and NBS 19 reference materials.
keywords:
15N; Cyathea rojasiana; N fertilization; montane forest
published:
2025-11-18
Rodriguez-Zas, Sandra
(2025)
The data set correspponds to gene expression measurements from an RNA-seq experiment profiling the amygdala of pigs representing 3 stimuli and 2 sexes. The experiment was approved by IACUC. Information on ~ 12,000 genes (rows) across 36 samples (36 columns) and a column for gene identification are included in the dataset. A readme, and metadata and a license files are being uploaded with the compressed data file.
keywords:
RNA-seq; stimuli; sex; amygdala
published:
2025-09-15
HamediRad, Mohammad; Weisberg, Scott; Chao, Ran; Lian, Jiazhang; Zhao, Huimin
(2025)
Golden Gate assembly is one of the most widely used DNA assembly methods due to its robustness and modularity. However, despite its popularity, the need for BsaI-free parts, the introduction of scars between junctions, as well as the lack of a comprehensive study on the linkers hinders its more widespread use. Here, we first developed a novel sequencing scheme to test the efficiency and specificity of 96 linkers of 4-bp length and experimentally verified these linkers and their effects on Golden Gate assembly efficiency and specificity. We then used this sequencing data to generate 200 distinct linker sets that can be used by the community to perform efficient Golden Gate assemblies of different sizes and complexity. We also present a single-pot scarless Golden Gate assembly and BsaI removal scheme and its accompanying assembly design software to perform point mutations and Golden Gate assembly. This assembly scheme enables scarless assembly without compromising efficiency by choosing optimized linkers near assembly junctions.
keywords:
Conversion;Genome Engineering;Genomics