Illinois Data Bank Dataset Search Results
Results
published:
2019-09-17
Mishra, Shubhanshu
(2019)
Trained models for multi-task multi-dataset learning for sequence tagging in tweets.
Sequence tagging tasks include POS, NER, Chunking, and SuperSenseTagging.
Models were trained using: <a href="https://github.com/socialmediaie/SocialMediaIE/blob/master/SocialMediaIE/scripts/multitask_multidataset_experiment.py">https://github.com/socialmediaie/SocialMediaIE/blob/master/SocialMediaIE/scripts/multitask_multidataset_experiment.py</a>
See <a href="https://github.com/socialmediaie/SocialMediaIE">https://github.com/socialmediaie/SocialMediaIE</a> and <a href="https://socialmediaie.github.io">https://socialmediaie.github.io</a> for details.
If you are using this data, please also cite the related article:
Shubhanshu Mishra. 2019. Multi-dataset-multi-task Neural Sequence Tagging for Information Extraction from Tweets. In Proceedings of the 30th ACM Conference on Hypertext and Social Media (HT '19). ACM, New York, NY, USA, 283-284. DOI: https://doi.org/10.1145/3342220.3344929
keywords:
twitter; deep learning; machine learning; trained models; multi-task learning; multi-dataset learning;
published:
2022-11-07
Jones, Todd; Di Giovanni, Alexander; Hauber, Mark; Ward, Michael
(2022)
Dataset associated with Jones et al. ECY22-0118.R3 submission: Ontogenetic effects of brood parasitism by the Brown-headed Cowbird on host offspring. Excel CSV files with all of the data used in analyses and file with descriptions of each column.
keywords:
brood parasitism; cowbirds; host-parasite systems; ontogeny; post-fledging; songbirds
published:
2022-12-31
Maffeo, Christopher; Wilson, Jim; Quednau, Lauren; Aksimentiev, Aleksei
(2022)
Trajectory data for Nature Nanotechnology manuscript "DNA double helix, a tiny electromotor" that demonstrates how an electric field applied along the helical axis of a DNA or RNA molecule will generate an electroosmotic flow that causes the duplex to spin about that axis, much like a turbine.
keywords:
All-atom MD simulation; DNA; nanotechnology; motors and rotors
published:
2024-02-26
Harsh, Vipul; Zhou, Wenxuan; Ashok, Sachin; Mysore, Radhika Niranjan; Godfrey, Brighten; Banerjee, Sujata
(2024)
Traces created using DeathStarBench (https://github.com/delimitrou/DeathStarBench) benchmark of microservice applications with injected failures on containers. Failures consist of disk/CPU/memory failures.
keywords:
Murphy;Performance Diagnosis;Microservice;Failures
published:
2024-12-11
MMAudio pretrained models. These models can be used in the open-sourced codebase https://github.com/hkchengrex/MMAudio
<b>Note:</b> mmaudio_large_44k_v2.pth and Readme.txt are added to this V2. Other 4 files stay the same.
published:
2021-05-07
Prepared by Vetle Torvik 2021-05-07
The dataset comes as a single tab-delimited Latin-1 encoded file (only the City column uses non-ASCII characters).
• How was the dataset created?
The dataset is based on a snapshot of PubMed (which includes Medline and PubMed-not-Medline records) taken in December, 2018. (NLMs baseline 2018 plus updates throughout 2018). Affiliations are linked to a particular author on a particular article. Prior to 2014, NLM recorded the affiliation of the first author only. However, MapAffil 2018 covers some PubMed records lacking affiliations that were harvested elsewhere, from PMC (e.g., PMID 22427989), NIH grants (e.g., 1838378), and Microsoft Academic Graph and ADS (e.g. 5833220). Affiliations are pre-processed (e.g., transliterated into ASCII from UTF-8 and html) so they may differ (sometimes a lot; see PMID 27487542) from PubMed records. All affiliation strings where processed using the MapAffil procedure, to identify and disambiguate the most specific place-name, as described in:
Torvik VI. MapAffil: A bibliographic tool for mapping author affiliation strings to cities and their geocodes worldwide. D-Lib Magazine 2015; 21 (11/12). 10p
• Look for Fig. 4 in the following article for coverage statistics over time:
Palmblad, M., Torvik, V.I. Spatiotemporal analysis of tropical disease research combining Europe PMC and affiliation mapping web services. Trop Med Health 45, 33 (2017). <a href="https://doi.org/10.1186/s41182-017-0073-6">https://doi.org/10.1186/s41182-017-0073-6</a>
Expect to see big upticks in coverage of PMIDs around 1988 and for non-first authors in 2014.
• The code and back-end data is periodically updated and made available for query by PMID at http://abel.ischool.illinois.edu/cgi-bin/mapaffil/search.py
• What is the format of the dataset?
The dataset contains 52,931,957 rows (plus a header row). Each row (line) in the file has a unique PMID and author order, and contains the following eighteen columns, tab-delimited. All columns are ASCII, except city which contains Latin-1.
1. PMID: positive non-zero integer; int(10) unsigned
2. au_order: positive non-zero integer; smallint(4)
3. lastname: varchar(80)
4. firstname: varchar(80); NLM started including these in 2002 but many have been harvested from outside PubMed
5. initial_2: middle name initial
6. orcid: From 2019 ORCID Public Data File https://orcid.org/ and from PubMed XML
7. year: year of the publication
8. journal: name of journal that the publication is published
9. affiliation: author's affiliation??
10. disciplines: extracted from departments, divisions, schools, laboratories, centers, etc. that occur on at least unique 100 affiliations across the dataset, some with standardization (e.g., 1770799), English translations (e.g., 2314876), or spelling corrections (e.g., 1291843)
11. grid: inferred using a high-recall technique focused on educational institutions (but, for experimental purposes, includes a few select hospitals, national institutes/centers, international companies, governmental agencies, and 200+ other IDs [RINGGOLD, Wikidata, ISNI, VIAF, http] for institutions not in GRID). Based on 2019 GRID version https://www.grid.ac/
12. type: EDU, HOS, EDU-HOS, ORG, COM, GOV, MIL, UNK
13. city: varchar(200); typically 'city, state, country' but could include further subdivisions; unresolved ambiguities are concatenated by '|'
14. state: Australia, Canada and USA (which includes territories like PR, GU, AS, and post-codes like AE and AA)
15. country
16. lat: at most 3 decimals (only available when city is not a country or state)
17. lon: at most 3 decimals (only available when city is not a country or state)
18. fips: varchar(5); for USA only retrieved by lat-lon query to https://geo.fcc.gov/api/census/block/find
keywords:
PubMed, MEDLINE, Digital Libraries, Bibliographic Databases; Author Affiliations; Geographic Indexing; Place Name Ambiguity; Geoparsing; Geocoding; Toponym Extraction; Toponym Resolution; institution name disambiguation
published:
2021-05-14
This is the complete dataset for the "Anomalous density fluctuations in a strange metal" Proceedings of the National Academy of Sciences publication (https://doi.org/10.1073/pnas.1721495115). This is an integration of the Zenodo dataset which includes raw M-EELS data.
<b>METHODOLOGICAL INFORMATION</b>
1. Description of methods used for collection/generation of data: Data have been collected with a M-EELS instrument and according to the data acquisition protocol described in the original PNAS publication and in SciPost Phys. 3, 026 (2017) (doi: 10.21468/SciPostPhys.3.4.026)
2. Methods for processing the data: Raw data were collected with a channeltron-based M-EELS apparatus described in the reference PNAS publication and analyzed according to the procedure outlined both in the PNAS paper and in SciPost Phys. 3, 026 (2017) (doi: 10.21468/SciPostPhys.3.4.026). The raw M-EELS spectra at each momentum have been subject to minor data processing involving:
(a) averaging of different acquisitions at the same conditions,
(b) energy binning,
(c) division of an effective Coulomb matrix element (which yields a structure factor S(q,\omega)),
(d) antisymmetrization (which yields the imaginary chi)
All these procedures are described in the PNAS paper.
3. Instrument- or software-specific information needed to interpret the data: These data are simple .txt or .dat files which can be read with any standard data analysis software, notably Python notebooks, MatLab, Origin, IgorPro, and others. We do not include scripts in order to provide maximum flexibility.
4. Relationship between files, if important: We divided in different folders raw data, structure factors and imaginary chi.
<b>DATA-SPECIFIC INFORMATION</b>
There are 8 folders within the Data_public_deposition_v1.zip. Each folder contain data needed to create the corresponding figure in the publication.
<b>1. Fig1:</b> This folder contains 21 DAT files needed to plot the theory data in panels C and D, following this naming conventions:
[chiA]or[chiB]or[Pi]_q_number.dat
With chiA is the imaginary RPA charge susceptibility with a Coulomb interaction of electronically weakly coupled layers
chiB is the imaginary RPA charge susceptibility with the usual 4\pi e^2/q^2 Coulomb interaction.
Pi is the imaginary Lindhard polarizability.
q is momentum in reciprocal lattice units
Number is the numerical momentum value in reciprocal lattice units
<b>2. Fig2:</b> Files needed to plot Fig. 2 of the PNAS paper. Contains 3 folders as listed below. The files in this folder are named following this convention: Bi2212_295K_(1,-1)_50eV_161107_q_number_2.16_avg.dat,
295K is the sample temperature
(1,-1) is the momentum direction in reciprocal lattice units
50 eV is the incident e beam energy
161107 is the start date of the experiment in yymmdd format
Q is the momentum
Number is the momentum in reciprocal lattice units
2.16 is the energy range covered by the data in eV
Avg identifies averaged data
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
<b>3. Fig3:</b> Files needed to plot Fig. 3 of the PNAS paper. OP/ OD prefix identifies optimally doped or overdosed sample data, respectively.
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
<b>4. Fig4:</b> Files needed to plot Fig. 4 of the PNAS paper. The _fit_parameters.dat file contains the fit parameters extracted according to the fit procedure described in the manuscript and at all momenta.
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
<b>5. FigS1:</b> Files needed to plot Fig. S1 of the PNAS paper. There are 5 files in this folder. DAT files are M-EELS data following the prior naming convention, while the two .txt files are digitized data from N. Nücker, U. Eckern, J. Fink, and P. Müller, Long-Wavelength Collective Excitations of Charge Carriers in High-Tc Superconductors, Phys. Rev. B 44, 7155(R) (1991), and K. H. G. Schulte, The interplay of Spectroscopy and Correlated Materials, Ph.D. thesis, University of Groningen (2002).
<b>6. FigS2:</b> Files needed to plot Fig. S2 of the PNAS paper.
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
<b>7. FigS3:</b> Files needed to plot Fig. S3 of the PNAS paper. There are 2 files in this folder:
20K_phi_0_q_0.dat: is a M-EELS raw intensity at zero momentum transfer on Bi2212 at 20 K
295K_phi_0_q_0.dat: is a M-EELS raw intensity at zero momentum transfer on Bi2212 at 295 K
<b>8. FigS4:</b> Files needed to plot Fig. S4 of the PNAS paper. The _fit_parameters.dat file contains the fit parameters extracted according to the fit procedure described in the manuscript and at all momenta.
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
keywords:
Momentum resolved electron energy loss spectroscopy (M-EELS); cuprates; plasmons; strange metal
published:
2022-09-29
Merrill, Loren; Jones, Todd; Brawn, Jeffrey; Ward, Michael
(2022)
Dataset associated with Merrill et al. ECE-2021-05-00793.R1 submission: Early life patterns of growth are linked to levels of phenotypic trait covariance and post-fledging mortality across avian species. Excel CSV files with all of the data used in analyses and file with descriptions of each column.
keywords:
canalization; developmental flexibility; early-life stress; nest predation; phenotypic correlation; trait covariance
published:
2025-02-14
Sinaiko, Guy; Dietrich, Christopher
(2025)
This dataset includes the original data (including photographs as .jpg files and sound recordings as .wav files) and detailed descriptions of workflows for analyses of acoustic and morphometric data for the Neoaliturus tenellus (beet leafhopper) species complex. Files needed for different parts of the two analytical workflows are included in the "Acoustics.zip" and "PCA.zip" archives. The "Folder Structure.png" file contains a diagram of the folder structure of the two archives. Each archive contains a "ReadMe" file with instructions for repeating the analyses. File and folder names including the two-letter abbreviations TB, TD, TN and TP refer to four different putative species (operational taxonomic units, or OTUs, of the Neoaliturus tenellus complex.
keywords:
Hemiptera; Cicadellidae; integrative taxonomy; courtship; morphology
published:
2025-10-14
Jagtap, Sujit Sadashiv; Deewan, Anshu; Liu, Jing-Jing; Walukiewicz, Hanna E.; Yun, Eun Ju; Jin, Yong-Su; Rao, Christopher V.
(2025)
Rhodosporidium toruloides is an oleaginous yeast capable of producing a variety of biofuels and bioproducts from diverse carbon sources. Despite numerous studies showing its promise as a platform microorganism, little is known about its metabolism and physiology. In this work, we investigated the central carbon metabolism in R. toruloides IFO0880 using transcriptomics and metabolomics during growth on glucose, xylose, acetate, or soybean oil. These substrates were chosen because they can be derived from plants. Significant changes in gene expression and metabolite concentrations were observed during growth on these four substrates. We mapped these changes onto the governing metabolic pathways to better understand how R. toruloides reprograms its metabolism to enable growth on these substrates. One notable finding concerns xylose metabolism, where poor expression of xylulokinase induces a bypass leading to arabitol production. Collectively, these results further our understanding of central carbon metabolism in R. toruloides during growth on different substrates. They may also help guide the metabolic engineering and development of better models of metabolism for R. toruloides.
keywords:
Conversion;Metabolomics;Transcriptomics
published:
2020-12-02
Yang, Pan; Cai, Ximing; Khanna, Madhu
(2020)
The dataset includes the survey results about farmers’ perceptions of marginal land availability and the likelihood of a land pixel being marginal based on a machine learning model trained from the survey.
Two spreadsheet files are the farmer and farm characteristics (marginal_land_survey_data_shared.xlsx), and the existing land use of marginal lands (land_use_info_sharing.xlsx).
<b>Note:</b> the blank cells in these two spreadsheets mean missing values in the survey response.
The GeoTiff file includes two bands, one the marginal land likelihood in the Midwestern states (0-1), the other the dominant reason of land marginality (0-5; 0 for farm size, 1 for growing season precipitation, 2 for root zone soil water capacity, 3 for average slope, 4 for growing season mean temperature, and 5 for growing season diurnal range of temperature). To read the data, please use a GIS software such as ArcGIS or QGIS.
keywords:
marginal land; survey
published:
2022-10-14
Zhou, Shan; Li, Jiahui; Lu, Jun; Liu, Haihua; Kim, Ji-Young; Kim, Ahyoung; Yao, Lehan; Liu, Chang; Qian, Chang; Hood, Zachary D. ; Lin, Xiaoying; Chen, Wenxiang; Gage, Thomas E. ; Arslan, Ilke; Travesset, Alex; Sun, Kai; Kotov, Nicholas A.; Chen, Qian
(2022)
This dataset is the raw data including SEM, TEM, PINEM images and FDTD simulation as well as pairwise interaction calculation results.
published:
2022-08-23
Seyfried, Georgia; Corrales, Adriana; Kent, Angela; Dalling, James; Yang, Wendy
(2022)
This dataset contains soil chemical properties used to variation in soil fungal communities beneath Oreomunnea mexicana trees in the manuscript "Watershed-scale variation in potential fungal community contributions to ectomycorrhizal biogeochemical syndromes"
keywords:
Acid-base chemistry; Ectomycorrhizal fungi; Exploration type; Nitrogen cycling; Nitrogen isotopes; Plant-soil (below-ground) interactions; Saprotrophic fungi; Tropical forest
published:
2022-08-25
Souza-Cole, Ian; Ward, Michael; Rebecca, Mau; Jeffrey, Foster; Benson, Thomas
(2022)
Data in this publication were used to analyze the factors that influence the abundance of eastern whip-poor-wills in the Midwest and to describe the diet of this species. These data were collected in Illinois in 2019 and 2020. Procedures were approved by the Illinois Institutional Animal Care and Use Committee (IACUC), protocol no. 19006
keywords:
eastern whip-poor-will; Antrostomus vociferus; abundance; moths; nightjars; Lepidoptera; metabarcoding
published:
2024-09-03
Bishop, Rebecca; Jonk, Kaitlyn M; Migliorisi, Alessandro; Austin, Scott M; Mullins, Emma C; Wilkins, Pamela
(2024)
Healthy mares were administered phenylephrine to induce transient polycythemia secondary to splenic contraction. Data was collected at baseline (T0), 5 minutes (T1) and 2 hours (T2) post-phenylephrine infusion. Collected data included baseline CBC, chemistry, fibrinogen, and serum amyloid A; at each time point viscoelastic coagulation profiles (VCM Vet), traditional in-vitro coagulation profiles, and ultrasonographic measurements of the spleen were obtained.
keywords:
horse; coagulation; polycythemia; blood clotting; viscoelastic testing
published:
2025-10-14
Jia, Yuyao; Kumar, Deepak; Winkler-Moser, Jill K.; Dien, Bruce S.; Rausch, Kent D.; Tumbleson, M.E.; Singh, Vijay
(2025)
Efforts to engineer high-productivity crops to accumulate oils in their vegetative tissue present the possibility of expanding biodiesel production. However, processing the new crops for lipid recovery and ethanol production from cell wall saccharides is challenging and expensive. In a previous study using corn germ meal as a model substrate, we reported that liquid hot water (LHW) pretreatment enriched the lipid concentration by 2.2 to 4.2 fold. This study investigated combining oil recovery with ethanol production by extracting oil following LHW and simultaneous saccharification and co-fermentation (SSCF) of the biomass. Corn germ meal was again used to model the oil-bearing energy crops. Pretreated germ meal hydrolysate or solids (160 °C and 180 °C for 10 minutes) were fermented, and lipids were extracted from both the spent fermentation whole broth and fermentation solids, which were recovered by centrifugation and convective drying. Lipid contents in spent fermentation solids increased 3.7 to 5.7 fold compared to the beginning germ meal. The highest lipid yield achieved after fermentation was 36.0 mg lipid g−1 raw biomass; the maximum relative amount of triacylglycerol (TAG) was 50.9% of extracted oil. Although the fermentation step increased the lipid concentration of the recovered solids, it did not improve the lipid yields of pretreated biomass and detrimentally affected oil compositions by increasing the relative concentrations of free fatty acids.
keywords:
Conversion;Hydrolysate;Lipidomics
published:
2024-05-10
Dietrich, Christopher; Walden, Kimberly; Cao, Yanghui; Hernandez, Alvaro; Rendon, Gloria; Robinson, Gene; Skinner, Rachel; Stein, Jeffrey; Fields, Christopher
(2024)
The data provided in this submission are the gene annotations for the Illinois EBP pilot project samples, as well as the predicted proteins for each sample in FASTA format.
keywords:
Earth Biogenome Project;genome assembly;Insecta;non-model species;sequencing;annotation
published:
2025-10-03
Kang, Nam Kyu; Lee, Jaewon; Ort, Donald; Jin, Yong-Su
(2025)
L-malic acid is widely used in the food, chemical, and pharmaceutical industries. Here, we report on production of malic acid from xylose, the second most abundant sugar in lignocellulosic hydrolysates, by engineered Saccharomyces cerevisiae. To enable malic acid production in a xylose-assimilating S. cerevisiae, we overexpressed PYC1 and PYC2, coding for pyruvate carboxylases, a truncated MDH3 coding for malate dehydrogenase, and SpMAE1, coding for a Schizosaccharomyces pombe malate transporter. Additionally, both the ethanol- and glycerol-producing pathways were blocked to enhance malic acid production. The resulting strain produced malic acid from both glucose and xylose, but it produced much higher titers of malic acid from xylose than glucose. Interestingly, the engineered strain had higher malic acid yield from lower concentrations (10 g L‒1) of xylose, with no ethanol production, than from higher xylose concentrations (20 and 40 g L‒1). As such, a fed-batch culture maintaining xylose concentrations at low levels was conducted and 61.2 g L‒1 of malic acid was produced, with a productivity of 0.32 g L‒1 h. These results represent successful engineering of S. cerevisiae for the production of malic acid from xylose, confirming that that xylose offers the efficient production of various biofuels and chemicals by engineered S. cerevisiae.
keywords:
Conversion;Feedstock Production;Genome Engineering
published:
2023-12-23
Rodriguez-Zas, Sandra
(2023)
Supplemental document corresponding to a submission to Physiological Genomics (Data supplements and source materials must now be deposited in a community-recognized data repository or to a generalist public access repository if no community resource is available. See "Author/Production Requirements" for more information.) https://pg.msubmit.net/
keywords:
Supplemental, Physiological Genomics
published:
2025-03-20
This dataset contains white-tailed deer (Odocoileus virginianus) land cover utility score (deer LCU score) data for every TRS (township, range, and section), township-range, and county in Illinois, USA, based on annual National Land Cover Database (NLCD) data released for all years between 2000 and 2023. LCU data is provided in CSV files for each spatial scale, with TRS data split into 2 CSV files due to size limits. Rasters (TIF) showing all deer habitat in Illinois are also provided to show the location, quality, and quantity of deer habitat. A metadata file is also included for additional information.
keywords:
habitat; white-tailed deer; deer; Odocoileus virginianus; land cover; land classification; landscape; habitat suitability index; ecology; environment
published:
2021-06-14
Kelkar, Varun A.; Anastasio, Mark A.
(2021)
This repository contains the weights for two StyleGAN2 networks trained on two composite T1 and T2 weighted open-source brain MR image datasets, and one StyleGAN2 network trained on the Flickr Face HQ image dataset. Example images sampled from the respective StyleGANs are also included.
The datasets themselves are not included in this repository. The weights are stored as `.pkl` files. The code and instructions to load and use the weights can be found at https://github.com/comp-imaging-sci/pic-recon . Additional details and citations can be found in the file "README.md".
keywords:
StyleGAN2; Generative adversarial network (GAN); MRI; Medical imaging
published:
2023-06-01
Results of RT-LAMP reactions for influenza A virus diagnostic development.
keywords:
swine influenza; LAMP; gBlock
published:
2025-11-03
von Haden, Adam C.; Eddy, William; Burnham, Mark B.; Brzostek, Edward; Yang, Wendy; DeLucia, Evan H.
(2025)
Root exudation is a key process for plant nutrient acquisition, but the controls on root exudation and its relationship to soil C and N processes in agroecosystems are unclear. We hypothesized that root exudation rates would be related to root morphological traits, N fertilization, and soil moisture. We also anticipated that root exudation would be correlated with bulk soil enzyme activity. Root exudation, root traits, and bulk soil extracellular enzyme activity were assessed in maize (Zea mays L.), soybean (Glycine max (L.) Merr.), biomass sorghum (Sorghum bicolor (L.) Moench), giant miscanthus (Miscanthus × giganteus), and switchgrass (Panicum virgatum L.). Measurements were taken in situ during two growing seasons with contrasting precipitation regimes, and N fertilization rate was varied in sorghum during one year. Specific root exudation (per unit root surface area) was negatively related to root diameter and was generally higher in annuals than perennials. Sorghum N fertilization did not affect root exudation rates, and soil moisture regime had no effect on annual root exudation rates within maize, sorghum, and miscanthus. Specific root exudation was negatively related to bulk soil C- and N-degrading soil enzyme activities. Intrinsic plant characteristics appeared more important than environmental variables in controlling in situ root exudation rates. The relationships between root diameter, root exudation, and soil C and N processes link root morphological traits to soil functions and demonstrate the potential tradeoffs among plant nutrient acquisition strategies in agroecosystems.
keywords:
Sustainability;Biomass Analytics;Field Data
published:
2022-05-26
Madhavan, Vidya; Aishwarya, Anuva
(2022)
The data files are for the paper entitled: Long-lifetime spin excitations near domain walls in 1T-TaS2 to be published in PNAS. The data was obtained on a 300 mK custom designed Unisoku scanning tunneling microscope using the Nanonis module. All the data files have been named based on the Figure numbers that they represent.
keywords:
Mott Insulator; Spins; Charge Density Wave; Domain walls; Long lifetime
published:
2021-10-27
de Jesús Astacio, Luis Miguel ; Prabhakara, Kaumudi Hassan; Li, Zeqian; Mickalide, Harry; Kuehn , Seppe
(2021)
Shared dataset consists of 16S sequencing data of microbial communities. Each community is composed of heterotrophic bacteria derived from one of two soil samples and the model algae Chlamydomonas reinhardtii. Each comunity was placed in a materially closed environment with an initial supply of carbon in the media and subjected to light-dark cycles. The closed microbial ecosystems (CES) survived via carbon cycling. Each CES was subjected to rounds of dilution, after which the community was sequenced (data provided here). The shared dataset allowed us to conclude that CES consistently self-assembled to cycle carbon (data not provided) via conserved metabolic capabilites (data not provided) dispite differences in taxonomic composition (data provided).
---------------------------
Naming convention:
[soil sample = A or B][CES replicate = 1,2,3, or 4]_[round number = 1,2,3,or 4]_[reverse read = R or forward read = F]_filt.fastq
Example -- A1_r1_F_filt.fastq means soil sample A, CES replicate 1, end of round1, forward read
keywords:
16S seq; .fastq; closed microbial ecosystems; carbon cycling