Metagenomic sequencing and reconstruction of 82 microbial genomes from barley seed communities

Tshisekedi, Kalonji A.; De Maayer, Pieter; Botes, Angela

doi:10.1038/s41597-024-03332-x

Download PDF

Data Descriptor
Open access
Published: 10 May 2024

Metagenomic sequencing and reconstruction of 82 microbial genomes from barley seed communities

Kalonji A. Tshisekedi¹,
Pieter De Maayer¹ &
Angela Botes¹

Scientific Data volume 11, Article number: 484 (2024) Cite this article

248 Accesses
9 Altmetric
Metrics details

Subjects

Abstract

Barley (Hordeum vulgare) is essential to global food systems and the brewing industry. Its physiological traits and microbial communities determine malt quality. Although microbes influence barley from seed health to fermentation, there is a gap in metagenomic insights during seed storage. Crucially, elucidating the changes in microbial composition associated with barley seeds is imperative for understanding how these fluctuations can impact seed health and ultimately, influence both agricultural yield and quality of barley-derived products. Whole metagenomes were sequenced from eight barley seed samples obtained at different storage time points from harvest to nine months. After binning, 82 metagenome-assembled genomes (MAGs) belonging to 26 distinct bacterial genera were assembled, with a substantial proportion of potential novel species. Most of our MAG dataset (61%) showed over 90% genome completeness. This pioneering barley seed microbial genome retrieval provides insights into species diversity and structure, laying the groundwork for understanding barley seed microbiome interactions at the genome level.

Genomic analyses reveal the stepwise domestication and genetic mechanism of curd biogenesis in cauliflower

Article Open access 07 May 2024

Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis

Article Open access 06 November 2019

Unveiling microbial diversity: harnessing long-read sequencing technology

Article 30 April 2024

Background & Summary

Seed microbiomes are essential to plant health, growth, and resilience, and play an important role in the physiological processes required for effective crop development¹. The barley seed microbiome, in particular, is of critical importance, influencing not only crop yield but also the quality of barley-derived products^2,3. Barley (Hordeum vulgare) has been integral to agriculture since the early phases of human civilization⁴. Its significance in the modern era is two-fold: as a fundamental component of the global food system, and as a crucial ingredient in the brewing industry^3,5. While the physiological attributes of barley influence malt quality, the microbial communities associated with barley also play an essential role, from sowing to malting².

Malting barley seeds are colonised by rich and diverse microbial communities, encompassing both endophytic and epiphytic organisms^1,6,7. These microorganisms, which can be both beneficial and detrimental, have the potential to affect seed health, germination success, and the quality of fermentation products^8,9,10. Several studies highlight the diversity of microbial populations associated with malting barley and their potential effects on brewing product quality^8,11,12. Understanding these microbial communities and their genomic content can provide insights into seed storage longevity, contamination risks, and their potential impact on subsequent production stages. However, there is a notable gap in comprehensive metagenomic datasets focusing on these microbial communities, especially during the seed storage phase.

Metagenome sequencing can provide profound insights into microbial ecosystems without necessitating laboratory cultivation^13,14,15. This approach not only provides a comprehensive understanding of the taxonomic and functional variations among phytomicrobial communities, but also sheds light on the complex interrelationships across these communities and their plant hosts^16,17. In the context of barley seed storage, acquiring this understanding using omics paves the way for developing microbial management strategies, optimising storage conditions, mitigating losses, and ensuring consistent production of premium malt.

Whole metagenomes were sequenced from eight samples of barley seeds stored in siloes at four different time points (two samples per time point), namely at harvest and after three, six and nine months, respectively (Table S1). The metagenomic data was assembled into nearly complete microbial genomes. A total of 82 metagenome-assembled genomes (MAGs) were assembled from these metagenomes (Table S2). The completeness of the MAGs was evaluated using CheckM v1.2.2¹⁸. All MAGs demonstrated completeness >75%, with 50/82 being >90% complete. These completeness values are in alignment with the high-quality draft criterion of the Minimum Information about a Metagenome-Assembled Genome (MIMAG) standards for Bacteria and Archaea¹⁹ (Fig. 1, Table S2).

Furthermore, minimal levels of sequence heterogeneity were observed for all 82 MAGs. Approximately 91% (75/82) of the MAGs registered contamination levels <5%, whereas the remaining seven MAGS exhibited contaminant levels between 5 and 10%, ensuring the reliability and integrity of our dataset (Fig. 1 and Table S2). We identified a notable negative correlation between genome completeness and contamination (r = −0.498, p < 0.00001; Fig. 2A). In parallel, our data demonstrated a positive relationship between genome size and the N50 metric (r = 0.251, p = 0.023; Fig. 2B), indicating that larger genomes are often associated with superior assembly contiguity.

Taxonomic evaluation using the Genome Taxonomy Database Toolkit (GTDB-Tk)²⁰ revealed that the barley-associated MAG dataset was dominated by members of the phylum Pseudomonadota (formerly the Proteobacteria), comprising 53.7% (44/82) of the total MAGs (Table S2) This is consistent with the findings from a previous amplicon sequencing-based study of barley seed endophytic microbial communities⁷. However, in contrast to the previous findings, we identified Bacteroidota (16/82) as the second most prevalent phylum. The abundances of Actinobacteria and Bacillota (Firmicutes) in our study also differed from those previously reported⁷, underscoring the inherent variability of barley seed microbiomes (Fig. 1 and Table S2).

Temporal shifts in genera abundance over nine months

The barley-seed derived MAGs were classified into 26 bacterial genera across eight phyla and six classes (Table S2). The microbiome was characterised by several dominant genera, with thirteen, nine, seven and six MAGs belonging to the genera Erwinia, Pseudomonas, Chryseobacterium and Paenibacillus, respectively (Fig. 3). Notably, 16 MAGs could not be accurately classified at the species level, highlighting the underexplored microbial diversity associated with barley seeds (Fig. 4, Table S2).

The barley seed microbiome shows discernible shifts during storage (Fig. 5). While the genera Erwinia and Duffyella remain pertinent from harvest through prolonged storage, there is a notable downshift and upshift in the presence of genera Chryseobacterium and Pseudomonas_E, respectively, during silo storage. These shifts may provide insights into the role of the barley seed microbiome in both seed health and disease. Chryseobacterium sp. have been observed to counteract the effects of Magnaporthe oryzae, a cause of barley blast disease, primarily by detaching fungal spores from leaf surfaces²¹, and may contribute to maintaining seed health in the field. Duffyella also garnered interest due to its observed ability to curb the growth of Fusarium tricinctum, another pathogen affecting barley^22,23. All Erwinia MAGs identified in the study were classified in the species E. persicina, a known broad host range phytopathogen, which has been linked to pink seed disease in barley²⁴. Pseudomonas-like taxa in this study were classified as part of the novel genus Pseudomonas_E as predicted by the GTDB classification database²⁰.

Methods

Sample collection and processing

Malting barley (Hordeum vulgare) samples, of a single cultivar (Kadie), were sourced from Anheuser-Busch InBev (AB-Inbev) in South Africa., specifically from Storage facilities in the Western Cape province, South Africa, were selected. Samples were collected at four distinct time points: immediately post-harvest and then after three, six, and nine months of storage in silos. At each time point, three samples were collected. All samples were aseptically collected and stored at −20 °C to inhibit microbial growth.

DNA isolation and sequencing

Approximately 10 g of barley was crushed using a sterilised mortar and pestle. The resulting residue was suspended in 40 ml of phosphate buffered saline (PBS) solution (pH 7.4). The suspension was briefly vortexed to homogenise the mixture, followed by sonication at 18 W amplitude with a 30-s on-off pulsating schedule for 7 min. The mixture was centrifuged at 4000 × g for 1 min to separate the supernatant, which was transferred to an autoclaved polycarbonate filter holder and filter membrane (0.45 µm pore filter, Sartorius-Stedim Biotech) prepared filter membrane system.

Metagenomic DNA was extracted from the filter using the ZymoBIOMICS DNA/RNA Miniprep Kit (Zymo Research), following the protocol recommended by the manufacturer. A Nanodrop Lite Spectrophotometer (Thermo Fisher Scientific) was used to validate the integrity and purity and quantify the DNA. The metagenomic DNA samples were sequenced using the Illumina NovaSeq. 6000 platform (paired end reads, 2 × 250 bp) at Molecular Research (MRDNA, Texas, USA). The total number of reads obtained was approximately 365.27 million. On average, each sample yielded around 22.83 million reads, with the maximum number of reads for a single sample being approximately 38.26 million and the minimum around 10.36 million. These metrics provide an overview of the sequencing depth achieved in our study. For a detailed breakdown of read counts for each sample (Table S1).

Metagenomic data analysis

Raw sequence reads were evaluated for quality using FastQC v0.12.1²⁵ and MultiQC v1.15²⁶. Trimmomatic V0.36²⁷ was used to filter out reads shorter than 36 bp or with an average quality score lower than 15. The removal of host DNA was performed using Bowtie2 v2.5.1²⁸ and SAMtools v1.19²⁹. Initially, an index database employing the reference genome of barley (Hordeum vulgare, Accession number: GCF_904849725.1) was constructed using the bowtie2-build command. Subsequently, read mapping to the host sequence database with Bowtie2 was conducted, preserving both aligned and unaligned paired end reads. Following this, SAMtools was used to convert the sam file into a bam format. The required unmapped reads were precisely isolated by applying SAMtools SAM-flag filters (-f 12 and -F 256), which selected pairs where both reads (R1 and R2) were unmapped. Finally, the SAMtools sort and SAMtools fastq commands were used to separate the paired end reads into distinct fastq files. Host DNA contamination varied across samples with the mean contamination ratio was approximately 0.5757%, with the minimum at 0.0059% (3,088 contaminated reads out of 52,678,404) and the maximum at 2.7368% (567,134 contaminated reads out of 20,155,530) (Table S1). Thereafter, the reads were then assembled using metaSPAdes v3.15.3³⁰ with default parameters. The integrity and quality of the final assemblies were evaluated using QUAST v5.2.0³¹.

Metagenomic binning and refinement

Metagenomic binning was performed based on tetranucleotide frequencies, coverage, and GC content using the MetaWRAP v1.3³² pipeline with default parameters using the tools MaxBin v2.0³³, metaBAT2³⁴, and CONCOCT v1.0.0³⁵. The bins were refined further using the MetaWRAP-Bin_refinement module with the parameters -c 70 and -× 10 (completeness >70% and contamination <10%) to improve bin quality. The completeness and contamination levels of these genome segments were evaluated using CheckM v1.2.2¹⁸ as part of the MetaWRAP workflow. Subsequently, the bins were reassembled using the MetaWRAP-reassemble_bins module (parameters: -c 70 × 10). The refined bins were dereplicated at a 95% average nucleotide identity (ANI) threshold using dRep v2.6.2³⁶, culminating in 82 nonredundant MAGs.

Phylogenetic analysis and classification of MAGS

For taxonomic assignment of MAGs, the classify_wf workflow from GTDB-Tk v3.4.2²⁰ was employed in tandem with the reference data GTDB release207v2²⁰, all executed with default settings. A comprehensive phylogenetic tree encompassing 82 species-level bacterial MAGs was derived from 120 bacterial marker genes using the gtdbtk_infer module in GTDB-TK. To improve interpretation and visualisation, the tree was annotated using iTOL v5³⁷.

Data Records

The data records are available Figshare³⁸.

The 82 MAGs have been deposited at DDBJ/ENA/GenBank under the accession numbers listed in Table 1^{39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119}.

Table 1 Genomic characteristics and accession numbers of 82 microbial genomes from barley seed communities described in this study.

Full size table

Additional metadata and details about each MAGs are available in the Supplementary Table S2.

The raw reads used to reconstruct the MAGs have been deposited to the NCBI Sequence Read Archive¹²⁰.

Technical Validation

Implementation of robust software applications, such as FastQC, MultiQC, and Trimmomatic, all of which were designed to curate and refine the sequence data. Combining the comprehensive MetaWRAP pipeline with dependable tools such as CheckM and GTDB-tk strengthened the binning, genome assembly, and taxonomic assignment processes. The culmination of these exhaustive validation stages is a dataset that is not only technically sound, but also a model of dependability and reproducibility in metagenomic research.

Code availability

No unique codes were used in the compilation or processing of this dataset. When applicable, the software versions and any deviations from default settings are explicitly indicated.

References

Barret, M. et al. Emergence Shapes the Structure of the Seed Microbiota. Applied and Environmental Microbiology 81, 1257–1266 (2015).
Article ADS PubMed PubMed Central Google Scholar
Noots, I., Delcour, J. A. & Michiels, C. W. From field barley to malt: detection and specification of microbial activity for quality aspects. Crit Rev Microbiol 25, 121–153 (1999).
Article CAS PubMed Google Scholar
Langridge, P. Economic and Academic Importance of Barley. In: Stein, N., Muehlbauer, G. J. (eds). The Barley Genome, pp 1–10 Springer International Publishing: Cham, (2018).
Newman. A Brief History of Barley Foods. CFW. https://doi.org/10.1094/CFW-51-0004 (2006).
Verstegen, H., Köneke, O., Korzun, V., von Broock, R. The World Importance of Barley and Challenges to Further Improvements. In: Kumlehn, J., Stein, N. (eds). Biotechnological Approaches to Barley Improvement, pp 3–19 (Springer: Berlin, Heidelberg, 2014).
Flannigan, B. Distribution of seed-borne micro-organisms in naked barley and wheat before harvest. Transactions of the British Mycological Society 62, 51–58 (1974).
Article Google Scholar
Bziuk, N. et al. The treasure inside barley seeds: microbial diversity and plant beneficial bacteria. Environmental Microbiome 16, 20 (2021).
Article PubMed PubMed Central Google Scholar
Bokulich, N. A. & Bamforth, C. W. The microbiology of malting and brewing. Microbiol Mol Biol Rev 77, 157–172 (2013).
Article CAS PubMed PubMed Central Google Scholar
Flannigan, B. The microbiota of barley and malt. In: Priest, F. G., Campbell, I. (eds). Brewing Microbiology, pp 113–180 Springer US: Boston, MA, (2003).
Han, B., Xie, Y., Zhang, M., Lu, J. & Cai, G. Impact of barley endophytic Pantoea agglomerans on the malt filterability. Eur Food Res Technol 249, 1403–1409 (2023).
Article CAS Google Scholar
Laitila, A., Kotaviita, E., Peltola, P., Home, S. & Wilhelmson, A. Indigenous Microbial Community of Barley Greatly Influences Grain Germination and Malt Quality. Journal of the Institute of Brewing 113, 9–20 (2007).
Article CAS Google Scholar
Harley, H. H. O. Producing Quality Barley for the Malting Industry. (2015).
Adams, I. P., Fox, A., Boonham, N., Massart, S. & De Jonghe, K. The impact of high throughput sequencing on plant health diagnostics. Eur J Plant Pathol 152, 909–919 (2018).
Article CAS Google Scholar
Sharma, M., Sudheer, S., Usmani, Z., Rani, R., Gupta, P. Deciphering the Omics of Plant-Microbe Interaction: Perspectives and New Insights. Current Genomics 21: 343–362.
Pervaiz T, Lotfi A, Salman Haider M, Haifang J, Fang J. High Throughput Sequencing Advances and Future Challenges. J Plant Biochem Physiol 05, https://doi.org/10.4172/2329-9029.1000188 (2017).
Regalado, J. et al. Combining whole-genome shotgun sequencing and rRNA gene amplicon analyses to improve detection of microbe–microbe interaction networks in plant leaves. ISME J 14, 2116–2130 (2020).
Article CAS PubMed PubMed Central Google Scholar
Fadiji, A. E., Ayangbenro, A. S. & Babalola, O. O. Shotgun metagenomics reveals the functional diversity of root-associated endophytic microbiomes in maize plant. Current Plant Biology 25, 100195 (2021).
Article CAS Google Scholar
Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25, 1043–1055 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bowers, R. M. et al. Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea. Nat Biotechnol 35, 725–731 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chaumeil, P.-A., Mussig, A. J., Hugenholtz, P. & Parks, D. H. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 36, 1925–1927 (2020).
Article CAS Google Scholar
Kitagawa, H., Shimoi, S., Inoue, K., Park, P. & Ikeda, K. Durable and broad-spectrum disease protection measure against airborne phytopathogenic fungi by using the detachment action of gelatinolytic bacteria. Biological Control 71, 1–6 (2014).
Article Google Scholar
Gnonlonfoun, E. et al. Inhibition of the Growth of Fusarium tricinctum and Reduction of Its Enniatin Production by Erwinia gerundensis Isolated from Barley Kernels. Journal of the American Society of Brewing Chemists 81, 340–350 (2023).
Article CAS Google Scholar
Gnonlonfoun, E. et al. Impact of Erwinia gerundensis as a Biocontrol Agent on the Sanitary and Technological Quality of Barley Malt. Journal of the American Society of Brewing Chemists 0, 1–14 (2023).
Google Scholar
Kawaguchi, A. et al. Pink seed of barley caused by Erwinia persicina. J Gen Plant Pathol 87, 106–109 (2021).
Article Google Scholar
Andrews, S. Babraham Bioinformatics - FastQC A Quality Control tool for High Throughput Sequence Data. https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed 5 Sep2019) (2010).
Ewels, P., Magnusson, M., Lundin, S. & Käller, M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32, 3047–3048 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS PubMed PubMed Central Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat Methods 9, 357–359 (2012).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central Google Scholar
Nurk, S., Meleshko, D., Korobeynikov, A. & Pevzner, P. A. metaSPAdes: a new versatile metagenomic assembler. Genome Res 27, 824–834 (2017).
Article CAS PubMed PubMed Central Google Scholar
Gurevich, A., Saveliev, V., Vyahhi, N. & Tesler, G. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29, 1072–1075 (2013).
Article CAS PubMed PubMed Central Google Scholar
Uritskiy, G. V., DiRuggiero, J. & Taylor, J. MetaWRAP—a flexible pipeline for genome-resolved metagenomic data analysis. Microbiome 6, 158 (2018).
Article PubMed PubMed Central Google Scholar
Wu, Y.-W., Simmons, B. A. & Singer, S. W. MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics 32, 605–607 (2016).
Article CAS PubMed Google Scholar
Kang, D. D. et al. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ 7, e7359 (2019).
Article PubMed PubMed Central Google Scholar
Alneberg, J. et al. Binning metagenomic contigs by coverage and composition. Nat Methods 11, 1144–1146 (2014).
Article CAS PubMed Google Scholar
Olm, M. R., Brown, C. T., Brooks, B. & Banfield, J. F. dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J 11, 2864–2868 (2017).
Article CAS PubMed PubMed Central Google Scholar
Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Research 49, W293–W296 (2021).
Article CAS PubMed PubMed Central Google Scholar
Tshisekedi, K. A., De Maayer, P. & Botes, A. Metagenomic sequencing and reconstruction of 82 microbial genomes from barley seed communities., Figshare, https://doi.org/10.6084/m9.figshare.24354352.v1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032585.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032605.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037031965.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032645.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032685.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032705.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032665.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037031985.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032725.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032745.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032045.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032795.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032765.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032785.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032825.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032845.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032005.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032865.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032905.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032925.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032025.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032945.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032885.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032965.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033005.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033045.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037032985.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033025.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033065.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033085.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033105.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033125.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033145.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033165.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033185.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033205.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033245.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033225.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033265.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033285.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033305.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033325.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033345.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033365.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033385.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033405.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033425.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033485.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033465.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033445.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033505.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033525.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033545.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033565.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033605.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033585.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033625.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033645.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033685.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033665.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033705.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033725.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033745.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033765.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033785.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033825.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033805.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033845.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033885.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033865.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033925.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033905.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033945.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037034005.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033985.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037034025.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037033965.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037034045.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037034065.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037034085.1 (2023).
NCBI GenBank. https://identifiers.org/ncbi/insdc.gca:GCA_037034105.1 (2023).
NCBI Sequence Read Archive. https://identifiers.org/ncbi/insdc.sra:SRP479463 (2023).

Download references

Acknowledgements

This project was funded by the South African National Research Foundation (NRF) and Anheuser-Busch InBev.

Author information

Authors and Affiliations

School of Molecular and Cell Biology, Wits University, Johannesburg, South Africa
Kalonji A. Tshisekedi, Pieter De Maayer & Angela Botes

Authors

Kalonji A. Tshisekedi
View author publications
You can also search for this author in PubMed Google Scholar
Pieter De Maayer
View author publications
You can also search for this author in PubMed Google Scholar
Angela Botes
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.T. designed the methodology, performed the analysis, prepared the figure and tables, and wrote the paper. P.D.M. wrote and reviewed drafts of the paper. A.B. and conceived the study, wrote, and reviewed drafts of the paper.

Corresponding author

Correspondence to Angela Botes.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Table S1

Table S2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tshisekedi, K.A., De Maayer, P. & Botes, A. Metagenomic sequencing and reconstruction of 82 microbial genomes from barley seed communities. Sci Data 11, 484 (2024). https://doi.org/10.1038/s41597-024-03332-x

Download citation

Received: 23 October 2023
Accepted: 30 April 2024
Published: 10 May 2024
DOI: https://doi.org/10.1038/s41597-024-03332-x