- Home
- Publications
- International Journal of Systematic and Evolutionary Microbiology
- Volume 67, Issue 5
- Article

oa Introducing EzBioCloud: a taxonomically united database of 16S rRNA gene sequences and whole-genome assemblies
- Authors: Seok-Hwan Yoon1 , Sung-Min Ha1 , Soonjae Kwon1 , Jeongmin Lim1 , Yeseul Kim1 , Hyungseok Seo1 , Jongsik Chun1
-
- VIEW AFFILIATIONS
-
1 Department of ChunLab, Inc, Seoul National University, Seoul, Republic of Korea
- *Correspondence: Jongsik Chun, [email protected] or [email protected]
- First Published Online: 30 May 2017, International Journal of Systematic and Evolutionary Microbiology 67: 1613-1617, doi: 10.1099/ijsem.0.001755
- Subject: Evolution, Phylogeny and Biodiversity
- Received:
- Accepted:
- Cover date:
- This is an open access article published by the Microbiology Society under the Creative Commons Attribution License




Introducing EzBioCloud: a taxonomically united database of 16S rRNA gene sequences and whole-genome assemblies, Page 1 of 1
< Previous page | Next page > /docserver/preview/fulltext/ijsem/67/5/1613_ijsem001755-1.gif
-
The recent advent of DNA sequencing technologies facilitates the use of genome sequencing data that provide means for more informative and precise classification and identification of members of the Bacteria and Archaea. Because the current species definition is based on the comparison of genome sequences between type and other strains in a given species, building a genome database with correct taxonomic information is of paramount need to enhance our efforts in exploring prokaryotic diversity and discovering novel species as well as for routine identifications. Here we introduce an integrated database, called EzBioCloud, that holds the taxonomic hierarchy of the Bacteria and Archaea, which is represented by quality-controlled 16S rRNA gene and genome sequences. Whole-genome assemblies in the NCBI Assembly Database were screened for low quality and subjected to a composite identification bioinformatics pipeline that employs gene-based searches followed by the calculation of average nucleotide identity. As a result, the database is made of 61 700 species/phylotypes, including 13 132 with validly published names, and 62 362 whole-genome assemblies that were identified taxonomically at the genus, species and subspecies levels. Genomic properties, such as genome size and DNA G+C content, and the occurrence in human microbiome data were calculated for each genus or higher taxa. This united database of taxonomy, 16S rRNA gene and genome sequences, with accompanying bioinformatics tools, should accelerate genome-based classification and identification of members of the Bacteria and Archaea. The database and related search tools are available at www.ezbiocloud.net/.
-
Three supplementary figures are available with the online Supplementry Material.
- Keyword(s): identification, 16S rRNA gene, average nucleotide identity, database, genome
© 2017 IUMS | Published by the Microbiology Society
-
1. Chun J, Lee JH, Jung Y, Kim M, Kim S et al. EzTaxon: a web-based tool for the identification of prokaryotes based on 16S ribosomal RNA gene sequences. Int J Syst Evol Microbiol 2007;57:2259–2261 [CrossRef][PubMed]
-
2. Cole JR, Wang Q, Fish JA, Chai B, Mcgarrell DM et al. Ribosomal database project: data and tools for high throughput rRNA analysis. Nucleic Acids Res 2014;42:D633–D642 [CrossRef][PubMed]
-
3. Kim OS, Cho YJ, Lee K, Yoon SH, Kim M et al. Introducing EzTaxon-e: a prokaryotic 16S rRNA gene sequence database with phylotypes that represent uncultured species. Int J Syst Evol Microbiol 2012;62:716–721 [CrossRef][PubMed]
-
4. Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res 2013;41:D590–D596 [CrossRef][PubMed]
-
5. Fox GE, Wisotzkey JD, Jurtshuk P. How close is close: 16S rRNA sequence identity may not be sufficient to guarantee species identity. Int J Syst Bacteriol 1992;42:166–170 [CrossRef][PubMed]
-
6. Kim M, Oh HS, Park SC, Chun J. Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes. Int J Syst Evol Microbiol 2014;64:346–351 [CrossRef][PubMed]
-
7. Wayne LG, Brenner DJ, Colwell RR, Grimont PAD, Kandler O et al. International Committee on Systematic Bacteriology. Report of the ad hoc committee on reconciliation of approaches to bacterial systematics. Int J Syst Bacteriol 1987;37:463–464[CrossRef]
-
8. Chun J, Rainey FA. Integrating genomics into the taxonomy and systematics of the Bacteria and Archaea. Int J Syst Evol Microbiol 2014;64:316–324 [CrossRef][PubMed]
-
9. Richter M, Rosselló-Móra R. Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci USA 2009;106:19126–19131 [CrossRef][PubMed]
-
10. Lee I, Kim YO, Park SC, Chun J. OrthoANI: an improved algorithm and software for calculating average nucleotide identity. Int J Syst Evol Microbiol 2015;66:1100–1103 [CrossRef][PubMed]
-
11. Snitkin ES, Zelazny AM, Thomas PJ, Stock F, Henderson DK et al. Tracking a hospital outbreak of carbapenem-resistant Klebsiella pneumoniae with whole-genome sequencing. Sci Transl Med 2012;4:148ra116 [CrossRef][PubMed]
-
12. Kyrpides NC, Hugenholtz P, Eisen JA, Woyke T, Göker M et al. Genomic encyclopedia of bacteria and archaea: sequencing a myriad of type strains. PLoS Biol 2014;12:e1001920 [CrossRef][PubMed]
-
13. Jeon YS, Lee K, Park SC, Kim BS, Cho YJ et al. EzEditor: a versatile sequence alignment editor for both rRNA- and protein-coding genes. Int J Syst Evol Microbiol 2014;64:689–691 [CrossRef][PubMed]
-
14. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014;30:1312–1313 [CrossRef][PubMed]
-
15. Myers EW, Miller W. Optimal alignments in linear space. Comput Appl Biosci 1988;4:11–17 [CrossRef][PubMed]
-
16. Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 2010;26:2460–2461 [CrossRef][PubMed]
-
17. Teeling H, Meyerdierks A, Bauer M, Amann R, Glöckner FO. Application of tetranucleotide frequencies for the assignment of genomic fragments. Environ Microbiol 2004;6:938–947 [CrossRef][PubMed]
-
18. Langille MG, Zaneveld J, Caporaso JG, Mcdonald D, Knights D et al. Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat Biotechnol 2013;31:814–821 [CrossRef][PubMed]
-
19. Richter M, Rosselló-Móra R, Oliver Glöckner F, Peplies J. JSpeciesWS: a web server for prokaryotic species circumscription based on pairwise genome comparison. Bioinformatics 2016;32:929–931 [CrossRef][PubMed]
-
20. Kirby BM, Everest GJ, Meyers PR. Phylogenetic analysis of the genus Kribbella based on the gyrB gene: proposal of a gyrB-sequence threshold for species delineation in the genus Kribbella. Antonie van Leeuwenhoek 2010;97:131–142 [CrossRef][PubMed]
-
21. Thompson CC, Thompson FL, Vandemeulebroecke K, Hoste B, Dawyndt P et al. Use of recA as an alternative phylogenetic marker in the family Vibrionaceae. Int J Syst Evol Microbiol 2004;54:919–924 [CrossRef][PubMed]
-
22. Kim M, Park SC, Baek I, Chun J. Large-scale evaluation of experimentally determined DNA G+C contents with whole genome sequences of prokaryotes. Syst Appl Microbiol 2015;38:79–83 [CrossRef][PubMed]

Supplementary Data
Data loading....

Article metrics loading...

Full text loading...
Author and Article Information
-
This Journal
/content/journal/ijsem/10.1099/ijsem.0.001755dcterms_title,dcterms_subject,pub_serialTitlepub_serialIdent:journal/ijsem AND -contentType:BlogPost104 -
Other Society Journals
/content/journal/ijsem/10.1099/ijsem.0.001755dcterms_title,dcterms_subject-pub_serialIdent:journal/ijsem AND -contentType:BlogPost104 -
PubMed
-
Google Scholar
Figure data loading....