Nucleic Acids Research, 2001, Vol. 29, No. 1 22-28
© 2001 Oxford University Press
The COG database: new developments in phylogenetic classification of proteins from complete genomes
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
The database of Clusters of Orthologous Groups of proteins (COGs), which represents an attempt at a phylogenetic classification of the proteins encoded in complete genomes, currently consists of 2791 COGs including 45 350 proteins from 30 genomes of bacteria, archaea and the yeast Saccharomyces cerevisiae (http://www.ncbi.nlm.nih.gov/COG). In addition, a supplement to the COGs is available that includes those proteins encoded in the genomes of two multicellular eukaryotes, the nematode Caenorhabditis elegans and the fruit fly Drosophila melanogaster, that are shared with bacteria and/or archaea. The new features added to the COG database include information pages with structural and functional details on each COG and literature references, improvements to the COGNITOR program that is used to fit new proteins into the COGs, and classification of genomes and COGs constructed by using principal component analysis.
* To whom correspondence should be addressed. Tel: +1 301 435 5913; Fax: +1 301 480 9241; Email: koonin@ncbi.nlm.nih.gov