Nucleic Acids Research, 2003, Vol. 31, No. 1 348-352
© 2003 Oxford University Press
ProtoNet: hierarchical classification of the protein space
School of Computer Science and Engineering, Institute of Life Sciences, The Hebrew University, Jerusalem 91904, Israel 1 Department of Biological Chemistry, Institute of Life Sciences, The Hebrew University, Jerusalem 91904, Israel
*To whom correspondence should be addressed. Tel: +972 26585425; Fax: +972 26586448; Email: michall{at}mail.ls.huji.ac.il
ABSTRACT
The ProtoNet site provides an automatic hierarchical clustering of the SWISS-PROT protein database. The clustering is based on an all-against-all BLAST similarity search. The similarities' E-score is used to perform a continuous bottom-up clustering process by applying alternative rules for merging clusters. The outcome of this clustering process is a classification of the input proteins into a hierarchy of clusters of varying degrees of granularity. ProtoNet (version 1.3) is accessible in the form of an interactive web site at http://www.protonet.cs.huji.ac.il. ProtoNet provides navigation tools for monitoring the clustering process with a vertical and horizontal view. Each cluster at any level of the hierarchy is assigned with a statistical index, indicating the level of purity based on biological keywords such as those provided by SWISS-PROT and InterPro. ProtoNet can be used for function prediction, for defining superfamilies and subfamilies and for large-scale protein annotation purposes.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
V. G. Tarcea, T. Weymouth, A. Ade, A. Bookvich, J. Gao, V. Mahavisno, Z. Wright, A. Chapman, M. Jayapandian, A. Ozgur, et al. Michigan molecular interactions r2: from interacting proteins to pathways Nucleic Acids Res., October 31, 2008; (2008) gkn722v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Brilli, R. Fani, and P. Lio Current trends in the bioinformatic sequence analysis of metabolic pathways in prokaryotes Brief Bioinform, January 1, 2008; 9(1): 34 - 45. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Xiong, C. E. Bauer, and A. Pancholy Insight into the haem d1 biosynthesis pathway in heliobacteria through bioinformatics analysis Microbiology, October 1, 2007; 153(10): 3548 - 3562. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Jayapandian, A. Chapman, V. G. Tarcea, C. Yu, A. Elkiss, A. Ianni, B. Liu, A. Nandi, C. Santos, P. Andrews, et al. Michigan Molecular Interactions (MiMI): putting the jigsaw puzzle together Nucleic Acids Res., January 12, 2007; 35(suppl_1): D566 - D571. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-J. Han and S. Y. Lee The Escherichia coli Proteome: Past, Present, and Future Prospects Microbiol. Mol. Biol. Rev., June 1, 2006; 70(2): 362 - 439. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Sasson, N. Kaplan, and M. Linial Functional annotation prediction: All for one and one for all Protein Sci., June 1, 2006; 15(6): 1557 - 1562. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Uchiyama Hierarchical clustering algorithm for comprehensive orthologous-domain classification in multiple genomes Nucleic Acids Res., January 25, 2006; 34(2): 647 - 658. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Rattei, R. Arnold, P. Tischler, D. Lindner, V. Stumpflen, and H. W. Mewes SIMAP: the similarity matrix of proteins Nucleic Acids Res., January 1, 2006; 34(suppl_1): D252 - D256. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Bahir and M. Linial ProTeus: identifying signatures in protein termini Nucleic Acids Res., July 1, 2005; 33(suppl_2): W277 - W280. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Kifer, O. Sasson, and M. Linial Predicting fold novelty based on ProtoNet hierarchical classification Bioinformatics, April 1, 2005; 21(7): 1020 - 1027. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. J. Su, L. Lu, S. Saxonov, and D. L. Brutlag eBLOCKs: enumerating conserved protein blocks to achieve maximal sensitivity and specificity Nucleic Acids Res., January 1, 2005; 33(suppl_1): D178 - D182. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Kaplan, O. Sasson, U. Inbar, M. Friedlich, M. Fromer, H. Fleischer, E. Portugaly, N. Linial, and M. Linial ProtoNet 4.0: A hierarchical classification of one million protein sequences Nucleic Acids Res., January 1, 2005; 33(suppl_1): D216 - D218. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Mohseni-Zadeh, A. Louis, P. Brezellec, and J.-L. Risler PHYTOPROT: a database of clusters of plant proteins Nucleic Acids Res., January 1, 2004; 32(90001): D351 - 353. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Kaplan, A. Vaaknin, and M. Linial PANDORA: keyword-based analysis of protein sets by integration of annotation sources Nucleic Acids Res., October 1, 2003; 31(19): 5617 - 5626. [Abstract] [Full Text] [PDF] |
||||





