Nucleic Acids Research, 2002, Vol. 30, No. 1 47-49
© 2002 Oxford University Press
BRENDA, enzyme data and metabolic information
University of Cologne, Institute of Biochemistry, Zülpicher Straße 47, 50674 Köln, Germany
Received August 21, 2001; Revised and Accepted October 10, 2001.
ABSTRACT
BRENDA is a comprehensive relational database of functional and molecular information on enzymes, based on the primary literature. The database contains information extracted and evaluated from approximately 46 000 references, holding data on at least 40 000 different enzymes from more than 6900 different organisms, classified in approximately 3900 EC numbers. BRENDA is an important tool for biochemical and medical research, covering the properties of all classified enzymes, including data on occurrence, catalyzed reaction, kinetics, substrates/products, inhibitors, cofactors, activators, structure and stability. All data are connected to literature references, which in turn are linked to PubMed. The data and information provide a fundamental tool for research on enzyme mechanisms, metabolic pathways and the evolution of metabolism and, furthermore, for medical diagnostics and pharmaceutical research. The database is a resource for data on enzymes, classified according to the EC system of the IUBMB Enzyme Nomenclature Committee, and the entries are cross-referenced to other databases, i.e. organism classification, protein sequence, protein structure and literature references. BRENDA provides academic web access at http://www.brenda.uni-koeln.de.
INTRODUCTION
BRENDA (BRaunschweig ENzyme DAtabase) was created in 1987 at the German National Research Center for Biotechnology in Braunschweig (GBF) and is now continued at the University of Cologne, Institute of Biochemistry. This enzyme information system was developed to collect and store enzyme functional data and has been an ongoing effort for more than 10 years. It was first published as a series of books [Enzyme Handbook, Springer (1)], with the intention from the very beginning of providing the data in a database as a retrieval system.
In the last few years all information has been transferred from a full text to a relational database system and is accessible to the academic community from http://www.brenda.uni-koeln.de. Commercial users have to purchase a license at http://www.science-factory.com.
Enzymes, the largest and most diverse group among the proteins, play an essential role in the metabolism of each organism. All chemical reactions and metabolic steps within the cell are catalyzed and regulated by enzymes. The development and progress of projects on structural and functional genomics suggest that the systematic collection and accessibility of functional information of gene products are indispensable to understanding biological functions and the correlation between phenotype and genotype.
BRENDA is a protein function database containing comprehensive enzymatic and metabolic data that are extracted from the primary literature, evaluated and continuously updated. The key developments of the last few years were the conversion of the database to an organism-specific information system, improved validation and correction of the data, and the standardization of the entries, which together create the prerequisites for systematic access and analysis.
CONTENTS OF BRENDA
BRENDA contains all enzymes classified according to the system of the EC numbers, which was implemented in 1955 by the International Commission of Enzymes [now the International Union of Biochemistry and Molecular Biology, IUBMB (2)]. This nomenclature is based on the reaction an enzyme catalyzes and not on the individual enzyme molecule. Presently BRENDA contains data on approximately 3900 EC numbers, which represent more than 40 000 different protein molecules, counted as combinations of EC number and organism. In many cases an organism has more than one enzyme with the same EC number, but because the functional data given in the primary literature are rarely associated with a specific sequence, a more reliable estimate is not possible at present; this will change with the progress of the genome sequencing projects.
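The counting rule described above, one enzyme per combination of EC number and organism, can be illustrated with a minimal sketch. The records and the function name `count_enzymes` are hypothetical and serve only to show the deduplication; they are not taken from BRENDA.

```python
# Hypothetical literature-derived records: (EC number, organism).
# The rows are illustrative and are not copied from BRENDA.
records = [
    ("1.1.1.1", "Homo sapiens"),
    ("1.1.1.1", "Saccharomyces cerevisiae"),
    ("1.1.1.1", "Homo sapiens"),  # a second reference for the same pair
    ("3.4.21.4", "Bos taurus"),
]


def count_enzymes(records):
    """Return (distinct EC numbers, distinct EC number/organism pairs)."""
    ec_numbers = {ec for ec, _organism in records}
    pairs = set(records)  # duplicates of the same pair collapse here
    return len(ec_numbers), len(pairs)


n_ec, n_pairs = count_enzymes(records)
print(n_ec, n_pairs)
```

Because isoenzymes within one organism share both the EC number and the organism, a count of this kind can only be a lower bound, as the text notes.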
The database covers organism-specific information on functional and molecular properties, in detail on the nomenclature, reaction and specificity, enzyme structure, stability, application and engineering, organism, ligands, literature references and links to other databases (Table 1).
The data for all enzymes having the same EC number are periodically updated by manual extraction of parameters from the literature references accessible via literature databases, i.e. Chemical Abstracts and PubMed [NCBI (3)], and the full information for each EC number is continuously checked for internal inconsistencies. Depending on scientific needs and the progress of research, the data fields are subject to ongoing development.
The data and information in BRENDA are stored in 52 tables containing approximately 460 000 entries directly extracted from the primary literature, held in a relational database system to enable different search features. Enzymes can be searched by their EC numbers (3870 entries), their names or synonyms (22 936 entries) or by the organisms (6921 single entries) in which the enzyme reaction has been detected. All other information fields (Table 1) can be searched individually or by combination searches, which can be performed organism specifically. It is therefore possible to find a specific enzyme for a specific organism or even for a specific organ or tissue. Furthermore, a search for ligands that may have a dual role (e.g. substrate/inhibitor or cofactor/inhibitor) may be performed. Kinetic data for enzyme-ligand interactions can also be searched.
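As an illustration of how such an organism-specific combination search might look in a relational system, the following sketch uses Python's built-in `sqlite3` module. The two-table layout, all table and column names, and the sample rows are assumptions made for this example; the actual BRENDA schema is not described here.

```python
import sqlite3

# Hypothetical two-table layout. Every table and column name is an
# assumption for illustration, not the real BRENDA schema.
SCHEMA = """
CREATE TABLE enzyme (
    id INTEGER PRIMARY KEY,
    ec_number TEXT NOT NULL,
    organism TEXT NOT NULL,
    tissue TEXT
);
CREATE TABLE synonym (
    ec_number TEXT NOT NULL,
    name TEXT NOT NULL
);
"""


def build_demo_db():
    """Create an in-memory database with a few illustrative rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO enzyme (ec_number, organism, tissue) VALUES (?, ?, ?)",
        [
            ("1.1.1.1", "Homo sapiens", "liver"),
            ("1.1.1.1", "Saccharomyces cerevisiae", None),
            ("3.4.21.4", "Bos taurus", "pancreas"),
        ],
    )
    conn.executemany(
        "INSERT INTO synonym (ec_number, name) VALUES (?, ?)",
        [("1.1.1.1", "alcohol dehydrogenase"), ("3.4.21.4", "trypsin")],
    )
    return conn


def search(conn, name, organism, tissue=None):
    """Combination search: enzyme name plus organism, optionally tissue."""
    sql = (
        "SELECT e.ec_number, e.organism, e.tissue "
        "FROM enzyme AS e JOIN synonym AS s ON s.ec_number = e.ec_number "
        "WHERE s.name = ? AND e.organism = ?"
    )
    params = [name, organism]
    if tissue is not None:
        sql += " AND e.tissue = ?"
        params.append(tissue)
    return conn.execute(sql, params).fetchall()


conn = build_demo_db()
print(search(conn, "alcohol dehydrogenase", "Homo sapiens", tissue="liver"))
```

Keeping names and synonyms in a separate table lets one EC number carry many names without duplicating the organism-specific rows, which is one plausible reason for a relational layout.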
LIGANDS
A major part of BRENDA is the information on ligands, which function as natural or in vitro substrates/products, inhibitors, activating compounds, cofactors, bound metals, etc. Altogether, approximately 320 000 enzyme-ligand relationships are stored, with more than 33 000 different chemical compounds functioning as ligands. In BRENDA the ligands are stored as compound names, as SMILES (4) strings and as Molfiles. The latter two forms are interchangeable with respect to the connectivity information. The two-dimensional chemical structures of these compounds can be displayed as images.
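A minimal sketch of how enzyme-ligand relationships could be represented and queried for ligands with a dual role is given below. The record type `LigandRelation`, the placeholder enzyme identifiers "EC-A" and "EC-B" and the sample rows are hypothetical; only the two SMILES strings are standard notation for the named compounds.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class LigandRelation:
    enzyme: str  # placeholder enzyme identifier (hypothetical)
    ligand: str  # compound name
    smiles: str  # SMILES string of the compound
    role: str    # e.g. "substrate", "product", "inhibitor"


# Illustrative rows only; they are not copied from BRENDA.
relations = [
    LigandRelation("EC-A", "ethanol", "CCO", "substrate"),
    LigandRelation("EC-A", "acetaldehyde", "CC=O", "product"),
    LigandRelation("EC-B", "ethanol", "CCO", "inhibitor"),
]


def dual_role_ligands(relations):
    """Map each ligand that occurs in more than one role to its sorted roles."""
    roles = defaultdict(set)
    for rel in relations:
        roles[rel.ligand].add(rel.role)
    return {name: sorted(r) for name, r in roles.items() if len(r) > 1}


print(dual_role_ligands(relations))
```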
METABOLISM
The data in BRENDA allow the calculation or simulation of metabolic pathways by extracting the information of substrate/product chains and the corresponding kinetic data of the preceding and following enzymes in the Boehringer and KEGG metabolism (with the risk of including pathways with non-natural compounds).
Based on the representation of metabolic networks as directed graphs, navigation operations will be made possible. These will answer questions on the structure of metabolic paths, e.g. on the shortest or alternative paths for different organisms.
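The directed-graph view can be sketched as follows: compounds are nodes, and each edge leads from a substrate to a product and is labelled with the EC number of the catalyzing enzyme. The breadth-first search below is a generic technique chosen for this illustration, not a description of the BRENDA implementation, and the small edge list (a few well-known steps of glucose metabolism) is an assumed example.

```python
from collections import deque

# Illustrative substrate -> product edges labelled with EC numbers.
edges = [
    ("glucose", "2.7.1.1", "glucose 6-phosphate"),
    ("glucose 6-phosphate", "5.3.1.9", "fructose 6-phosphate"),
    ("fructose 6-phosphate", "2.7.1.11", "fructose 1,6-bisphosphate"),
    ("glucose 6-phosphate", "1.1.1.49", "6-phosphogluconolactone"),
]


def shortest_path(edges, start, goal):
    """Breadth-first search over directed edges.

    Returns a list of (substrate, ec_number, product) steps, an empty list
    if start equals goal, or None if the goal cannot be reached.
    """
    graph = {}
    for substrate, ec, product in edges:
        graph.setdefault(substrate, []).append((ec, product))

    came_from = {start: None}  # node -> (previous node, EC number)
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            break
        for ec, nxt in graph.get(node, []):
            if nxt not in came_from:
                came_from[nxt] = (node, ec)
                queue.append(nxt)

    if goal not in came_from:
        return None

    # Walk back from the goal to the start to recover the steps.
    steps = []
    node = goal
    while came_from[node] is not None:
        prev, ec = came_from[node]
        steps.append((prev, ec, node))
        node = prev
    steps.reverse()
    return steps


for step in shortest_path(edges, "glucose", "fructose 1,6-bisphosphate"):
    print(step)
```

Organism-specific questions could be handled in the same way by restricting the edge list to enzymes reported for one organism before searching.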
ENZYME AND DISEASE INFORMATION
In order to keep up with the quickly growing scientific literature, automatic information extraction techniques were tested for adding disease-related knowledge to BRENDA. References in electronic format are taken from the PubMed database, parsed for relevant key phrases and associated with the corresponding enzymes. Information on 789 enzymes and their associated human diseases has been included in the BRENDA database (5).
Additionally, the Online Mendelian Inheritance in Man [OMIM (2,6)] repository, a well-annotated catalog of human genes and genetic disorders, was parsed for enzyme information. In this way a total of 630 EC numbers in BRENDA could be linked to 2100 OMIM entries.
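The key-phrase approach described above can be illustrated with a deliberately simple sketch that links an enzyme to a disease when both terms occur in the same sentence. The term lists, the function name `extract_links` and the example text are assumptions made for this illustration; the extraction techniques actually tested are not specified here and are likely to be more elaborate.

```python
import re

# Hypothetical lookup tables; real term lists would be far larger.
ENZYME_TERMS = {
    "phenylalanine hydroxylase": "1.14.16.1",
    "glucose-6-phosphate dehydrogenase": "1.1.1.49",
}
DISEASE_TERMS = ["phenylketonuria", "hemolytic anemia"]


def extract_links(text):
    """Return sorted (EC number, disease) pairs that co-occur in a sentence."""
    links = set()
    # Split on whitespace that follows sentence-ending punctuation.
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        lowered = sentence.lower()
        enzymes = [ec for name, ec in ENZYME_TERMS.items() if name in lowered]
        diseases = [d for d in DISEASE_TERMS if d in lowered]
        links.update((ec, d) for ec in enzymes for d in diseases)
    return sorted(links)


# Invented example text, not a real PubMed abstract.
abstract = (
    "Mutations in phenylalanine hydroxylase cause phenylketonuria. "
    "Glucose-6-phosphate dehydrogenase activity was normal in all patients."
)
print(extract_links(abstract))
```

Sentence-level co-occurrence is easy to implement but produces false positives, for example in negated statements, so links found this way would still need evaluation.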
FOOTNOTES
* To whom correspondence should be addressed. Tel: +49 221 470 6440; Fax: +49 221 470 5092; Email: d.schomburg{at}uni-koeln.de
REFERENCES
1 Schomburg,D. and Schomburg,I. (2001) Springer Handbook of Enzymes, 2nd edn. Springer, Heidelberg, Germany.
2 ,
3 Wheeler,D.L., Church,D.M., Lash,A.E., Leipe,D.D., Madden,T.L., Pontius,J.U., Schuler,G.D., Schriml,L.M., Tatusova,T.A., Wagner,L. and Rapp,B.A. (2001) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res., 29, 11-16. Updated article in this issue: Nucleic Acids Res. (2002), 30, 13-16.
4 Weininger,D. (1988) SMILES 1. Introduction and encoding rules. J. Chem. Inf. Comput. Sci., 28, 31-36.
5 Schomburg,I., Hofmann,O., Baensch,C., Chang,A. and Schomburg,D. (2000) Enzyme data and metabolic information: BRENDA, a resource for research in biology, biochemistry, and medicine. Gene Funct. Dis., 3-4, 109-118.
6 McKusick,V.A. (1998) Mendelian Inheritance in Man. Catalogs of Human Genes and Genetic Disorders, 12th edn. The Johns Hopkins University Press, Baltimore, MD.