Nucleic Acids Research, 2002, Vol. 30, No. 1 354-360
© 2002 Oxford University Press
Olfactory Receptor Database: a metadata-driven automated population from sources of gene and protein sequences
1Center for Medical Informatics, 2Section for Neurobiology and 3Department of Molecular, Cellular and Developmental Biology, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 05611, USA
The Olfactory Receptor Database (ORDB; http://senselab.med.yale.edu/senselab/ordb) is a central repository of olfactory receptor (OR) and olfactory receptor-like gene and protein sequences. To deal with the very large OR gene family, we have constructed an algorithm that automatically downloads sequences from web sources such as GenBank and SWISS-PROT into the database. The algorithm uses hypertext markup language (HTML) parsing techniques that extract information relevant to ORDB. The information is then correlated with the metadata in the ORDB knowledge base to encode the unstructured text extracted into the structured format compliant with the database architecture, entity attribute value with classes and relationship (EAV/CR), which supports the SenseLab project as a whole. Three population methods: batch, automatic and semi-automatic population are discussed. The data is imported into the database using extensible markup language (XML).
* To whom correspondence should be addressed at: Center for Medical Informatics, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 05611, USA. Tel: +1 203 764 9136; Fax: +1 203 764 6717; Email: chiquito.crasto{at}yale.edu
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
L. French and P. Pavlidis Informatics in neuroscience Brief Bioinform, November 1, 2007; 8(6): 446 - 456. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. J. Crasto, L. N. Marenco, N. Liu, T. M. Morse, K.-H. Cheung, P. C. Lai, G. Bahl, P. Masiar, H. Y.K. Lam, E. Lim, et al. SenseLab: new developments in disseminating neuroscience information Brief Bioinform, May 17, 2007; (2007) bbm018v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Emes, S. A. Beatson, C. P. Ponting, and L. Goodstadt Evolution and Comparative Genomics of Odorant- and Pheromone-Associated Genes in Rodents Genome Res., April 1, 2004; 14(4): 591 - 602. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Marenco, N. Tosches, C. Crasto, G. Shepherd, P. L. Miller, and P. M. Nadkarni Achieving Evolvable Web-Database Bioscience Applications Using the EAV/CR Framework: Recent Advances J. Am. Med. Inform. Assoc., September 1, 2003; 10(5): 444 - 453. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Young and B. J. Trask The sense of smell: genomics of vertebrate odorant receptors Hum. Mol. Genet., May 15, 2002; 11(10): 1153 - 1160. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Lo Conte, S. E. Brenner, T. J. P. Hubbard, C. Chothia, and A. G. Murzin SCOP database in 2002: refinements accommodate structural genomics Nucleic Acids Res., January 1, 2002; 30(1): 264 - 267. [Abstract] [Full Text] [PDF] |
||||




