Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (489K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (57)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Bahl, A.
Right arrow Articles by Stoeckert, C. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bahl, A.
Right arrow Articles by Stoeckert, C. J., Jr
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2002, Vol. 30, No. 1 87-90
© 2002 Oxford University Press

PlasmoDB: the Plasmodium genome resource. An integrated database providing tools for accessing, analyzing and mapping expression and sequence data (both finished and unfinished)

Amit Bahl, Brian Brunk1, Ross L. Coppel2, Jonathan Crabtree1, Sharon J. Diskin1, Martin J. Fraunholz, Gregory R. Grant1, Dinesh Gupta, Robert L. Huestis2, Jessica C. Kissinger, Philip Labo, Li Li, Shannon K. McWeeney1, Arthur J. Milgram, David S. Roos*, Jonathan Schug1 and Christian J. Stoeckert Jr1

Department of Biology, University of Pennsylvania, 415 South University Avenue, Philadelphia, PA 19104-6018, USA, 1Center for Bioinformatics, University of Pennsylvania, Philadelphia, PA, USA and 2Department of Microbiology, Monash University, Clayton, Victoria 3800, Australia

Received September 17, 2001; Accepted October 2, 2001.


    ABSTRACT
 TOP
 ABSTRACT
 SCOPE
 CONTENT OF THE CURRENT...
 FUTURE PLANS
 DATA ACCESS AND DATA...
 ADDITIONAL INFORMATION
 REFERENCES
 
PlasmoDB (http://PlasmoDB.org) is the official database of the Plasmodium falciparum genome sequencing consortium. This resource incorporates finished and draft genome sequence data and annotation emerging from Plasmodium sequencing projects. PlasmoDB currently houses information from five parasite species and provides tools for cross-species comparisons. Sequence information is also integrated with other genomic-scale data emerging from the Plasmodium research community, including gene expression analysis from EST, SAGE and microarray projects. The relational schemas used to build PlasmoDB [Genomics Unified Schema (GUS) and RNA Abundance Database (RAD)] employ a highly structured format to accommodate the diverse data types generated by sequence and expression projects. A variety of tools allow researchers to formulate complex, biologically based queries of the database. A version of the database is also available on CD-ROM (Plasmodium GenePlot), facilitating access to the data in situations where Internet access is difficult (e.g. by malaria researchers working in the field). The goal of PlasmoDB is to enhance utilization of the vast quantities of data emerging from genome-scale projects by the global malaria research community.


    SCOPE
 TOP
 ABSTRACT
 SCOPE
 CONTENT OF THE CURRENT...
 FUTURE PLANS
 DATA ACCESS AND DATA...
 ADDITIONAL INFORMATION
 REFERENCES
 
The disease malaria is caused by members of the genus Plasmodium. There are more than 100 recognized species of Plasmodium, four of which infect humans. One species, Plasmodium falciparum, is the causative agent of cerebral malaria and is responsible for the deaths of more than 1 million people each year, mostly children in sub-Saharan Africa. The P.falciparum genome project was initiated in 1996 by an international consortium (1), and PlasmoDB, the official database of this consortium, went live in June 2000 (2).

While the data sources and available tools at PlasmoDB have increased dramatically over the last year, the philosophy behind PlasmoDB has remained unchanged. Our goals, as illustrated in Figure 1, are: (i) to provide a single, user-friendly point of access for the data emerging from the three genome sequencing centers involved in the sequencing of Plasmodium genomes (Sanger, Stanford and TIGR/NMRC) and from the malaria research community; (ii) to provide tools for viewing and querying both finished and unfinished genomic sequence data and ancillary information; (iii) to integrate genomic data with other types of data, including (but not limited to) gene expression data and mapping data; (iv) to provide a database with the resources and tools to handle cross-species comparisons as data from other Plasmodium species become available; and (v) to ensure that scientists everywhere, including researchers in the field and other Internet-limited areas, have access to the data via an easily-obtainable, user-friendly and query-capable CD-ROM version of the database, Plasmodium GenePlot.



View larger version (42K):
[in this window]
[in a new window]
 
Figure 1. Data for PlasmoDB is gathered from several sources and stored in relational databases, GUS and RAD, before it is integrated and processed, then served to users on the World Wide Web or on CD. Dashed lines indicate items that will be added in the future; QTL, quantitative trait loci; POP, populations studies; SNP, single nucleotide polymorphisms.

 

    CONTENT OF THE CURRENT RELEASE
 TOP
 ABSTRACT
 SCOPE
 CONTENT OF THE CURRENT...
 FUTURE PLANS
 DATA ACCESS AND DATA...
 ADDITIONAL INFORMATION
 REFERENCES
 
Data
Version 3.2 of the database contains sequence (Genomic, GSS and EST) and mapping data (microsatellite and optical), official gene annotations and expression data (EST, SAGE and microarray). Additional data sets are created from the genomic sequence data in PlasmoDB via a series of automated analyses, including gene predictions (using various algorithms), protein feature predictions (signal peptides, transmembrane domains, secondary structure, hydrophobicity plots, AA content, Pfam or ProSite domains, etc.) and Gene Ontology (GO) function predictions.

The genomic sequence data available in the current release of PlasmoDB (v3.2) consists of finished and unfinished sequences for five Plasmodium species (P.falciparum, P.yoelii, P.chabaudi, P.berghei and P.vivax). In the October 14, 2001 release, 7 of the 14 P.falciparum chromosomes are complete, and the entire genome is represented by 1183 contiguous sequences (contigs). Data for five additional Plasmodium species (including 5-fold genomic coverage for P.yoelii, 3-fold coverage for P.knowlesi, and various GSS, YAC and EST sequences for these speicies, P.berghei, P.chabaudi and P.vivax) are also available, and have been mapped to the P.falciparum genomic sequence. Additional data from ongoing sequencing projects focused on several Plasmodium species will be incorporated into PlasmoDB as they become available.

Official annotations, automated predictions, mapping information and expression data are available for P.falciparum. BLAST similarities to the NCBI non-redundant protein database are available for P.falciparum and P.yoelii, as are similarity comparisons between these two species (Fig. 2). All sequence information is available as BLAST-searchable databases, and sequence data along with any accompanying official annotations or automated analyses are available for download in FASTA, GenBank and EMBL file formats.



View larger version (52K):
[in this window]
[in a new window]
 
Figure 2. Composite of three screen shots demonstrating PlasmoDB features for the adenylosuccinate lyase gene, PFB0295w, located on Chr2. The official annotation from TIGR is shown in (A). A graphical view of protein features is available from the link in (B), which shows BLAST hits, predicted secondary structure, domains and motifs and signal peptides. A graphical view of the genomic sequence surrounding the gene is available from the link in (C) which shows BLASTX similarities to the non-redundant protein database (NRDB; the quality of the hit similarity is indicated by the color of the box) proteins and the location of SAGE tags, ESTs and GSSs, automated gene predictions and official annotation. Similarities to P.yoelii are shown in (D) as well as BLAST hits to the NRDB (E), predicted protein motifs (F) and predicted GO functions (G). Each part of the display is a link to further information.

 
Tools/queries
In addition to the usual data-browsing interface (much of which is accessible from the front page), PlasmoDB offers a powerful, yet easy-to-use, query interface for identifying genes of interest. Basic queries on sequence information stored in GUS (3) cover text keyword matching, sequence similarities, intron/exon structure, protein domains, location, prediction method, predicted function and expression data stored in RAD (4). A history mechanism tracks queries executed in each session allowing users to combine query results, revisit searches or download sequences. A BLAST server is available for both DNA and predicted amino acid sequences.

Plasmodium GenePlot
The Plasmodium GenePlot CD-ROM is a stand-alone platform-independent compilation of all available finished and unfinished DNA sequences from the P.falciparum genome. The data are accessible online, or via CD-ROM. Data are organized by chromosome, and the results of three different gene prediction algorithms and BLAST analysis against the GenBank NRDB are presented graphically, along with official annotations (where available). Annotations, BLAST hits and gene predictions are dynamically searchable by chromosome. Features and annotations can be reviewed via mouse pointer, and features are retrievable by clicking on the image. Tools provided on the CD allow the user to manipulate (translate, reverse complement, convert file format, etc.) any DNA sequence, either from the CD or provided by the user. The Plasmodium GenePlot CD can be used on any computer platform with a compatible web browser (Netscape Communicator 4.7 or Internet Explorer 5.0).

Tutorials and information on how to use the database
In an effort to provide better guidance for the use of tools and database queries that may be new to users, a graphical web-based tutorial has been established. This tutorial is located on the front page of the database and registration is not required for its use. A ‘NEWS’ link has also been added to explain features, data and tools added since the last release, and news stories are archived under ‘What’s new at PlasmoDB’. A ‘?’ link has been added to the menus to provide information on each of the available tools/queries.


    FUTURE PLANS
 TOP
 ABSTRACT
 SCOPE
 CONTENT OF THE CURRENT...
 FUTURE PLANS
 DATA ACCESS AND DATA...
 ADDITIONAL INFORMATION
 REFERENCES
 
1. PlasmoDB will incorporate official gene and other sequence annotation released by the sequencing centers, while retaining automated predictions to provide alternative views.

2. GenePlot CDs will be linked to entries in PlasmoDB (when Internet access is available) and PlasmoDB will be linked to other databases as these become available.

3. We anticipate rapid growth in the volume of gene expression data and sequence information on related species.

4. New data types to be incorporated include population polymorphisms (including SNPs), QTL data and metabolic pathways.

5. Finally, we look forward to feedback from the malaria community to guide the development and prioritization of new features and data.


    DATA ACCESS AND DATA DEPOSITION
 TOP
 ABSTRACT
 SCOPE
 CONTENT OF THE CURRENT...
 FUTURE PLANS
 DATA ACCESS AND DATA...
 ADDITIONAL INFORMATION
 REFERENCES
 
PlasmoDB is located at http://PlasmoDB.org and the Plasmodium GenePlot CD is available via the same URL (interactively or as a downloadable image) or as a stand-alone CD-ROM. To obtain the GenePlot/WHO malaria CD-ROM set, contact Ross Coppel, Department of Microbiology, Monash University, Clayton, Victoria 3800, Australia; Email: ross.coppel{at}med.monash.edu.au To report errors, submit suggestions or contribute data to the database, contact PlasmoDB at help{at}PlasmoDB.org or contact the corresponding author.


    ADDITIONAL INFORMATION
 TOP
 ABSTRACT
 SCOPE
 CONTENT OF THE CURRENT...
 FUTURE PLANS
 DATA ACCESS AND DATA...
 ADDITIONAL INFORMATION
 REFERENCES
 
1. Data release policies for P.falciparum (http://PlasmoDB.org/drpf.shtml) and P.yoelii (http://PlasmoDB.org/drpy.shtml) genomic sequence data. Acceptance of these policies is required for full access to PlasmoDB.

2. Sources. Links to the various data sets that are available on PlasmoDB with links to the original data source and statistics (number of nucleotides, number of contigs, release date, contact name, etc.); http://PlasmoDB.org/bdbs.shtml.

3. Status. Statistics on progress of the P.falciparum genome sequencing effort. Links to each of the original data sources are provided along with statistics on the estimated size of each chromosome, release date for current data, status of sequence assembly and number of nucleotides available; http://PlasmoDB.org/about_data.shtml.

4. Tutorial. Walk through a graphic tutorial on the use of PlasmoDB; http://PlasmoDB.org/Tutorial.shtml.


    ACKNOWLEDGEMENTS
 
Financial support for PlasmoDB was provided by the Burroughs Wellcome Fund. Additional computational support was provided by Rahul Dave, Haralabos Tikos and the Eniac2000 project (supported in part through equipment donations from Intel and by the University of Pennsylvania Genomics Institute).

We thank the numerous researchers who have collaborated with, and contributed to, PlasmoDB by depositing data (both published and unpublished) in the database, by making software available and by making useful suggestions on how to improve this community resource.

We wish to thank the scientists and funding agencies comprising the international Malaria Genome Project for making sequence data from the genome of P.falciparum (3D7) public prior to publication of the completed sequence. The Sanger Centre (UK) provided sequences for chromosomes 1, 3–9 and 13, with financial support from the Wellcome Trust. A consortium composed of The Institute for Genome Research (TIGR) along with the Naval Medical Research Center (USA), sequenced chromosomes 2, 10, 11 and 14, with support from NIAID/NIH, the Burroughs Wellcome Fund and the Department of Defense. The Stanford Genome Technology Center (USA) sequenced chromosome 12, with support from the Burroughs Wellcome Fund.

Preliminary sequence and/or preliminary annotated sequence data from the P.yoelii genome was obtained from the TIGR website (www.tigr.org). This sequencing program is carried on in collaboration with the Naval Medical Research Center and is supported by the US Department of Defense. Given that this information is considered preliminary and may contain inaccuracies, it is expected that no one will publish analyses, based on the preliminary data contained in this web site, or of genes on a whole-chromosome or genome scale, without prior permission from TIGR, which plans to publish the completed and annotated sequence in a peer-reviewed journal.


    FOOTNOTES
 
* To whom correspondence should be addressed. Tel: +1 215 898 2118; Fax: +1 215 898 8780; Email: droos{at}sas.upenn.edu Back


    REFERENCES
 TOP
 ABSTRACT
 SCOPE
 CONTENT OF THE CURRENT...
 FUTURE PLANS
 DATA ACCESS AND DATA...
 ADDITIONAL INFORMATION
 REFERENCES
 

    1 Fletcher,C. (1998) The Plasmodium falciparum Genome Project. Parasitol. Today, 14, 342–344.

    2 The Plasmodium Genome Database Collaborative (2001) PlasmoDB: an integrative database of the Plasmodium falciparum genome. Tools for accessing and analyzing finished and unfinished sequence data. Nucleic Acids Res., 29, 66–69.[Abstract/Free Full Text]

    3 Davidson,S., Crabtree,J., Brunk,B.P., Schug,J., Tannen,V., Overton,G.C. and Stoeckert,C.J.,Jr (2001) K2/Klesli and GUS: experiments in integrated access to genomic data sources. IBM Systems J., 40, 512–531.

    4 Stoeckert,C., Pizarro,A., Manduchi,E., Gibson,M., Brunk,B., Crabtree,J., Schug,J., Shen-Orr,S. and Overton,C.G. (2001) A relational schema for both array-based and SAGE gene expression experiments. Bioinformatics, 17, 300–308.[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Biol. Chem.Home page
G. Pessi, J.-Y. Choi, J. M. Reynolds, D. R. Voelker, and C. B. Mamoun
In Vivo Evidence for the Specificity of Plasmodium falciparum Phosphoethanolamine Methyltransferase and Its Coupling to the Kennedy Pathway
J. Biol. Chem., April 1, 2005; 280(13): 12461 - 12466.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
L. Collins and D. Penny
Complex Spliceosomal Organization Ancestral to Extant Eukaryotes
Mol. Biol. Evol., April 1, 2005; 22(4): 1053 - 1066.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
R. Bourgon, M. Delorenzi, T. Sargeant, A. N. Hodder, B. S. Crabb, and T. P. Speed
The Serine Repeat Antigen (SERA) Gene Family Phylogeny in Plasmodium: The Impact of GC Content and Reconciliation of Gene and Species Trees
Mol. Biol. Evol., November 1, 2004; 21(11): 2161 - 2171.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. S. Janssen, R. S. Phillips, C. M. R. Turner, and M. P. Barrett
Plasmodium interspersed repeats: the major multigene superfamily of malaria parasites
Nucleic Acids Res., October 26, 2004; 32(19): 5712 - 5720.
[Abstract] [Full Text] [PDF]


Home page
J. Cell Sci.Home page
M. Yang, I. Coppens, S. Wormsley, P. Baevova, H. C. Hoppe, and K. A. Joiner
The Plasmodium falciparum Vps4 homolog mediates multivesicular body formation
J. Cell Sci., August 1, 2004; 117(17): 3831 - 3838.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
K. Chaudhary, J. A. Darling, L. M. Fohl, W. J. Sullivan Jr., R. G. K. Donald, E. R. Pfefferkorn, B. Ullman, and D. S. Roos
Purine Salvage Pathways in the Apicomplexan Parasite Toxoplasma gondii
J. Biol. Chem., July 23, 2004; 279(30): 31221 - 31227.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
T. Y. Sam-Yellowe, L. Florens, J. R. Johnson, T. Wang, J. A. Drazba, K. G. Le Roch, Y. Zhou, S. Batalov, D. J. Carucci, E. A. Winzeler, et al.
A Plasmodium Gene Family Encoding Maurer's Cleft Membrane Proteins: Structural Properties and Expression Profiling
Genome Res., June 1, 2004; 14(6): 1052 - 1059.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
N. L. Hiller, T. Akompong, J. S. Morrow, A. A. Holder, and K. Haldar
Identification of a Stomatin Orthologue in Vacuoles Induced in Human Erythrocytes by Malaria Parasites: A ROLE FOR MICROBIAL RAFT PROTEINS IN APICOMPLEXAN VACUOLE BIOGENESIS
J. Biol. Chem., November 28, 2003; 278(48): 48413 - 48421.
[Abstract] [Full Text] [PDF]


Home page
BloodHome page
C.-A. Lobo, M. Rodriguez, M. Reid, and S. Lustigman
Glycophorin C is the receptor for the Plasmodium falciparum erythrocyte binding ligand PfEBP-2 (baebl)
Blood, June 1, 2003; 101(11): 4628 - 4631.
[Abstract] [Full Text] [PDF]


Home page
Mol. Biol. CellHome page
T. Spielmann, D. J. P. Fergusen, and H.-P. Beck
etramps, a New Plasmodium falciparum Gene Family Coding for Developmentally Regulated and Highly Charged Membrane Proteins Located at the Parasite-Host Cell Interface
Mol. Biol. Cell, April 1, 2003; 14(4): 1529 - 1544.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Bahl, B. Brunk, J. Crabtree, M. J. Fraunholz, B. Gajria, G. R. Grant, H. Ginsburg, D. Gupta, J. C. Kissinger, P. Labo, et al.
PlasmoDB: the Plasmodium genome resource. A database integrating experimental and computational data
Nucleic Acids Res., January 1, 2003; 31(1): 212 - 215.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. C. Kissinger, B. Gajria, L. Li, I. T. Paulsen, and D. S. Roos
ToxoDB: accessing the Toxoplasma gondii genome
Nucleic Acids Res., January 1, 2003; 31(1): 234 - 236.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
D. C. Greenbaum, A. Baruch, M. Grainger, Z. Bozdech, K. F. Medzihradszky, J. Engel, J. DeRisi, A. A. Holder, and M. Bogyo
A Role for the Protease Falcipain 1 in Host Cell Invasion by the Human Malaria Parasite
Science, December 6, 2002; 298(5600): 2002 - 2006.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
D. L. Baldi, R. Good, M. T. Duraisingh, B. S. Crabb, and A. F. Cowman
Identification and Disruption of the Gene Encoding the Third Member of the Low-Molecular-Mass Rhoptry Complex in Plasmodium falciparum
Infect. Immun., September 1, 2002; 70(9): 5236 - 5245.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
J. L. Lovett, N. Marchesini, S. N. J. Moreno, and L. D. Sibley
Toxoplasma gondii Microneme Secretion Involves Intracellular Ca2+ Release from Inositol 1,4,5-Triphosphate (IP3)/Ryanodine-sensitive Stores
J. Biol. Chem., July 12, 2002; 277(29): 25870 - 25876.
[Abstract] [Full Text] [PDF]


Home page
J. Cell Biol.Home page
K. A. Joiner and D. S. Roos
Secretory traffic in the eukaryotic parasite Toxoplasma gondii: less is more
J. Cell Biol., May 13, 2002; 157(4): 557 - 563.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
B. Striepen, M. W. White, C. Li, M. N. Guerini, S.-B. Malik, J. M. Logsdon Jr., C. Liu, and M. S. Abrahamsen
Genetic complementation in apicomplexan parasites
PNAS, April 30, 2002; 99(9): 6304 - 6309.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
B. Striepen, M. W. White, C. Li, M. N. Guerini, S.-B. Malik, J. M. Logsdon Jr., C. Liu, and M. S. Abrahamsen
Genetic complementation in apicomplexan parasites
PNAS, April 30, 2002; 99(9): 6304 - 6309.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (489K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (57)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Bahl, A.
Right arrow Articles by Stoeckert, C. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bahl, A.
Right arrow Articles by Stoeckert, C. J., Jr
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?