Profiling the malaria genome

A gene survey of three species of malaria parasite with comparison to other apicomplexan species

Jane M R Carlton, Ralhston Muller, Charles A. Yowell, Michelle R. Fluegge, Kenneth A. Sturrock, Jonathan R. Pritt, Esmeralda Vargas-Serrato, Mary R. Galinski, John W. Barnwell, Nicola Mulder, Alexander Kanapin, Simon E. Cawley, Winston A. Hide, John B. Dame

Research output: Contribution to journalArticle

Abstract

We have undertaken the first comparative pilot gene discovery analysis of approximately 25 000 random genomic and expressed sequence tags (ESTs) from three species of Plasmodium, the infectious agent that causes malaria. A total of 5482 genome survey sequences (GSSs) and 5582 ESTs were generated from mung bean nuclease (MBN) and cDNA libraries, respectively, of the ANKA line of the rodent malaria parasite Plasmodium berghei, and 10 874 GSSs generated from MBN libraries of the Salvador I and Belem lines of Plasmodium vivax, the most geographically wide-spread human malaria pathogen. These tags, together with 2438 Plasmodium falciparum sequences present in GenBank, were used to perform first-pass assembly and transcript reconstruction, and non-redundant consensus sequence datasets created. The datasets were compared against public protein databases and more than 1000 putative new Plasmodium proteins identified based on sequence similarity. Homologs of previously characterized Plasmodium genes were also identified, increasing the number of P. vivax and P. berghei sequences in public databases at least 10-fold. Comparative studies with other species of Apicomplexa identified interesting homologs of possible therapeutic or diagnostic value. A gene prediction program, Phat, was used to predict probable open reading frames for proteins in all three datasets. Predicted and non-redundant BLAST-matched proteins were submitted to InterPro, an integrated database of protein domains, signatures and families, for functional classification. Thus a partial predicted proteome was created for each species. This first comparative analysis of Plasmodium protein coding sequences represents a valuable resource for further studies on the biology of this important pathogen.

Original languageEnglish (US)
Pages (from-to)201-210
Number of pages10
JournalMolecular and Biochemical Parasitology
Volume118
Issue number2
DOIs
StatePublished - 2001

Fingerprint

Plasmodium
Malaria
Parasites
Genome
Plasmodium vivax
Plasmodium berghei
Expressed Sequence Tags
Genes
Proteins
Databases
Apicomplexa
Protein Databases
Nucleic Acid Databases
Consensus Sequence
Genetic Association Studies
Proteome
Plasmodium falciparum
Gene Library
Open Reading Frames
Libraries

Keywords

  • Apicomplexa
  • Comparative genomics
  • Malaria
  • Proteome

ASJC Scopus subject areas

  • Molecular Biology
  • Parasitology

Cite this

Profiling the malaria genome : A gene survey of three species of malaria parasite with comparison to other apicomplexan species. / Carlton, Jane M R; Muller, Ralhston; Yowell, Charles A.; Fluegge, Michelle R.; Sturrock, Kenneth A.; Pritt, Jonathan R.; Vargas-Serrato, Esmeralda; Galinski, Mary R.; Barnwell, John W.; Mulder, Nicola; Kanapin, Alexander; Cawley, Simon E.; Hide, Winston A.; Dame, John B.

In: Molecular and Biochemical Parasitology, Vol. 118, No. 2, 2001, p. 201-210.

Research output: Contribution to journalArticle

Carlton, JMR, Muller, R, Yowell, CA, Fluegge, MR, Sturrock, KA, Pritt, JR, Vargas-Serrato, E, Galinski, MR, Barnwell, JW, Mulder, N, Kanapin, A, Cawley, SE, Hide, WA & Dame, JB 2001, 'Profiling the malaria genome: A gene survey of three species of malaria parasite with comparison to other apicomplexan species', Molecular and Biochemical Parasitology, vol. 118, no. 2, pp. 201-210. https://doi.org/10.1016/S0166-6851(01)00371-1
Carlton, Jane M R ; Muller, Ralhston ; Yowell, Charles A. ; Fluegge, Michelle R. ; Sturrock, Kenneth A. ; Pritt, Jonathan R. ; Vargas-Serrato, Esmeralda ; Galinski, Mary R. ; Barnwell, John W. ; Mulder, Nicola ; Kanapin, Alexander ; Cawley, Simon E. ; Hide, Winston A. ; Dame, John B. / Profiling the malaria genome : A gene survey of three species of malaria parasite with comparison to other apicomplexan species. In: Molecular and Biochemical Parasitology. 2001 ; Vol. 118, No. 2. pp. 201-210.
@article{a998a7b7dbf84a128969811e62e666d7,
title = "Profiling the malaria genome: A gene survey of three species of malaria parasite with comparison to other apicomplexan species",
abstract = "We have undertaken the first comparative pilot gene discovery analysis of approximately 25 000 random genomic and expressed sequence tags (ESTs) from three species of Plasmodium, the infectious agent that causes malaria. A total of 5482 genome survey sequences (GSSs) and 5582 ESTs were generated from mung bean nuclease (MBN) and cDNA libraries, respectively, of the ANKA line of the rodent malaria parasite Plasmodium berghei, and 10 874 GSSs generated from MBN libraries of the Salvador I and Belem lines of Plasmodium vivax, the most geographically wide-spread human malaria pathogen. These tags, together with 2438 Plasmodium falciparum sequences present in GenBank, were used to perform first-pass assembly and transcript reconstruction, and non-redundant consensus sequence datasets created. The datasets were compared against public protein databases and more than 1000 putative new Plasmodium proteins identified based on sequence similarity. Homologs of previously characterized Plasmodium genes were also identified, increasing the number of P. vivax and P. berghei sequences in public databases at least 10-fold. Comparative studies with other species of Apicomplexa identified interesting homologs of possible therapeutic or diagnostic value. A gene prediction program, Phat, was used to predict probable open reading frames for proteins in all three datasets. Predicted and non-redundant BLAST-matched proteins were submitted to InterPro, an integrated database of protein domains, signatures and families, for functional classification. Thus a partial predicted proteome was created for each species. This first comparative analysis of Plasmodium protein coding sequences represents a valuable resource for further studies on the biology of this important pathogen.",
keywords = "Apicomplexa, Comparative genomics, Malaria, Proteome",
author = "Carlton, {Jane M R} and Ralhston Muller and Yowell, {Charles A.} and Fluegge, {Michelle R.} and Sturrock, {Kenneth A.} and Pritt, {Jonathan R.} and Esmeralda Vargas-Serrato and Galinski, {Mary R.} and Barnwell, {John W.} and Nicola Mulder and Alexander Kanapin and Cawley, {Simon E.} and Hide, {Winston A.} and Dame, {John B.}",
year = "2001",
doi = "10.1016/S0166-6851(01)00371-1",
language = "English (US)",
volume = "118",
pages = "201--210",
journal = "Molecular and Biochemical Parasitology",
issn = "0166-6851",
publisher = "Elsevier",
number = "2",

}

TY - JOUR

T1 - Profiling the malaria genome

T2 - A gene survey of three species of malaria parasite with comparison to other apicomplexan species

AU - Carlton, Jane M R

AU - Muller, Ralhston

AU - Yowell, Charles A.

AU - Fluegge, Michelle R.

AU - Sturrock, Kenneth A.

AU - Pritt, Jonathan R.

AU - Vargas-Serrato, Esmeralda

AU - Galinski, Mary R.

AU - Barnwell, John W.

AU - Mulder, Nicola

AU - Kanapin, Alexander

AU - Cawley, Simon E.

AU - Hide, Winston A.

AU - Dame, John B.

PY - 2001

Y1 - 2001

N2 - We have undertaken the first comparative pilot gene discovery analysis of approximately 25 000 random genomic and expressed sequence tags (ESTs) from three species of Plasmodium, the infectious agent that causes malaria. A total of 5482 genome survey sequences (GSSs) and 5582 ESTs were generated from mung bean nuclease (MBN) and cDNA libraries, respectively, of the ANKA line of the rodent malaria parasite Plasmodium berghei, and 10 874 GSSs generated from MBN libraries of the Salvador I and Belem lines of Plasmodium vivax, the most geographically wide-spread human malaria pathogen. These tags, together with 2438 Plasmodium falciparum sequences present in GenBank, were used to perform first-pass assembly and transcript reconstruction, and non-redundant consensus sequence datasets created. The datasets were compared against public protein databases and more than 1000 putative new Plasmodium proteins identified based on sequence similarity. Homologs of previously characterized Plasmodium genes were also identified, increasing the number of P. vivax and P. berghei sequences in public databases at least 10-fold. Comparative studies with other species of Apicomplexa identified interesting homologs of possible therapeutic or diagnostic value. A gene prediction program, Phat, was used to predict probable open reading frames for proteins in all three datasets. Predicted and non-redundant BLAST-matched proteins were submitted to InterPro, an integrated database of protein domains, signatures and families, for functional classification. Thus a partial predicted proteome was created for each species. This first comparative analysis of Plasmodium protein coding sequences represents a valuable resource for further studies on the biology of this important pathogen.

AB - We have undertaken the first comparative pilot gene discovery analysis of approximately 25 000 random genomic and expressed sequence tags (ESTs) from three species of Plasmodium, the infectious agent that causes malaria. A total of 5482 genome survey sequences (GSSs) and 5582 ESTs were generated from mung bean nuclease (MBN) and cDNA libraries, respectively, of the ANKA line of the rodent malaria parasite Plasmodium berghei, and 10 874 GSSs generated from MBN libraries of the Salvador I and Belem lines of Plasmodium vivax, the most geographically wide-spread human malaria pathogen. These tags, together with 2438 Plasmodium falciparum sequences present in GenBank, were used to perform first-pass assembly and transcript reconstruction, and non-redundant consensus sequence datasets created. The datasets were compared against public protein databases and more than 1000 putative new Plasmodium proteins identified based on sequence similarity. Homologs of previously characterized Plasmodium genes were also identified, increasing the number of P. vivax and P. berghei sequences in public databases at least 10-fold. Comparative studies with other species of Apicomplexa identified interesting homologs of possible therapeutic or diagnostic value. A gene prediction program, Phat, was used to predict probable open reading frames for proteins in all three datasets. Predicted and non-redundant BLAST-matched proteins were submitted to InterPro, an integrated database of protein domains, signatures and families, for functional classification. Thus a partial predicted proteome was created for each species. This first comparative analysis of Plasmodium protein coding sequences represents a valuable resource for further studies on the biology of this important pathogen.

KW - Apicomplexa

KW - Comparative genomics

KW - Malaria

KW - Proteome

UR - http://www.scopus.com/inward/record.url?scp=0035662389&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0035662389&partnerID=8YFLogxK

U2 - 10.1016/S0166-6851(01)00371-1

DO - 10.1016/S0166-6851(01)00371-1

M3 - Article

VL - 118

SP - 201

EP - 210

JO - Molecular and Biochemical Parasitology

JF - Molecular and Biochemical Parasitology

SN - 0166-6851

IS - 2

ER -