Analysis of context sequence surrounding translation initiation site from complete genome of model plants

L. Rangan, Christine Vogel, A. Srivastava

Research output: Contribution to journal (Article)

Abstract

Regions flanking the translation initiation site (TIS) are thought to play a crucial role in translation efficiency of mRNAs, but their exact sequence and evolution in eukaryotes are still a matter of debate. We investigated the context sequences in 20 nucleotides around the TIS in multicellular eukaryotes, with a focus on two model plants and a comparison to human. We identified consensus sequences aaaaaaa(A/G)(A/C)aAUGGcgaataata and ggcggc(g/c)(A/G)(A/C)(G/C)AUGGCggcggcgg for Arabidopsis thaliana and Oryza sativa, respectively. We observe strongly conserved G at position +4 and A or C at position -2; however, the exact nucleotide frequencies vary between the three organisms even at these conserved positions. The frequency of pyrimidines, which are considered suboptimal at position -3, is higher in both plants than in human. Arabidopsis is GC-depleted (AU-enriched) compared to both rice and human, and the enrichment is slightly stronger upstream than downstream of AUG. While both plants are similar though not identical in their variation of nucleotide frequencies, rice and human are more similar to each other than Arabidopsis and human are. All three organisms display clear periodicity in A + G and C + U content when analyzing normalized frequencies. These findings suggest that, besides a few highly conserved positions, the overall structure of the context sequence plays a larger role in TIS recognition than the actual nucleotide frequencies.
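
The kind of analysis the abstract describes (per-position nucleotide frequencies around AUG, a consensus string, and A + G content per position) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the `strong` threshold, the upper/lower-case convention, and the toy windows are all assumptions made for illustration, and the toy data are not taken from any genome.

```python
from collections import Counter

def position_frequencies(windows):
    """Relative frequency of A, C, G, U at each column of equal-length windows."""
    length = len(windows[0])
    if any(len(w) != length for w in windows):
        raise ValueError("all windows must have the same length")
    freqs = []
    for col in range(length):
        counts = Counter(w[col] for w in windows)
        freqs.append({nt: counts.get(nt, 0) / len(windows) for nt in "ACGU"})
    return freqs

def consensus(freqs, strong=0.5):
    """Most frequent nucleotide per column.

    Assumed convention: uppercase if its frequency is >= `strong`,
    lowercase otherwise. The threshold is illustrative only.
    """
    out = []
    for col in freqs:
        nt, f = max(col.items(), key=lambda kv: kv[1])
        out.append(nt if f >= strong else nt.lower())
    return "".join(out)

def purine_content(freqs):
    """A + G frequency per column, the quantity inspected for periodicity."""
    return [col["A"] + col["G"] for col in freqs]

# Hypothetical toy windows covering positions -3..-1, AUG (+1..+3), and +4.
toy = ["AACAUGG", "GAAAUGG", "AUCAUGG", "CACAUGC"]
freqs = position_frequencies(toy)
print(consensus(freqs))              # consensus with the default threshold
print(consensus(freqs, strong=0.6))  # stricter threshold lowers weak columns
print(purine_content(freqs))
```

With real data, each window would be extracted from annotated transcripts so that the A of AUG sits at a fixed column; the sketch assumes that alignment has already been done.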

Original language: English (US)
Pages (from-to): 207-213
Number of pages: 7
Journal: Molecular Biotechnology
Volume: 39
Issue number: 3
DOI: 10.1007/s12033-008-9036-9
State: Published - Jul 2008

Fingerprint

Plant Genome
Nucleotides
Sequence Analysis
Genes
Arabidopsis
Eukaryota
Pyrimidines
Consensus Sequence
Base Composition
Protein Biosynthesis
Periodicity
Messenger RNA
Oryza

Keywords

  • A. thaliana
  • Consensus sequence
  • Eukaryotes
  • O. sativa
  • Translation initiation site

ASJC Scopus subject areas

  • Biochemistry
  • Biotechnology
  • Molecular Biology
  • Bioengineering
  • Applied Microbiology and Biotechnology
  • Biochemistry, Genetics and Molecular Biology(all)

Cite this

Analysis of context sequence surrounding translation initiation site from complete genome of model plants. / Rangan, L.; Vogel, Christine; Srivastava, A.

In: Molecular Biotechnology, Vol. 39, No. 3, 07.2008, p. 207-213.

@article{49323148bc694f72a382374780a82bc0,
title = "Analysis of context sequence surrounding translation initiation site from complete genome of model plants",
abstract = "Regions flanking the translation initiation site (TIS) are thought to play a crucial role in translation efficiency of mRNAs, but their exact sequence and evolution in eukaryotes are still a matter of debate. We investigated the context sequences in 20 nucleotides around the TIS in multicellular eukaryotes, with a focus on two model plants and a comparison to human. We identified consensus sequences aaaaaaa(A/G)(A/C)aAUGGcgaataata and ggcggc(g/c)(A/G)(A/C)(G/C)AUGGCggcggcgg for Arabidopsis thaliana and Oryza sativa, respectively. We observe strongly conserved G at position +4 and A or C at position -2; however, the exact nucleotide frequencies vary between the three organisms even at these conserved positions. The frequency of pyrimidines, which are considered suboptimal at position -3, is higher in both plants than in human. Arabidopsis is GC-depleted (AU-enriched) compared to both rice and human, and the enrichment is slightly stronger upstream than downstream of AUG. While both plants are similar though not identical in their variation of nucleotide frequencies, rice and human are more similar to each other than Arabidopsis and human are. All three organisms display clear periodicity in A + G and C + U content when analyzing normalized frequencies. These findings suggest that, besides a few highly conserved positions, the overall structure of the context sequence plays a larger role in TIS recognition than the actual nucleotide frequencies.",
keywords = "A. thaliana, Consensus sequence, Eukaryotes, O. sativa, Translation initiation site",
author = "L. Rangan and Christine Vogel and A. Srivastava",
year = "2008",
month = "7",
doi = "10.1007/s12033-008-9036-9",
language = "English (US)",
volume = "39",
pages = "207--213",
journal = "Molecular Biotechnology",
issn = "1073-6085",
publisher = "Humana Press",
number = "3",

}

TY - JOUR

T1 - Analysis of context sequence surrounding translation initiation site from complete genome of model plants

AU - Rangan, L.

AU - Vogel, Christine

AU - Srivastava, A.

PY - 2008/7

Y1 - 2008/7

AB - Regions flanking the translation initiation site (TIS) are thought to play a crucial role in translation efficiency of mRNAs, but their exact sequence and evolution in eukaryotes are still a matter of debate. We investigated the context sequences in 20 nucleotides around the TIS in multicellular eukaryotes, with a focus on two model plants and a comparison to human. We identified consensus sequences aaaaaaa(A/G)(A/C)aAUGGcgaataata and ggcggc(g/c)(A/G)(A/C)(G/C)AUGGCggcggcgg for Arabidopsis thaliana and Oryza sativa, respectively. We observe strongly conserved G at position +4 and A or C at position -2; however, the exact nucleotide frequencies vary between the three organisms even at these conserved positions. The frequency of pyrimidines, which are considered suboptimal at position -3, is higher in both plants than in human. Arabidopsis is GC-depleted (AU-enriched) compared to both rice and human, and the enrichment is slightly stronger upstream than downstream of AUG. While both plants are similar though not identical in their variation of nucleotide frequencies, rice and human are more similar to each other than Arabidopsis and human are. All three organisms display clear periodicity in A + G and C + U content when analyzing normalized frequencies. These findings suggest that, besides a few highly conserved positions, the overall structure of the context sequence plays a larger role in TIS recognition than the actual nucleotide frequencies.

KW - A. thaliana

KW - Consensus sequence

KW - Eukaryotes

KW - O. sativa

KW - Translation initiation site

UR - http://www.scopus.com/inward/record.url?scp=49849104172&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=49849104172&partnerID=8YFLogxK

U2 - 10.1007/s12033-008-9036-9

DO - 10.1007/s12033-008-9036-9

M3 - Article

VL - 39

SP - 207

EP - 213

JO - Molecular Biotechnology

JF - Molecular Biotechnology

SN - 1073-6085

IS - 3

ER -