In vitro RNA random pools are not structurally diverse

A computational analysis

Jana Gevertz, Hin Hark Gan, Tamar Schlick

Research output: Contribution to journal › Article

Abstract

In vitro selection of functional RNAs from large random sequence pools has led to the identification of many ligand-binding and catalytic RNAs. However, the structural diversity in random pools is not well understood. Such an understanding is a prerequisite for designing sequence pools to increase the probability of finding complex functional RNA by in vitro selection techniques. Toward this goal, we computationally generated five random pools of RNA sequences of length up to 100 nt to mimic experiments and characterized the distribution of associated secondary structural motifs using sets of possible RNA tree structures derived from graph theory techniques. Our results show that such random pools heavily favor simple topological structures: for example, linear stem-loop and low-branching motifs are favored rather than complex structures with high-order junctions, as confirmed by known aptamers. Moreover, we quantify the rise of structural complexity with sequence length and report the dominant class of tree motifs (characterized by vertex number) for each pool. These analyses not only show that random pools do not lead to a uniform distribution of possible RNA secondary topologies; they also point to avenues for designing pools with specific simple and complex structures in equal abundance, with the goal of broadening the range of functional RNAs discovered by in vitro selection. Specifically, the optimal RNA sequence pool length to identify a structure with x stems is 20x nt.
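The workflow described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code. It assumes a uniform nucleotide composition for the random pool, takes secondary structures as dot-bracket strings from some external folding program (folding itself is not shown), and uses a simplified stem definition in which any break in base-pair stacking starts a new stem. Under the tree representation, stems are edges and loops or free ends are vertices, so a tree has one more vertex than it has stems. All function names are hypothetical.

```python
import random


def random_pool(n_sequences, length, seed=0):
    """Generate random RNA sequences (assumption: uniform A/C/G/U composition)."""
    rng = random.Random(seed)
    return ["".join(rng.choice("ACGU") for _ in range(length))
            for _ in range(n_sequences)]


def pair_table(dot_bracket):
    """Map each 5' paired position to its 3' partner in a dot-bracket string."""
    stack, pairs = [], {}
    for i, ch in enumerate(dot_bracket):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced ')' in structure")
            pairs[stack.pop()] = i
    if stack:
        raise ValueError("unbalanced '(' in structure")
    return pairs


def count_stems(dot_bracket):
    """Count stems; a pair (i, j) opens a new stem unless (i-1, j+1) is also paired.

    Simplification: every interruption of stacking, however small, splits a stem.
    """
    pairs = pair_table(dot_bracket)
    return sum(1 for i, j in pairs.items() if pairs.get(i - 1) != j + 1)


def tree_vertex_count(dot_bracket):
    """Vertex number of the tree graph: one more than the number of stems (edges)."""
    return count_stems(dot_bracket) + 1


def suggested_pool_length(n_stems):
    """Pool length in nt from the abstract's rule: 20 nt per stem."""
    return 20 * n_stems


if __name__ == "__main__":
    pool = random_pool(n_sequences=5, length=suggested_pool_length(3))
    print(len(pool), "sequences of length", len(pool[0]))
    print(tree_vertex_count("((((....))))"))              # single stem-loop
    print(tree_vertex_count("((..((...))..((...))..))"))  # three-way junction
```

In this sketch a single stem-loop maps to a two-vertex tree and a three-way junction to a four-vertex tree. Tallying `tree_vertex_count` over the predicted structures of a pool would give the kind of vertex-number distribution the abstract refers to.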

Original language: English (US)
Pages (from-to): 853-863
Number of pages: 11
Journal: RNA
Volume: 11
Issue number: 6
DOI: 10.1261/rna.7271405
State: Published - Jun 2005

Fingerprint

RNA
Catalytic RNA
In Vitro Techniques
Ligands

Keywords

  • Graph theory
  • In vitro selection
  • Random pool
  • RNA pool design
  • RNA secondary structure
  • RNA topology

ASJC Scopus subject areas

  • Genetics
  • Molecular Biology

Cite this

In vitro RNA random pools are not structurally diverse: A computational analysis. / Gevertz, Jana; Gan, Hin Hark; Schlick, Tamar.

In: RNA, Vol. 11, No. 6, 06.2005, p. 853-863.

@article{63d7c4b495674b0c99eeaab70123af23,
title = "In vitro RNA random pools are not structurally diverse: A computational analysis",
abstract = "In vitro selection of functional RNAs from large random sequence pools has led to the identification of many ligand-binding and catalytic RNAs. However, the structural diversity in random pools is not well understood. Such an understanding is a prerequisite for designing sequence pools to increase the probability of finding complex functional RNA by in vitro selection techniques. Toward this goal, we have generated by computer five random pools of RNA sequences of length up to 100 nt to mimic experiments and characterized the distribution of associated secondary structural motifs using sets of possible RNA tree structures derived from graph theory techniques. Our results show that such random pools heavily favor simple topological structures: For example, linear stem-loop and low-branching motifs are favored rather than complex structures with high-order junctions, as confirmed by known aptamers. Moreover, we quantify the rise of structural complexity with sequence length and report the dominant class of tree motifs (characterized by vertex number) for each pool. These analyses show not only that random pools do not lead to a uniform distribution of possible RNA secondary topologies; they point to avenues for designing pools with specific simple and complex structures in equal abundance in the goal of broadening the range of functional RNAs discovered by in vitro selection. Specifically, the optimal RNA sequence pool length to identify a structure with x stems is 20x.",
keywords = "Graph theory, In vitro selection, Random pool, RNA pool design, RNA secondary structure, RNA topology",
author = "Jana Gevertz and Gan, {Hin Hark} and Tamar Schlick",
year = "2005",
month = "6",
doi = "10.1261/rna.7271405",
language = "English (US)",
volume = "11",
pages = "853--863",
journal = "RNA",
issn = "1355-8382",
publisher = "Cold Spring Harbor Laboratory Press",
number = "6",

}

TY - JOUR

T1 - In vitro RNA random pools are not structurally diverse

T2 - A computational analysis

AU - Gevertz, Jana

AU - Gan, Hin Hark

AU - Schlick, Tamar

PY - 2005/6

Y1 - 2005/6

AB - In vitro selection of functional RNAs from large random sequence pools has led to the identification of many ligand-binding and catalytic RNAs. However, the structural diversity in random pools is not well understood. Such an understanding is a prerequisite for designing sequence pools to increase the probability of finding complex functional RNA by in vitro selection techniques. Toward this goal, we have generated by computer five random pools of RNA sequences of length up to 100 nt to mimic experiments and characterized the distribution of associated secondary structural motifs using sets of possible RNA tree structures derived from graph theory techniques. Our results show that such random pools heavily favor simple topological structures: For example, linear stem-loop and low-branching motifs are favored rather than complex structures with high-order junctions, as confirmed by known aptamers. Moreover, we quantify the rise of structural complexity with sequence length and report the dominant class of tree motifs (characterized by vertex number) for each pool. These analyses show not only that random pools do not lead to a uniform distribution of possible RNA secondary topologies; they point to avenues for designing pools with specific simple and complex structures in equal abundance in the goal of broadening the range of functional RNAs discovered by in vitro selection. Specifically, the optimal RNA sequence pool length to identify a structure with x stems is 20x.

KW - Graph theory

KW - In vitro selection

KW - Random pool

KW - RNA pool design

KW - RNA secondary structure

KW - RNA topology

UR - http://www.scopus.com/inward/record.url?scp=21844467243&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=21844467243&partnerID=8YFLogxK

U2 - 10.1261/rna.7271405

DO - 10.1261/rna.7271405

M3 - Article

VL - 11

SP - 853

EP - 863

JO - RNA

JF - RNA

SN - 1355-8382

IS - 6

ER -