A computational proposal for designing structured RNA pools for in vitro selection of RNAs

Namhee Kim, Hark Gan Hin, Tamar Schlick

Research output: Contribution to journalArticle

Abstract

Although in vitro selection technology is a versatile experimental tool for discovering novel synthetic RNA molecules, finding complex RNA molecules is difficult because most RNAs identified from random sequence pools are simple motifs, consistent with recent computational analysis of such sequence pools. Thus, enriching in vitro selection pools with complex structures could increase the probability of discovering novel RNAs. Here we develop an approach for engineering sequence pools that links RNA sequence space regions with corresponding structural distributions via a "mixing matrix" approach combined with a graph theory analysis. We define five classes of mixing matrices motivated by covariance mutations in RNA; these constructs define nucleotide transition rates and are applied to chosen starting sequences to yield specific nonrandom pools. We examine the coverage of sequence space as a function of the mixing matrix and starting sequence via clustering analysis. We show that, in contrast to random sequences, which are associated only with a local region of sequence space, our designed pools, including a structured pool for GTP aptamers, can target specific motifs. It follows that experimental synthesis of designed pools can benefit from using optimized starting sequences, mixing matrices, and pool fractions associated with each of our constructed pools as a guide. Automation of our approach could provide practical tools for pool design applications for in vitro selection of RNAs and related problems. Published by Cold Spring Harbor Laboratory Press.

Original languageEnglish (US)
Pages (from-to)478-492
Number of pages15
JournalRNA
Volume13
Issue number4
DOIs
StatePublished - Apr 2007

Fingerprint

RNA
Automation
Guanosine Triphosphate
Sequence Analysis
Cluster Analysis
In Vitro Techniques
Nucleotides
Technology
Mutation

Keywords

  • Graph theory
  • In vitro selection
  • Mixing matrix
  • RNA pool design
  • Sequence-structure map

ASJC Scopus subject areas

  • Genetics
  • Molecular Biology

Cite this

A computational proposal for designing structured RNA pools for in vitro selection of RNAs. / Kim, Namhee; Hin, Hark Gan; Schlick, Tamar.

In: RNA, Vol. 13, No. 4, 04.2007, p. 478-492.

Research output: Contribution to journalArticle

Kim, Namhee ; Hin, Hark Gan ; Schlick, Tamar. / A computational proposal for designing structured RNA pools for in vitro selection of RNAs. In: RNA. 2007 ; Vol. 13, No. 4. pp. 478-492.
@article{0ee6a2a990f946499e797b876761f506,
title = "A computational proposal for designing structured RNA pools for in vitro selection of RNAs",
abstract = "Although in vitro selection technology is a versatile experimental tool for discovering novel synthetic RNA molecules, finding complex RNA molecules is difficult because most RNAs identified from random sequence pools are simple motifs, consistent with recent computational analysis of such sequence pools. Thus, enriching in vitro selection pools with complex structures could increase the probability of discovering novel RNAs. Here we develop an approach for engineering sequence pools that links RNA sequence space regions with corresponding structural distributions via a {"}mixing matrix{"} approach combined with a graph theory analysis. We define five classes of mixing matrices motivated by covariance mutations in RNA; these constructs define nucleotide transition rates and are applied to chosen starting sequences to yield specific nonrandom pools. We examine the coverage of sequence space as a function of the mixing matrix and starting sequence via clustering analysis. We show that, in contrast to random sequences, which are associated only with a local region of sequence space, our designed pools, including a structured pool for GTP aptamers, can target specific motifs. It follows that experimental synthesis of designed pools can benefit from using optimized starting sequences, mixing matrices, and pool fractions associated with each of our constructed pools as a guide. Automation of our approach could provide practical tools for pool design applications for in vitro selection of RNAs and related problems. Published by Cold Spring Harbor Laboratory Press.",
keywords = "Graph theory, In vitro selection, Mixing matrix, RNA pool design, Sequence-structure map",
author = "Namhee Kim and Hin, {Hark Gan} and Tamar Schlick",
year = "2007",
month = "4",
doi = "10.1261/rna.374907",
language = "English (US)",
volume = "13",
pages = "478--492",
journal = "RNA",
issn = "1355-8382",
publisher = "Cold Spring Harbor Laboratory Press",
number = "4",

}

TY - JOUR

T1 - A computational proposal for designing structured RNA pools for in vitro selection of RNAs

AU - Kim, Namhee

AU - Hin, Hark Gan

AU - Schlick, Tamar

PY - 2007/4

Y1 - 2007/4

N2 - Although in vitro selection technology is a versatile experimental tool for discovering novel synthetic RNA molecules, finding complex RNA molecules is difficult because most RNAs identified from random sequence pools are simple motifs, consistent with recent computational analysis of such sequence pools. Thus, enriching in vitro selection pools with complex structures could increase the probability of discovering novel RNAs. Here we develop an approach for engineering sequence pools that links RNA sequence space regions with corresponding structural distributions via a "mixing matrix" approach combined with a graph theory analysis. We define five classes of mixing matrices motivated by covariance mutations in RNA; these constructs define nucleotide transition rates and are applied to chosen starting sequences to yield specific nonrandom pools. We examine the coverage of sequence space as a function of the mixing matrix and starting sequence via clustering analysis. We show that, in contrast to random sequences, which are associated only with a local region of sequence space, our designed pools, including a structured pool for GTP aptamers, can target specific motifs. It follows that experimental synthesis of designed pools can benefit from using optimized starting sequences, mixing matrices, and pool fractions associated with each of our constructed pools as a guide. Automation of our approach could provide practical tools for pool design applications for in vitro selection of RNAs and related problems. Published by Cold Spring Harbor Laboratory Press.

AB - Although in vitro selection technology is a versatile experimental tool for discovering novel synthetic RNA molecules, finding complex RNA molecules is difficult because most RNAs identified from random sequence pools are simple motifs, consistent with recent computational analysis of such sequence pools. Thus, enriching in vitro selection pools with complex structures could increase the probability of discovering novel RNAs. Here we develop an approach for engineering sequence pools that links RNA sequence space regions with corresponding structural distributions via a "mixing matrix" approach combined with a graph theory analysis. We define five classes of mixing matrices motivated by covariance mutations in RNA; these constructs define nucleotide transition rates and are applied to chosen starting sequences to yield specific nonrandom pools. We examine the coverage of sequence space as a function of the mixing matrix and starting sequence via clustering analysis. We show that, in contrast to random sequences, which are associated only with a local region of sequence space, our designed pools, including a structured pool for GTP aptamers, can target specific motifs. It follows that experimental synthesis of designed pools can benefit from using optimized starting sequences, mixing matrices, and pool fractions associated with each of our constructed pools as a guide. Automation of our approach could provide practical tools for pool design applications for in vitro selection of RNAs and related problems. Published by Cold Spring Harbor Laboratory Press.

KW - Graph theory

KW - In vitro selection

KW - Mixing matrix

KW - RNA pool design

KW - Sequence-structure map

UR - http://www.scopus.com/inward/record.url?scp=33947719473&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33947719473&partnerID=8YFLogxK

U2 - 10.1261/rna.374907

DO - 10.1261/rna.374907

M3 - Article

VL - 13

SP - 478

EP - 492

JO - RNA

JF - RNA

SN - 1355-8382

IS - 4

ER -