Predicting Large RNA-Like Topologies by a Knowledge-Based Clustering Approach

Naoto Baba, Shereef Elmetwaly, Namhee Kim, Tamar Schlick

Research output: Contribution to journalArticle

Abstract

An analysis and expansion of our resource for classifying, predicting, and designing RNA structures, RAG (RNA-As-Graphs), is presented, with the goal of understanding features of RNA-like and non-RNA-like motifs and exploiting this information for RNA design. RAG was first reported in 2004 for cataloging RNA secondary structure motifs using graph representations. In 2011, the RAG resource was updated with the increased availability of RNA structures and was improved by utilities for analyzing RNA structures, including substructuring and search tools. We also classified RNA structures as graphs up to 10 vertices (~ 200 nucleotides) into three classes: existing, RNA-like, and non-RNA-like using clustering approaches. Here, we focus on the tree graphs and evaluate the newly founded RNAs since 2011, which also support our refined predictions of RNA-like motifs. We expand the RAG resource for large tree graphs up to 13 vertices (~ 260 nucleotides), thereby cataloging more than 10 times as many secondary structures. We apply clustering algorithms based on features of RNA secondary structures translated from known tertiary structures to suggest which hypothetical large RNA motifs can be considered "RNA-like". The results by the PAM (Partitioning Around Medoids) approach, in particular, reveal good accuracy, with small error for the largest cases. The RAG update here up to 13 vertices offers a useful graph-based tool for exploring RNA motifs and suggesting large RNA motifs for design.

Original languageEnglish (US)
Pages (from-to)811-821
Number of pages11
JournalJournal of Molecular Biology
Volume428
Issue number5
DOIs
StatePublished - Feb 27 2016

Fingerprint

Cluster Analysis
RNA
Nucleotide Motifs
Cataloging
Nucleotides

Keywords

  • Prediction of RNA-like motifs
  • RNA atlas
  • RNA design
  • RNA motifs
  • RNA secondary structure

ASJC Scopus subject areas

  • Molecular Biology

Cite this

Predicting Large RNA-Like Topologies by a Knowledge-Based Clustering Approach. / Baba, Naoto; Elmetwaly, Shereef; Kim, Namhee; Schlick, Tamar.

In: Journal of Molecular Biology, Vol. 428, No. 5, 27.02.2016, p. 811-821.

Research output: Contribution to journalArticle

Baba, Naoto ; Elmetwaly, Shereef ; Kim, Namhee ; Schlick, Tamar. / Predicting Large RNA-Like Topologies by a Knowledge-Based Clustering Approach. In: Journal of Molecular Biology. 2016 ; Vol. 428, No. 5. pp. 811-821.
@article{8182ed2be0594eea8b531c51b2a91e2d,
title = "Predicting Large RNA-Like Topologies by a Knowledge-Based Clustering Approach",
abstract = "An analysis and expansion of our resource for classifying, predicting, and designing RNA structures, RAG (RNA-As-Graphs), is presented, with the goal of understanding features of RNA-like and non-RNA-like motifs and exploiting this information for RNA design. RAG was first reported in 2004 for cataloging RNA secondary structure motifs using graph representations. In 2011, the RAG resource was updated with the increased availability of RNA structures and was improved by utilities for analyzing RNA structures, including substructuring and search tools. We also classified RNA structures as graphs up to 10 vertices (~ 200 nucleotides) into three classes: existing, RNA-like, and non-RNA-like using clustering approaches. Here, we focus on the tree graphs and evaluate the newly founded RNAs since 2011, which also support our refined predictions of RNA-like motifs. We expand the RAG resource for large tree graphs up to 13 vertices (~ 260 nucleotides), thereby cataloging more than 10 times as many secondary structures. We apply clustering algorithms based on features of RNA secondary structures translated from known tertiary structures to suggest which hypothetical large RNA motifs can be considered {"}RNA-like{"}. The results by the PAM (Partitioning Around Medoids) approach, in particular, reveal good accuracy, with small error for the largest cases. The RAG update here up to 13 vertices offers a useful graph-based tool for exploring RNA motifs and suggesting large RNA motifs for design.",
keywords = "Prediction of RNA-like motifs, RNA atlas, RNA design, RNA motifs, RNA secondary structure",
author = "Naoto Baba and Shereef Elmetwaly and Namhee Kim and Tamar Schlick",
year = "2016",
month = "2",
day = "27",
doi = "10.1016/j.jmb.2015.10.009",
language = "English (US)",
volume = "428",
pages = "811--821",
journal = "Journal of Molecular Biology",
issn = "0022-2836",
publisher = "Academic Press Inc.",
number = "5",

}

TY - JOUR

T1 - Predicting Large RNA-Like Topologies by a Knowledge-Based Clustering Approach

AU - Baba, Naoto

AU - Elmetwaly, Shereef

AU - Kim, Namhee

AU - Schlick, Tamar

PY - 2016/2/27

Y1 - 2016/2/27

N2 - An analysis and expansion of our resource for classifying, predicting, and designing RNA structures, RAG (RNA-As-Graphs), is presented, with the goal of understanding features of RNA-like and non-RNA-like motifs and exploiting this information for RNA design. RAG was first reported in 2004 for cataloging RNA secondary structure motifs using graph representations. In 2011, the RAG resource was updated with the increased availability of RNA structures and was improved by utilities for analyzing RNA structures, including substructuring and search tools. We also classified RNA structures as graphs up to 10 vertices (~ 200 nucleotides) into three classes: existing, RNA-like, and non-RNA-like using clustering approaches. Here, we focus on the tree graphs and evaluate the newly founded RNAs since 2011, which also support our refined predictions of RNA-like motifs. We expand the RAG resource for large tree graphs up to 13 vertices (~ 260 nucleotides), thereby cataloging more than 10 times as many secondary structures. We apply clustering algorithms based on features of RNA secondary structures translated from known tertiary structures to suggest which hypothetical large RNA motifs can be considered "RNA-like". The results by the PAM (Partitioning Around Medoids) approach, in particular, reveal good accuracy, with small error for the largest cases. The RAG update here up to 13 vertices offers a useful graph-based tool for exploring RNA motifs and suggesting large RNA motifs for design.

AB - An analysis and expansion of our resource for classifying, predicting, and designing RNA structures, RAG (RNA-As-Graphs), is presented, with the goal of understanding features of RNA-like and non-RNA-like motifs and exploiting this information for RNA design. RAG was first reported in 2004 for cataloging RNA secondary structure motifs using graph representations. In 2011, the RAG resource was updated with the increased availability of RNA structures and was improved by utilities for analyzing RNA structures, including substructuring and search tools. We also classified RNA structures as graphs up to 10 vertices (~ 200 nucleotides) into three classes: existing, RNA-like, and non-RNA-like using clustering approaches. Here, we focus on the tree graphs and evaluate the newly founded RNAs since 2011, which also support our refined predictions of RNA-like motifs. We expand the RAG resource for large tree graphs up to 13 vertices (~ 260 nucleotides), thereby cataloging more than 10 times as many secondary structures. We apply clustering algorithms based on features of RNA secondary structures translated from known tertiary structures to suggest which hypothetical large RNA motifs can be considered "RNA-like". The results by the PAM (Partitioning Around Medoids) approach, in particular, reveal good accuracy, with small error for the largest cases. The RAG update here up to 13 vertices offers a useful graph-based tool for exploring RNA motifs and suggesting large RNA motifs for design.

KW - Prediction of RNA-like motifs

KW - RNA atlas

KW - RNA design

KW - RNA motifs

KW - RNA secondary structure

UR - http://www.scopus.com/inward/record.url?scp=84961725598&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84961725598&partnerID=8YFLogxK

U2 - 10.1016/j.jmb.2015.10.009

DO - 10.1016/j.jmb.2015.10.009

M3 - Article

VL - 428

SP - 811

EP - 821

JO - Journal of Molecular Biology

JF - Journal of Molecular Biology

SN - 0022-2836

IS - 5

ER -