The SUPERFAMILY database in 2007: Families and functions

Derek Wilson, Martin Madera, Christine Vogel, Cyrus Chothia, Julian Gough

Research output: Contribution to journal, Article

Abstract

The SUPERFAMILY database provides protein domain assignments, at the SCOP 'superfamily' level, for the predicted protein sequences in over 400 completed genomes. A superfamily groups together domains of different families that have a common evolutionary ancestor, as judged from structural, functional and sequence data. SUPERFAMILY domain assignments are generated using an expert-curated set of profile hidden Markov models. All models and structural assignments are available for browsing and download from http://supfam.org. The web interface includes services such as domain architectures and alignment details for all protein assignments, searchable domain combinations, domain occurrence network visualization, detection of over- or under-represented superfamilies for a given genome by comparison with other genomes, assignment of manually submitted sequences, and keyword searches. In this update we describe the SUPERFAMILY database and outline two major developments: (i) incorporation of family-level assignments and (ii) a superfamily-level functional annotation. The SUPERFAMILY database can be used for general protein evolution and superfamily-specific studies, genomic annotation, and structural genomics target suggestion and assessment.
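
The abstract mentions detection of over- or under-represented superfamilies for a given genome by comparison with other genomes, but does not say how that comparison is computed. As a rough illustration only, the sketch below scores each superfamily by a z-score of its share of assigned domains in one genome against the same share in a set of comparison genomes. The z-score approach, the function name `representation_z_scores`, and the superfamily labels are all assumptions made for this example, not the method or identifiers used by SUPERFAMILY.

```python
from statistics import mean, pstdev


def representation_z_scores(target, background):
    """Score superfamilies in one genome against comparison genomes.

    Hypothetical helper, not part of SUPERFAMILY.
    target:     dict mapping superfamily label -> domain count in the genome of interest
    background: list of such dicts, one per comparison genome
    Returns a dict mapping superfamily label -> z-score of its domain fraction.
    """
    if not background:
        raise ValueError("at least one comparison genome is required")

    def fractions(counts):
        # Normalise raw counts to fractions so genomes of different size are comparable.
        total = sum(counts.values())
        return {sf: n / total for sf, n in counts.items()} if total else {}

    target_frac = fractions(target)
    background_fracs = [fractions(genome) for genome in background]

    # Consider every superfamily seen in the target or in any comparison genome.
    superfamilies = set(target_frac).union(*background_fracs)

    scores = {}
    for sf in superfamilies:
        values = [genome.get(sf, 0.0) for genome in background_fracs]
        sigma = pstdev(values)
        if sigma == 0:
            # No variation across comparison genomes: the z-score is undefined, so skip.
            continue
        scores[sf] = (target_frac.get(sf, 0.0) - mean(values)) / sigma
    return scores


# Toy counts with made-up labels, for illustration only.
genome_of_interest = {"sf_A": 30, "sf_B": 70}
comparison_genomes = [{"sf_A": 10, "sf_B": 90}, {"sf_A": 20, "sf_B": 80}]
z = representation_z_scores(genome_of_interest, comparison_genomes)
```

Under this assumption a large positive score marks a superfamily as over-represented in the genome of interest and a large negative score marks it as under-represented; any cut-off applied to the scores would be a further choice not given in the abstract.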

Original language: English (US)
Journal: Nucleic Acids Research
Volume: 35
Issue number: SUPPL. 1
DOI: 10.1093/nar/gkl910
State: Published - Jan 2007

Fingerprint

Genome
Databases
Proteins
Structural Models
Genomics
Protein Domains

ASJC Scopus subject areas

  • Genetics

Cite this

The SUPERFAMILY database in 2007: Families and functions. / Wilson, Derek; Madera, Martin; Vogel, Christine; Chothia, Cyrus; Gough, Julian.

In: Nucleic Acids Research, Vol. 35, No. SUPPL. 1, 01.2007.

Wilson, Derek; Madera, Martin; Vogel, Christine; Chothia, Cyrus; Gough, Julian. / The SUPERFAMILY database in 2007: Families and functions. In: Nucleic Acids Research. 2007; Vol. 35, No. SUPPL. 1.
@article{cd8192dc6c9f4e04938ddd150d7bcbd1,
title = "The SUPERFAMILY database in 2007: Families and functions",
author = "Derek Wilson and Martin Madera and Christine Vogel and Cyrus Chothia and Julian Gough",
year = "2007",
month = "1",
doi = "10.1093/nar/gkl910",
language = "English (US)",
volume = "35",
journal = "Nucleic Acids Research",
issn = "0305-1048",
publisher = "Oxford University Press",
number = "SUPPL. 1",

}

TY - JOUR

T1 - The SUPERFAMILY database in 2007

T2 - Families and functions

AU - Wilson, Derek

AU - Madera, Martin

AU - Vogel, Christine

AU - Chothia, Cyrus

AU - Gough, Julian

PY - 2007/1

Y1 - 2007/1

UR - http://www.scopus.com/inward/record.url?scp=33846044585&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33846044585&partnerID=8YFLogxK

U2 - 10.1093/nar/gkl910

DO - 10.1093/nar/gkl910

M3 - Article

VL - 35

JO - Nucleic Acids Research

JF - Nucleic Acids Research

SN - 0305-1048

IS - SUPPL. 1

ER -