ESTimating plant phylogeny: Lessons from partitioning

Jose E B De La Torre, Mary G. Egan, Manpreet S. Katari, Eric D. Brenner, Dennis W. Stevenson, Gloria M. Coruzzi, Rob DeSalle

Research output: Contribution to journalArticle

Abstract

Background: While Expressed Sequence Tags (ESTs) have proven a viable and efficient way to sample genomes, particularly those for which whole-genome sequencing is impractical, phylogenetic analysis using ESTs remains difficult. Sequencing errors and orthology determination are the major problems when using ESTs as a source of characters for systematics. Here we develop methods to incorporate EST sequence information in a simultaneous analysis framework to address controversial phylogenetic questions regarding the relationships among the major groups of seed plants. We use an automated, phylogenetically derived approach to orthology determination called OrthologID generate a phylogeny based on 43 process partitions, many of which are derived from ESTs, and examine several measures of support to assess the utility of EST data for phylogenies. Results: A maximum parsimony (MP) analysis resulted in a single tree with relatively high support at all nodes in the tree despite rampant conflict among trees generated from the separate analysis of individual partitions. In a comparison of broader-scale groupings based on cellular compartment (ie: chloroplast, mitochondrial or nuclear) or function, only the nuclear partition tree (based largely on EST data) was found to be topologically identical to the tree based on the simultaneous analysis of all data. Despite topological conflict among the broader-scale groupings examined, only the tree based on morphological data showed statistically significant differences. Conclusion: Based on the amount of character support contributed by EST data which make up a majority of the nuclear data set, and the lack of conflict of the nuclear data set with the simultaneous analysis tree, we conclude that the inclusion of EST data does provide a viable and efficient approach to address phylogenetic questions within a parsimony framework on a genomic scale, if problems of orthology determination and potential sequencing errors can be overcome. In addition, approaches that examine conflict and support in a simultaneous analysis framework allow for a more precise understanding of the evolutionary history of individual process partitions and may be a novel way to understand functional aspects of different kinds of cellular classes of gene products.

Original languageEnglish (US)
Article number48
JournalBMC Evolutionary Biology
Volume6
DOIs
StatePublished - Jun 15 2006

Fingerprint

Expressed Sequence Tags
Phylogeny
expressed sequence tags
phylogeny
partitioning
phylogenetics
genome
parsimony analysis
chloroplast
Genome
analysis
genomics
Chloroplasts
Spermatophytina
seed
funding
conflict
gene
Seeds
chloroplasts

ASJC Scopus subject areas

  • Medicine(all)
  • Ecology, Evolution, Behavior and Systematics

Cite this

De La Torre, J. E. B., Egan, M. G., Katari, M. S., Brenner, E. D., Stevenson, D. W., Coruzzi, G. M., & DeSalle, R. (2006). ESTimating plant phylogeny: Lessons from partitioning. BMC Evolutionary Biology, 6, [48]. https://doi.org/10.1186/1471-2148-6-48

ESTimating plant phylogeny : Lessons from partitioning. / De La Torre, Jose E B; Egan, Mary G.; Katari, Manpreet S.; Brenner, Eric D.; Stevenson, Dennis W.; Coruzzi, Gloria M.; DeSalle, Rob.

In: BMC Evolutionary Biology, Vol. 6, 48, 15.06.2006.

Research output: Contribution to journalArticle

De La Torre, JEB, Egan, MG, Katari, MS, Brenner, ED, Stevenson, DW, Coruzzi, GM & DeSalle, R 2006, 'ESTimating plant phylogeny: Lessons from partitioning', BMC Evolutionary Biology, vol. 6, 48. https://doi.org/10.1186/1471-2148-6-48
De La Torre JEB, Egan MG, Katari MS, Brenner ED, Stevenson DW, Coruzzi GM et al. ESTimating plant phylogeny: Lessons from partitioning. BMC Evolutionary Biology. 2006 Jun 15;6. 48. https://doi.org/10.1186/1471-2148-6-48
De La Torre, Jose E B ; Egan, Mary G. ; Katari, Manpreet S. ; Brenner, Eric D. ; Stevenson, Dennis W. ; Coruzzi, Gloria M. ; DeSalle, Rob. / ESTimating plant phylogeny : Lessons from partitioning. In: BMC Evolutionary Biology. 2006 ; Vol. 6.
@article{06b455dc10fb487bbc8eb967523c8a13,
title = "ESTimating plant phylogeny: Lessons from partitioning",
abstract = "Background: While Expressed Sequence Tags (ESTs) have proven a viable and efficient way to sample genomes, particularly those for which whole-genome sequencing is impractical, phylogenetic analysis using ESTs remains difficult. Sequencing errors and orthology determination are the major problems when using ESTs as a source of characters for systematics. Here we develop methods to incorporate EST sequence information in a simultaneous analysis framework to address controversial phylogenetic questions regarding the relationships among the major groups of seed plants. We use an automated, phylogenetically derived approach to orthology determination called OrthologID generate a phylogeny based on 43 process partitions, many of which are derived from ESTs, and examine several measures of support to assess the utility of EST data for phylogenies. Results: A maximum parsimony (MP) analysis resulted in a single tree with relatively high support at all nodes in the tree despite rampant conflict among trees generated from the separate analysis of individual partitions. In a comparison of broader-scale groupings based on cellular compartment (ie: chloroplast, mitochondrial or nuclear) or function, only the nuclear partition tree (based largely on EST data) was found to be topologically identical to the tree based on the simultaneous analysis of all data. Despite topological conflict among the broader-scale groupings examined, only the tree based on morphological data showed statistically significant differences. Conclusion: Based on the amount of character support contributed by EST data which make up a majority of the nuclear data set, and the lack of conflict of the nuclear data set with the simultaneous analysis tree, we conclude that the inclusion of EST data does provide a viable and efficient approach to address phylogenetic questions within a parsimony framework on a genomic scale, if problems of orthology determination and potential sequencing errors can be overcome. In addition, approaches that examine conflict and support in a simultaneous analysis framework allow for a more precise understanding of the evolutionary history of individual process partitions and may be a novel way to understand functional aspects of different kinds of cellular classes of gene products.",
author = "{De La Torre}, {Jose E B} and Egan, {Mary G.} and Katari, {Manpreet S.} and Brenner, {Eric D.} and Stevenson, {Dennis W.} and Coruzzi, {Gloria M.} and Rob DeSalle",
year = "2006",
month = "6",
day = "15",
doi = "10.1186/1471-2148-6-48",
language = "English (US)",
volume = "6",
journal = "BMC Evolutionary Biology",
issn = "1471-2148",
publisher = "BioMed Central",

}

TY - JOUR

T1 - ESTimating plant phylogeny

T2 - Lessons from partitioning

AU - De La Torre, Jose E B

AU - Egan, Mary G.

AU - Katari, Manpreet S.

AU - Brenner, Eric D.

AU - Stevenson, Dennis W.

AU - Coruzzi, Gloria M.

AU - DeSalle, Rob

PY - 2006/6/15

Y1 - 2006/6/15

N2 - Background: While Expressed Sequence Tags (ESTs) have proven a viable and efficient way to sample genomes, particularly those for which whole-genome sequencing is impractical, phylogenetic analysis using ESTs remains difficult. Sequencing errors and orthology determination are the major problems when using ESTs as a source of characters for systematics. Here we develop methods to incorporate EST sequence information in a simultaneous analysis framework to address controversial phylogenetic questions regarding the relationships among the major groups of seed plants. We use an automated, phylogenetically derived approach to orthology determination called OrthologID generate a phylogeny based on 43 process partitions, many of which are derived from ESTs, and examine several measures of support to assess the utility of EST data for phylogenies. Results: A maximum parsimony (MP) analysis resulted in a single tree with relatively high support at all nodes in the tree despite rampant conflict among trees generated from the separate analysis of individual partitions. In a comparison of broader-scale groupings based on cellular compartment (ie: chloroplast, mitochondrial or nuclear) or function, only the nuclear partition tree (based largely on EST data) was found to be topologically identical to the tree based on the simultaneous analysis of all data. Despite topological conflict among the broader-scale groupings examined, only the tree based on morphological data showed statistically significant differences. Conclusion: Based on the amount of character support contributed by EST data which make up a majority of the nuclear data set, and the lack of conflict of the nuclear data set with the simultaneous analysis tree, we conclude that the inclusion of EST data does provide a viable and efficient approach to address phylogenetic questions within a parsimony framework on a genomic scale, if problems of orthology determination and potential sequencing errors can be overcome. In addition, approaches that examine conflict and support in a simultaneous analysis framework allow for a more precise understanding of the evolutionary history of individual process partitions and may be a novel way to understand functional aspects of different kinds of cellular classes of gene products.

AB - Background: While Expressed Sequence Tags (ESTs) have proven a viable and efficient way to sample genomes, particularly those for which whole-genome sequencing is impractical, phylogenetic analysis using ESTs remains difficult. Sequencing errors and orthology determination are the major problems when using ESTs as a source of characters for systematics. Here we develop methods to incorporate EST sequence information in a simultaneous analysis framework to address controversial phylogenetic questions regarding the relationships among the major groups of seed plants. We use an automated, phylogenetically derived approach to orthology determination called OrthologID generate a phylogeny based on 43 process partitions, many of which are derived from ESTs, and examine several measures of support to assess the utility of EST data for phylogenies. Results: A maximum parsimony (MP) analysis resulted in a single tree with relatively high support at all nodes in the tree despite rampant conflict among trees generated from the separate analysis of individual partitions. In a comparison of broader-scale groupings based on cellular compartment (ie: chloroplast, mitochondrial or nuclear) or function, only the nuclear partition tree (based largely on EST data) was found to be topologically identical to the tree based on the simultaneous analysis of all data. Despite topological conflict among the broader-scale groupings examined, only the tree based on morphological data showed statistically significant differences. Conclusion: Based on the amount of character support contributed by EST data which make up a majority of the nuclear data set, and the lack of conflict of the nuclear data set with the simultaneous analysis tree, we conclude that the inclusion of EST data does provide a viable and efficient approach to address phylogenetic questions within a parsimony framework on a genomic scale, if problems of orthology determination and potential sequencing errors can be overcome. In addition, approaches that examine conflict and support in a simultaneous analysis framework allow for a more precise understanding of the evolutionary history of individual process partitions and may be a novel way to understand functional aspects of different kinds of cellular classes of gene products.

UR - http://www.scopus.com/inward/record.url?scp=33748766899&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33748766899&partnerID=8YFLogxK

U2 - 10.1186/1471-2148-6-48

DO - 10.1186/1471-2148-6-48

M3 - Article

C2 - 16776834

AN - SCOPUS:33748766899

VL - 6

JO - BMC Evolutionary Biology

JF - BMC Evolutionary Biology

SN - 1471-2148

M1 - 48

ER -