Coalescent Methods for Estimating Species Trees from Phylogenomic Data

Liang Liu, Shaoyuan Wu, Lili Yu

Research output: Contribution to journalArticlepeer-review

75 Scopus citations

Abstract

Genome-scale sequence data have become increasingly available in the phylogenetic studies for understanding the evolutionary histories of species. However, it is challenging to develop probabilistic models to account for heterogeneity of phylogenomic data. The multispecies coalescent model describes gene trees as independent random variables generated from a coalescence process occurring along the lineages of the species tree. Since the multispecies coalescent model allows gene trees to vary across genes, coalescent-based methods have been popularly used to account for heterogeneous gene trees in phylogenomic data analysis. In this paper, we summarize and evaluate the performance of coalescent-based methods for estimating species trees from genome-scale sequence data. We investigate the effects of deep coalescence and mutation on the performance of species tree estimation methods. We found that the coalescent-based methods perform well in estimating species trees for a large number of genes, regardless of the degree of deep coalescence and mutation. The performance of the coalescent methods is negatively correlated with the lengths of internal branches of the species tree.

Original languageAmerican English
JournalJournal of Systematics and Evolution
Volume53
DOIs
StatePublished - Sep 1 2015

Disciplines

  • Public Health
  • Biostatistics
  • Community Health

Keywords

  • Gene tree
  • Genome-scale sequence
  • Multispecies coalescent model
  • Species tree

Fingerprint

Dive into the research topics of 'Coalescent Methods for Estimating Species Trees from Phylogenomic Data'. Together they form a unique fingerprint.

Cite this