Inferring evolution of gene duplicates using probabilistic models and nonparametric belief propagation

dc.contributor.authorZeng, Jia
dc.contributor.authorHannenhalli, Sridhar
dc.date.accessioned2021-09-28T14:38:00Z
dc.date.available2021-09-28T14:38:00Z
dc.date.issued2013-01-21
dc.description.abstractGene duplication, followed by functional evolution of duplicate genes, is a primary engine of evolutionary innovation. In turn, gene expression evolution is a critical component of overall functional evolution of paralogs. Inferring evolutionary history of gene expression among paralogs is therefore a problem of considerable interest. It also represents significant challenges. The standard approaches of evolutionary reconstruction assume that at an internal node of the duplication tree, the two duplicates evolve independently. However, because of various selection pressures functional evolution of the two paralogs may be coupled. The coupling of paralog evolution corresponds to three major fates of gene duplicates: subfunctionalization (SF), conserved function (CF) or neofunctionalization (NF). Quantitative analysis of these fates is of great interest and clearly influences evolutionary inference of expression. These two interrelated problems of inferring gene expression and evolutionary fates of gene duplicates have not been studied together previously and motivate the present study. Here we propose a novel probabilistic framework and algorithm to simultaneously infer (i) ancestral gene expression and (ii) the likely fate (SF, NF, CF) at each duplication event during the evolution of gene family. Using tissue-specific gene expression data, we develop a nonparametric belief propagation (NBP) algorithm to predict the ancestral expression level as a proxy for function, and describe a novel probabilistic model that relates the predicted and known expression levels to the possible evolutionary fates. We validate our model using simulation and then apply it to a genome-wide set of gene duplicates in human. Our results suggest that SF tends to be more frequent at the earlier stage of gene family expansion, while NF occurs more frequently later on.en_US
dc.description.urihttps://doi.org/10.1186/1471-2164-14-S1-S15
dc.identifierhttps://doi.org/10.13016/nme5-aly0
dc.identifier.citationZeng, J., Hannenhalli, S. Inferring evolution of gene duplicates using probabilistic models and nonparametric belief propagation. BMC Genomics 14, S15 (2013).en_US
dc.identifier.urihttp://hdl.handle.net/1903/28034
dc.language.isoen_USen_US
dc.publisherSpringer Natureen_US
dc.relation.isAvailableAtCollege of Computer, Mathematical & Physical Sciencesen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtBiologyen_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.subjectFeature Pointen_US
dc.subjectLeaf Nodeen_US
dc.subjectGaussian Mixture Modelen_US
dc.subjectInternal Nodeen_US
dc.subjectConserve Functionen_US
dc.titleInferring evolution of gene duplicates using probabilistic models and nonparametric belief propagationen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1471-2164-14-S1-S15.pdf
Size:
4.19 MB
Format:
Adobe Portable Document Format
Description: