Functional protein representations from biological networks enable diverse cross-species inference

dc.contributor.authorFan, Jason
dc.contributor.authorCannistra, Anthony
dc.contributor.authorFried, Inbar
dc.contributor.authorLim, Tim
dc.contributor.authorSchaffner, Thomas
dc.contributor.authorCrovella, Mark
dc.contributor.authorHescott, Benjamin
dc.contributor.authorLeiserson, Mark D.M.
dc.date.accessioned2019-12-12T16:27:25Z
dc.date.available2019-12-12T16:27:25Z
dc.date.issued2019-03-08
dc.descriptionPartial funding for Open Access provided by the UMD Libraries' Open Access Publishing Fund.
dc.description.abstractTransferring knowledge between species is key for many biological applications, but is complicated by divergent and convergent evolution. Many current approaches for this problem leverage sequence and interaction network data to transfer knowledge across species, exemplified by network alignment methods. While these techniques do well, they are limited in scope, creating metrics to address one specific problem or task. We take a different approach by creating an environment where multiple knowledge transfer tasks can be performed using the same protein representations. Specifically, our kernel-based method, MUNK, integrates sequence and network structure to create functional protein representations, embedding proteins from different species in the same vector space. First we show proteins in different species that are close in MUNKspace are functionally similar. Next,we use these representations to share knowledge of synthetic lethal interactions between species. Importantly, we find that the results using MUNK-representations are at least as accurate as existing algorithms for these tasks. Finally, we generalize the notion of a phenolog (‘orthologous phenotype’) to use functionally similar proteins (i.e. those with similar representations). We demonstrate the utility of this broadened notion by using it to identify known phenologs and novel non-obvious ones supported by current research.en_US
dc.identifierhttps://doi.org/10.13016/cz0r-jn4u
dc.identifier.citationJason Fan, Anthony Cannistra, Inbar Fried, Tim Lim, Thomas Schaffner, Mark Crovella, Benjamin Hescott, Mark D M Leiserson, Functional protein representations from biological networks enable diverse cross-species inference, Nucleic Acids Research, Volume 47, Issue 9, 21 May 2019, Page e51, https://doi.org/10.1093/nar/gkz132en_US
dc.identifier.urihttp://hdl.handle.net/1903/25317
dc.language.isoen_USen_US
dc.publisherOxforden_US
dc.relation.isAvailableAtCollege of Computer, Mathematical & Natural Sciencesen_us
dc.relation.isAvailableAtComputer Scienceen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.subjectGenomicsen_US
dc.subjectComputational Methodsen_US
dc.titleFunctional protein representations from biological networks enable diverse cross-species inferenceen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
gkz132.pdf
Size:
1.09 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.57 KB
Format:
Item-specific license agreed upon to submission
Description: