University of Maryland Libraries: Digital Repository at the University of Maryland

    PubMed related articles: a probabilistic topic-based model for content similarity

    1471-2105-8-423.pdf (1.515Mb)

    External Link(s)
    https://doi.org/10.1186/1471-2105-8-423
    Date
    2007-10-30
    Author
    Lin, Jimmy
    Wilbur, W John
    Citation
    Lin, J., Wilbur, W.J. PubMed related articles: a probabilistic topic-based model for content similarity. BMC Bioinformatics 8, 423 (2007).
    DRUM DOI
    https://doi.org/10.13016/gfgf-atuw
    Abstract
    We present a probabilistic topic-based model for content similarity called pmra that underlies the related article search feature in PubMed. Whether or not a document is about a particular topic is computed from term frequencies, modeled as Poisson distributions. Unlike previous probabilistic retrieval models, we do not attempt to estimate relevance; rather, our focus is "relatedness", the probability that a user would want to examine a particular document given known interest in another. We also describe a novel technique for estimating parameters that does not require human relevance judgments; instead, the process is based on the existence of MeSH® in MEDLINE®. The pmra retrieval model was compared against bm25, a competitive probabilistic model that shares theoretical similarities. Experiments using the test collection from the TREC 2005 genomics track show a small but statistically significant improvement of pmra over bm25 in terms of precision. Our experiments suggest that the pmra model provides an effective ranking algorithm for related article search.
    URI
    http://hdl.handle.net/1903/28203
    Collections
    • Information Studies Research Works
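
    Illustrative sketch (not part of the repository record)
    The abstract states that whether a document is about a topic is computed from term frequencies modeled as Poisson distributions. The following is a minimal sketch of that general idea, assuming a two-Poisson mixture in which a term occurs at one rate in documents that are about its topic and at a lower rate elsewhere. It is not the authors' exact formulation: the function names, the prior, and the rate values are hypothetical, and document length is ignored for simplicity.

```python
import math


def poisson_pmf(k: int, rate: float) -> float:
    """Probability of observing k occurrences under a Poisson with the given rate."""
    return math.exp(-rate) * rate ** k / math.factorial(k)


def prob_about_topic(k: int, prior: float, rate_about: float, rate_not_about: float) -> float:
    """Posterior probability (Bayes' rule) that a document is about a term's topic,
    given that the term occurs k times.

    All parameters are illustrative assumptions, not values from the paper:
      prior          -- assumed prior probability that a document is about the topic
      rate_about     -- assumed Poisson rate of the term in on-topic documents
      rate_not_about -- assumed Poisson rate of the term in off-topic documents
    """
    about = prior * poisson_pmf(k, rate_about)
    not_about = (1.0 - prior) * poisson_pmf(k, rate_not_about)
    return about / (about + not_about)


if __name__ == "__main__":
    # Hypothetical parameters: the term is rare off-topic and frequent on-topic.
    for count in range(5):
        p = prob_about_topic(count, prior=0.2, rate_about=3.0, rate_not_about=0.5)
        print(f"term count {count}: P(about topic) = {p:.3f}")
```

    Under these assumed parameters the posterior rises with the term count, which is the behavior the abstract's description implies: more occurrences of a term make it more likely that the document is about the corresponding topic.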

    DRUM is brought to you by the University of Maryland Libraries
    University of Maryland, College Park, MD 20742-7011 (301)314-1328.