PageRank without hyperlinks: Reranking with PubMed related article networks for biomedical text retrieval

dc.contributor.authorLin, Jimmy
dc.date.accessioned2021-12-06T15:42:24Z
dc.date.available2021-12-06T15:42:24Z
dc.date.issued2008-06-06
dc.description.abstractGraph analysis algorithms such as PageRank and HITS have been successful in Web environments because they are able to extract important inter-document relationships from manually-created hyperlinks. We consider the application of these techniques to biomedical text retrieval. In the current PubMed® search interface, a MEDLINE® citation is connected to a number of related citations, which are in turn connected to other citations. Thus, a MEDLINE record represents a node in a vast content-similarity network. This article explores the hypothesis that these networks can be exploited for text retrieval, in the same manner as hyperlink graphs on the Web. We conducted a number of reranking experiments using the TREC 2005 genomics track test collection in which scores extracted from PageRank and HITS analysis were combined with scores returned by an off-the-shelf retrieval engine. Experiments demonstrate that incorporating PageRank scores yields significant improvements in terms of standard ranked-retrieval metrics. The link structure of content-similarity networks can be exploited to improve the effectiveness of information retrieval systems. These results generalize the applicability of graph analysis algorithms to text retrieval in the biomedical domain.en_US
dc.description.urihttps://doi.org/10.1186/1471-2105-9-270
dc.identifierhttps://doi.org/10.13016/eh1g-07n0
dc.identifier.citationLin, J. PageRank without hyperlinks: Reranking with PubMed related article networks for biomedical text retrieval. BMC Bioinformatics 9, 270 (2008).en_US
dc.identifier.urihttp://hdl.handle.net/1903/28195
dc.language.isoen_USen_US
dc.publisherSpringer Natureen_US
dc.relation.isAvailableAtCollege of Information Studiesen_us
dc.relation.isAvailableAtInformation Studiesen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.subjectRelated Articleen_US
dc.subjectTest Collectionen_US
dc.subjectRelevance Judgmenten_US
dc.subjectText Retrievalen_US
dc.subjectRetrieval Effectivenessen_US
dc.titlePageRank without hyperlinks: Reranking with PubMed related article networks for biomedical text retrievalen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1471-2105-9-270.pdf
Size:
823.65 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.57 KB
Format:
Item-specific license agreed upon to submission
Description: