VizMaps: A Bayesian Topic Modeling Based PubMed Search Interface

dc.contributor.advisorBanks, David L
dc.contributor.advisorBecker, Charles Maxwell
dc.contributor.authorKamboj, Kirti
dc.date.accessioned2016-01-04T19:37:31Z
dc.date.available2016-06-09T04:30:04Z
dc.date.issued2015
dc.departmentStatistical and Economic Modeling
dc.description.abstract<p>A common challenge that users of academic databases face is making sense of their query outputs for knowledge discovery. This is exacerbated by the size and growth of modern databases. PubMed, a central index of biomedical literature, contains over 25 million citations, and can output search results containing hundreds of thousands of citations. Under these conditions, efficient knowledge discovery requires a different data structure than a chronological list of articles. It requires a method of conveying what the important ideas are, where they are located, and how they are connected; a method of allowing users to see the underlying topical structure of their search. This paper presents VizMaps, a PubMed search interface that addresses some of these problems. Given search terms, our main backend pipeline extracts relevant words from the title and abstract, and clusters them into discovered topics using Bayesian topic models, in particular the Latent Dirichlet Allocation (LDA). It then outputs a visual, navigable map of the query results.</p>
dc.identifier.urihttps://hdl.handle.net/10161/11411
dc.subjectStatistics
dc.subjectBioinformatics
dc.subjectInformation science
dc.subjectinformation retrieval
dc.subjectknowledge discovery
dc.subjectLatent Dirichlet allocation
dc.subjectPubMed
dc.subjectsearch interface
dc.subjectTopic modeling
dc.titleVizMaps: A Bayesian Topic Modeling Based PubMed Search Interface
dc.typeMaster's thesis
duke.embargo.months5

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Kamboj_duke_0066N_13245.pdf
Size:
655.47 KB
Format:
Adobe Portable Document Format

Collections