Utilizing context in generative bayesian models for linked corpus

Saurabh Kataria, Prasenjit Mitra, Sumit Bhatia

Research output: Chapter in Book/Report/Conference proceedingConference contribution

37 Scopus citations

Abstract

In an interlinked corpus of documents, the context in which a citation appears provides extra information about the cited document. However, associating terms in the context to the cited document remains an open problem. We propose a novel document generation approach that statistically incor porates the context in which a document links to another doc ument. We quantitatively show that the proposed generation scheme explains the linking phenomenon better than previous approaches. The context information along with the actual content of the document provides signicant improvements over the previous approaches for various real world evalua tion tasks such as link prediction and log-likelihood estima tion on unseen content. The proposed method is more scal able to large collection of documents compared to the previ ous approaches.

Original languageEnglish (US)
Title of host publicationAAAI-10 / IAAI-10 - Proceedings of the 24th AAAI Conference on Artificial Intelligence and the 22nd Innovative Applications of Artificial Intelligence Conference
PublisherAI Access Foundation
Pages1340-1345
Number of pages6
ISBN (Print)9781577354666
StatePublished - 2010
Event24th AAAI Conference on Artificial Intelligence and the 22nd Innovative Applications of Artificial Intelligence Conference, AAAI-10 / IAAI-10 - Atlanta, GA, United States
Duration: Jul 11 2010Jul 15 2010

Publication series

NameProceedings of the National Conference on Artificial Intelligence
Volume3

Other

Other24th AAAI Conference on Artificial Intelligence and the 22nd Innovative Applications of Artificial Intelligence Conference, AAAI-10 / IAAI-10
Country/TerritoryUnited States
CityAtlanta, GA
Period7/11/107/15/10

All Science Journal Classification (ASJC) codes

  • Software
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Utilizing context in generative bayesian models for linked corpus'. Together they form a unique fingerprint.

Cite this