SciSumm: A multi-document summarization system for scientific articles

Nitin Agarwal, Ravi Shankar Reddy, Kiran Gvr, Carolyn Penstein Rosé

Research output: Chapter in Book/Report/Conference proceedingConference contribution

18 Scopus citations

Abstract

In this demo, we present SciSumm, an interactive multi-document summarization system for scientific articles. The document collection to be summarized is a list of papers cited together within the same source article, otherwise known as a co-citation. At the heart of the approach is a topic based clustering of fragments extracted from each article based on queries generated from the context surrounding the co-cited list of papers. This analysis enables the generation of an overview of common themes from the co-cited papers that relate to the context in which the co-citation was found. SciSumm is currently built over the 2008 ACL Anthology, however the generalizable nature of the summarization techniques and the extensible architecture makes it possible to use the system with other corpora where a citation network is available. Evaluation results on the same corpus demonstrate that our system performs better than an existing widely used multi-document summarization system (MEAD).

Original languageEnglish (US)
Title of host publicationACL HLT 2011 - 49th Annual Meeting of the Association for Computational Linguistics
Subtitle of host publicationHuman Language Technologies, Proceedings of Student Session
Pages115-120
Number of pages6
StatePublished - 2011
Externally publishedYes
Event49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL HLT 2011 - Portland, OR, United States
Duration: Jun 19 2011Jun 24 2011

Publication series

NameACL HLT 2011 - 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of Student Session

Conference

Conference49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL HLT 2011
Country/TerritoryUnited States
CityPortland, OR
Period6/19/116/24/11

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'SciSumm: A multi-document summarization system for scientific articles'. Together they form a unique fingerprint.

Cite this