Empirical comparisons of MASC word sense annotations

Gerard De Melo, Collin F. Baker, Nancy Ide, Rebecca J. Passonneau, Christiane Fellbaum

Research output: Chapter in Book/Report/Conference proceeding (Conference contribution)


Abstract

We analyze how different conceptions of lexical semantics affect sense annotations and how multiple sense inventories can be compared empirically on the basis of annotated text. Our study focuses on the MASC project, in which the same data has been annotated with WordNet sense identifiers on the one hand and FrameNet lexical units on the other. This allows us to compare the sense inventories of these lexical resources empirically, rather than only theoretically on the basis of their glosses, leading to new insights. In particular, we compute contingency matrices and develop a novel measure, the Expected Jaccard Index, that quantifies the agreement between annotations of the same data based on two different resources, even when the resources have different sets of categories.
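
As a rough illustration of the contingency-matrix step mentioned in the abstract, the following minimal Python sketch counts how often each label from one inventory co-occurs with each label from another on the same instances. The label names (`wn_sense_1`, `fn_lu_A`, and so on), the toy data, and both helper functions are hypothetical and not taken from the paper. The `jaccard` helper computes only the ordinary Jaccard index between two instance sets; it is not the Expected Jaccard Index, whose definition is not given in this record.

```python
from collections import Counter


def contingency_matrix(pairs):
    """Count co-occurrences of (label_a, label_b) over doubly annotated instances.

    `pairs` holds one (label_a, label_b) tuple per instance, where label_a comes
    from the first inventory and label_b from the second.
    """
    counts = Counter(pairs)
    rows = sorted({a for a, _ in pairs})
    cols = sorted({b for _, b in pairs})
    matrix = [[counts[(a, b)] for b in cols] for a in rows]
    return rows, cols, matrix


def jaccard(pairs, label_a, label_b):
    """Ordinary Jaccard index between the instances tagged label_a and label_b.

    This is NOT the paper's Expected Jaccard Index, only the standard set overlap.
    """
    set_a = {i for i, (a, _) in enumerate(pairs) if a == label_a}
    set_b = {i for i, (_, b) in enumerate(pairs) if b == label_b}
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0


# Hypothetical toy data: five instances of one word, each with two annotations.
annotations = [
    ("wn_sense_1", "fn_lu_A"),
    ("wn_sense_1", "fn_lu_A"),
    ("wn_sense_2", "fn_lu_A"),
    ("wn_sense_2", "fn_lu_B"),
    ("wn_sense_3", "fn_lu_B"),
]

rows, cols, matrix = contingency_matrix(annotations)
for label, row in zip(rows, matrix):
    print(label, row)
```

In this toy example the second row is split across both columns, which is the kind of mismatch between inventories that a contingency matrix makes visible: the two resources draw their category boundaries in different places, so no one-to-one mapping between labels exists.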

Original language: English (US)
Title of host publication: Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012
Editors: Mehmet Ugur Dogan, Joseph Mariani, Asuncion Moreno, Sara Goggi, Khalid Choukri, Nicoletta Calzolari, Jan Odijk, Thierry Declerck, Bente Maegaard, Stelios Piperidis, Helene Mazo, Olivier Hamon
Publisher: European Language Resources Association (ELRA)
Pages: 3036-3043
Number of pages: 8
ISBN (Electronic): 9782951740877
State: Published - 2012
Externally published: Yes
Event: 8th International Conference on Language Resources and Evaluation, LREC 2012 - Istanbul, Turkey
Duration: May 21 2012 to May 27 2012

Publication series

Name: Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012

Other

Other: 8th International Conference on Language Resources and Evaluation, LREC 2012
Country: Turkey
City: Istanbul
Period: 5/21/12 to 5/27/12

All Science Journal Classification (ASJC) codes

  • Linguistics and Language
  • Language and Linguistics
  • Education
  • Library and Information Sciences

Keywords

  • Lexical resources
  • Lexical semantics
  • Statistical methods
