TY - GEN
T1 - Empirical comparisons of MASC word sense annotations
AU - De Melo, Gerard
AU - Baker, Collin F.
AU - Ide, Nancy
AU - Passonneau, Rebecca J.
AU - Fellbaum, Christiane
PY - 2012
Y1 - 2012
N2 - We analyze how different conceptions of lexical semantics affect sense annotations and how multiple sense inventories can be compared empirically, based on annotated text. Our study focuses on the MASC project, where data has been annotated using WordNet sense identifiers on the one hand, and FrameNet lexical units on the other. This allows us to compare the sense inventories of these lexical resources empirically rather than just theoretically, based on their glosses, leading to new insights. In particular, we compute contingency matrices and develop a novel measure, the Expected Jaccard Index, that quantifies the agreement between annotations of the same data based on two different resources even when they have different sets of categories.
KW - Lexical resources
KW - Lexical semantics
KW - Statistical methods
UR - http://www.scopus.com/inward/record.url?scp=84929349762&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84929349762&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84929349762
T3 - Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012
SP - 3036
EP - 3043
BT - Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012
A2 - Dogan, Mehmet Ugur
A2 - Mariani, Joseph
A2 - Moreno, Asuncion
A2 - Goggi, Sara
A2 - Choukri, Khalid
A2 - Calzolari, Nicoletta
A2 - Odijk, Jan
A2 - Declerck, Thierry
A2 - Maegaard, Bente
A2 - Piperidis, Stelios
A2 - Mazo, Helene
A2 - Hamon, Olivier
PB - European Language Resources Association (ELRA)
T2 - 8th International Conference on Language Resources and Evaluation, LREC 2012
Y2 - 21 May 2012 through 27 May 2012
ER -