Extending a lexicon of Portuguese nominalizations with data from corpora

Cláudia Freitas, Valeria de Paiva, Alexandre Rademaker, Gerard de Melo, Livy Real, Anne Silva

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

We describe the extension of a lexicon of nominalizations in Portuguese, NomLex-PT, with nominals from a collection of corpora, the AC/DC corpora. The resulting lexicon of nominalizations is RDFencoded and integrated with OpenWordNet-PT, a Portuguese WordNet freely available to download and consult. We discuss the reasons for this extension with corpus data, the methodology we followed, as well as our reasons for suggesting that the extended lexicon of nominalizations is a useful resource for researchers interested in Knowledge Representation of information extracted from Portuguese texts.

Original languageEnglish (US)
Title of host publicationComputational Processing of the Portuguese Language - 11th International Conference, PROPOR 2014, Proceedings
EditorsNuno Mamede, Jorge Baptista, Sara Candeias, Ivandré Paraboni, Sara Candeias, Thiago Alexandre Salgueiro Pardo, Maria das Graças Volpe Nunes
PublisherSpringer Verlag
Pages114-124
Number of pages11
ISBN (Electronic)9783319097602
DOIs
StatePublished - 2014
Event11th International Conference on Computational Processing of Portuguese, PROPOR 2014 - Sao Carlos/SP, Brazil
Duration: Oct 6 2014Oct 8 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8775
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other11th International Conference on Computational Processing of Portuguese, PROPOR 2014
CountryBrazil
CitySao Carlos/SP
Period10/6/1410/8/14

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Keywords

  • Corpora
  • Lexical resources
  • Nominalizations
  • Portuguese

Fingerprint Dive into the research topics of 'Extending a lexicon of Portuguese nominalizations with data from corpora'. Together they form a unique fingerprint.

  • Cite this

    Freitas, C., de Paiva, V., Rademaker, A., de Melo, G., Real, L., & Silva, A. (2014). Extending a lexicon of Portuguese nominalizations with data from corpora. In N. Mamede, J. Baptista, S. Candeias, I. Paraboni, S. Candeias, T. A. S. Pardo, & M. D. G. V. Nunes (Eds.), Computational Processing of the Portuguese Language - 11th International Conference, PROPOR 2014, Proceedings (pp. 114-124). [LNAI 8775] (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 8775). Springer Verlag. https://doi.org/10.1007/978-3-319-09761-9