Extending a lexicon of Portuguese nominalizations with data from corpora

Cláudia Freitas, Valeria de Paiva, Alexandre Rademaker, Gerard De Melo, Livy Real, Anne Silva

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

We describe the extension of a lexicon of nominalizations in Portuguese, NomLex-PT, with nominals from a collection of corpora, the AC/DC corpora. The resulting lexicon of nominalizations is RDFencoded and integrated with OpenWordNet-PT, a Portuguese WordNet freely available to download and consult. We discuss the reasons for this extension with corpus data, the methodology we followed, as well as our reasons for suggesting that the extended lexicon of nominalizations is a useful resource for researchers interested in Knowledge Representation of information extracted from Portuguese texts.

Original languageEnglish (US)
Title of host publicationComputational Processing of the Portuguese Language - 11th International Conference, PROPOR 2014, Proceedings
EditorsNuno Mamede, Jorge Baptista, Sara Candeias, Ivandré Paraboni, Sara Candeias, Thiago Alexandre Salgueiro Pardo, Maria das Graças Volpe Nunes
PublisherSpringer Verlag
Pages114-124
Number of pages11
ISBN (Electronic)9783319097602
DOIs
StatePublished - Jan 1 2014
Event11th International Conference on Computational Processing of Portuguese, PROPOR 2014 - Sao Carlos/SP, Brazil
Duration: Oct 6 2014Oct 8 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8775
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other11th International Conference on Computational Processing of Portuguese, PROPOR 2014
CountryBrazil
CitySao Carlos/SP
Period10/6/1410/8/14

Fingerprint

Knowledge representation
WordNet
Knowledge Representation
Categorical or nominal
Resources
Methodology
Corpus

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Keywords

  • Corpora
  • Lexical resources
  • Nominalizations
  • Portuguese

Cite this

Freitas, C., de Paiva, V., Rademaker, A., De Melo, G., Real, L., & Silva, A. (2014). Extending a lexicon of Portuguese nominalizations with data from corpora. In N. Mamede, J. Baptista, S. Candeias, I. Paraboni, S. Candeias, T. A. S. Pardo, & M. D. G. V. Nunes (Eds.), Computational Processing of the Portuguese Language - 11th International Conference, PROPOR 2014, Proceedings (pp. 114-124). [LNAI 8775] (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 8775). Springer Verlag. https://doi.org/10.1007/978-3-319-09761-9
Freitas, Cláudia ; de Paiva, Valeria ; Rademaker, Alexandre ; De Melo, Gerard ; Real, Livy ; Silva, Anne. / Extending a lexicon of Portuguese nominalizations with data from corpora. Computational Processing of the Portuguese Language - 11th International Conference, PROPOR 2014, Proceedings. editor / Nuno Mamede ; Jorge Baptista ; Sara Candeias ; Ivandré Paraboni ; Sara Candeias ; Thiago Alexandre Salgueiro Pardo ; Maria das Graças Volpe Nunes. Springer Verlag, 2014. pp. 114-124 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{d680ff96bcc54b12bdb1b20690f169f0,
title = "Extending a lexicon of Portuguese nominalizations with data from corpora",
abstract = "We describe the extension of a lexicon of nominalizations in Portuguese, NomLex-PT, with nominals from a collection of corpora, the AC/DC corpora. The resulting lexicon of nominalizations is RDFencoded and integrated with OpenWordNet-PT, a Portuguese WordNet freely available to download and consult. We discuss the reasons for this extension with corpus data, the methodology we followed, as well as our reasons for suggesting that the extended lexicon of nominalizations is a useful resource for researchers interested in Knowledge Representation of information extracted from Portuguese texts.",
keywords = "Corpora, Lexical resources, Nominalizations, Portuguese",
author = "Cl{\'a}udia Freitas and {de Paiva}, Valeria and Alexandre Rademaker and {De Melo}, Gerard and Livy Real and Anne Silva",
year = "2014",
month = "1",
day = "1",
doi = "10.1007/978-3-319-09761-9",
language = "English (US)",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "114--124",
editor = "Nuno Mamede and Jorge Baptista and Sara Candeias and Ivandr{\'e} Paraboni and Sara Candeias and Pardo, {Thiago Alexandre Salgueiro} and Nunes, {Maria das Gra{\cc}as Volpe}",
booktitle = "Computational Processing of the Portuguese Language - 11th International Conference, PROPOR 2014, Proceedings",
address = "Germany",

}

Freitas, C, de Paiva, V, Rademaker, A, De Melo, G, Real, L & Silva, A 2014, Extending a lexicon of Portuguese nominalizations with data from corpora. in N Mamede, J Baptista, S Candeias, I Paraboni, S Candeias, TAS Pardo & MDGV Nunes (eds), Computational Processing of the Portuguese Language - 11th International Conference, PROPOR 2014, Proceedings., LNAI 8775, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 8775, Springer Verlag, pp. 114-124, 11th International Conference on Computational Processing of Portuguese, PROPOR 2014, Sao Carlos/SP, Brazil, 10/6/14. https://doi.org/10.1007/978-3-319-09761-9

Extending a lexicon of Portuguese nominalizations with data from corpora. / Freitas, Cláudia; de Paiva, Valeria; Rademaker, Alexandre; De Melo, Gerard; Real, Livy; Silva, Anne.

Computational Processing of the Portuguese Language - 11th International Conference, PROPOR 2014, Proceedings. ed. / Nuno Mamede; Jorge Baptista; Sara Candeias; Ivandré Paraboni; Sara Candeias; Thiago Alexandre Salgueiro Pardo; Maria das Graças Volpe Nunes. Springer Verlag, 2014. p. 114-124 LNAI 8775 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 8775).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Extending a lexicon of Portuguese nominalizations with data from corpora

AU - Freitas, Cláudia

AU - de Paiva, Valeria

AU - Rademaker, Alexandre

AU - De Melo, Gerard

AU - Real, Livy

AU - Silva, Anne

PY - 2014/1/1

Y1 - 2014/1/1

N2 - We describe the extension of a lexicon of nominalizations in Portuguese, NomLex-PT, with nominals from a collection of corpora, the AC/DC corpora. The resulting lexicon of nominalizations is RDFencoded and integrated with OpenWordNet-PT, a Portuguese WordNet freely available to download and consult. We discuss the reasons for this extension with corpus data, the methodology we followed, as well as our reasons for suggesting that the extended lexicon of nominalizations is a useful resource for researchers interested in Knowledge Representation of information extracted from Portuguese texts.

AB - We describe the extension of a lexicon of nominalizations in Portuguese, NomLex-PT, with nominals from a collection of corpora, the AC/DC corpora. The resulting lexicon of nominalizations is RDFencoded and integrated with OpenWordNet-PT, a Portuguese WordNet freely available to download and consult. We discuss the reasons for this extension with corpus data, the methodology we followed, as well as our reasons for suggesting that the extended lexicon of nominalizations is a useful resource for researchers interested in Knowledge Representation of information extracted from Portuguese texts.

KW - Corpora

KW - Lexical resources

KW - Nominalizations

KW - Portuguese

UR - http://www.scopus.com/inward/record.url?scp=84908565522&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84908565522&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-09761-9

DO - 10.1007/978-3-319-09761-9

M3 - Conference contribution

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 114

EP - 124

BT - Computational Processing of the Portuguese Language - 11th International Conference, PROPOR 2014, Proceedings

A2 - Mamede, Nuno

A2 - Baptista, Jorge

A2 - Candeias, Sara

A2 - Paraboni, Ivandré

A2 - Candeias, Sara

A2 - Pardo, Thiago Alexandre Salgueiro

A2 - Nunes, Maria das Graças Volpe

PB - Springer Verlag

ER -

Freitas C, de Paiva V, Rademaker A, De Melo G, Real L, Silva A. Extending a lexicon of Portuguese nominalizations with data from corpora. In Mamede N, Baptista J, Candeias S, Paraboni I, Candeias S, Pardo TAS, Nunes MDGV, editors, Computational Processing of the Portuguese Language - 11th International Conference, PROPOR 2014, Proceedings. Springer Verlag. 2014. p. 114-124. LNAI 8775. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-09761-9