TY - GEN
T1 - MENTA
T2 - 19th International Conference on Information and Knowledge Management and Co-located Workshops, CIKM'10
AU - De Melo, Gerard
AU - Weikum, Gerhard
PY - 2010
Y1 - 2010
N2 - In recent years, a number of projects have turned to Wikipedia to establish large-scale taxonomies that describe orders of magnitude more entities than traditional manually built knowledge bases. So far, however, the multilingual nature of Wikipedia has largely been neglected. This paper investigates how entities from all editions of Wikipedia as well as WordNet can be integrated into a single coherent taxonomic class hierarchy. We rely on linking heuristics to discover potential taxonomic relationships, graph partitioning to form consistent equivalence classes of entities, and a Markov chain-based ranking approach to construct the final taxonomy. This results in MENTA (Multilingual Entity Taxonomy), a resource that describes 5.4 million entities and is presumably the largest multilingual lexical knowledge base currently available.
AB - In recent years, a number of projects have turned to Wikipedia to establish large-scale taxonomies that describe orders of magnitude more entities than traditional manually built knowledge bases. So far, however, the multilingual nature of Wikipedia has largely been neglected. This paper investigates how entities from all editions of Wikipedia as well as WordNet can be integrated into a single coherent taxonomic class hierarchy. We rely on linking heuristics to discover potential taxonomic relationships, graph partitioning to form consistent equivalence classes of entities, and a Markov chain-based ranking approach to construct the final taxonomy. This results in MENTA (Multilingual Entity Taxonomy), a resource that describes 5.4 million entities and is presumably the largest multilingual lexical knowledge base currently available.
KW - Algorithms
UR - http://www.scopus.com/inward/record.url?scp=78651269398&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78651269398&partnerID=8YFLogxK
U2 - 10.1145/1871437.1871577
DO - 10.1145/1871437.1871577
M3 - Conference contribution
AN - SCOPUS:78651269398
SN - 9781450300995
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 1099
EP - 1108
BT - CIKM'10 - Proceedings of the 19th International Conference on Information and Knowledge Management and Co-located Workshops
Y2 - 26 October 2010 through 30 October 2010
ER -