|
Research highlight : DISCOVERING CROSS-LANGUAGE LINKS IN WIKIPEDIA THROUGH SEMANTIC RELATEDNESS |
|
|
|
|
DISCOVERING CROSS-LANGUAGE LINKS IN WIKIPEDIA THROUGH SEMANTIC RELATEDNESS
12 June 2012
Antonio Penta, Gianluca Quercini, Chantal Reynaud and Nigel Shadbolt, in European Conference on Artificial Intelligence (ECAI 2012)
|
Wikipedia is a large multilingual collection of interlinked articles, used and contributed by millions of users over the Internet, that provides editions in up to 283 languages. Two articles in different language versions of Wikipedia may have information on exactly the same concept, in which case they are often connected through a cross-language link. However, many cross-language links are either missing or incorrect and this negatively affects both the readers of Wikipedia and multiplingual information retrieval applications. In this paper, we propose WIKICL, an algorithm for discovering cross-language links using the semantic relatedness of two articles derived from the Wikipedia graph structure. Our evaluation shows that we achieve comparable, and in some cases, better results than previous methods with much less computational time.
Keyword
° Artificial Intelligence ° Information integration
Group
° Artificial Intelligence and Inference Systems
Contact
[none]
|
| |
|
|
|
|