EDBT/ICDT 2009 Joint Conference

Electronic Conference Proceedings

NNexus: An Automatic Linker for Collaborative Web-Based Corpora

Authors

Abstract

Collaborative online encyclopedias or knowledge bases such as Wikipedia and PlanetMath are becoming increasingly popular. We demonstrate NNexus, a generalization of the automatic linking engine of PlanetMath.org and the first system that automates the process of linking disparate “encyclopedia” entries into a fully-connected conceptual network. The main challenges of this problem space include: 1) linking quality (correctly identifying which terms to link and which entry to link to with minimal effort on the part of users), 2) efficiency and scalability, and 3) generalization to multiple knowledge bases and web-based information environment. We present NNexus that utilizes subject classification and other metadata to address these challenges and demonstrate its effectiveness and efficiency through multiple real world corpora.

Session

EDBT Demo Session 2: Demo Group 2 (Wednesday, March 25, 14:00—17:30)