NNexus: An Automatic Linker for Collaborative Web-Based Corpora
Authors
- James Gardner (Emory University, USA)
- Aaron Krowne (PlanetMath.org, USA)
- Li Xiong (Emory University, USA)
Abstract
Collaborative online encyclopedias and knowledge bases such as Wikipedia and PlanetMath are becoming increasingly popular. We demonstrate NNexus, a generalization of the automatic linking engine of PlanetMath.org and the first system that automates the process of linking disparate “encyclopedia” entries into a fully connected conceptual network. The main challenges in this problem space are: 1) linking quality (correctly identifying which terms to link and which entry to link to, with minimal effort on the part of users), 2) efficiency and scalability, and 3) generalization to multiple knowledge bases and web-based information environments. NNexus uses subject classification and other metadata to address these challenges, and we demonstrate its effectiveness and efficiency on multiple real-world corpora.
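To make the linking-quality challenge concrete, the following is a minimal sketch of classification-steered linking. It is not the NNexus implementation. The concept map, the entry names, the MSC-style class codes, and the helpers `class_distance` and `link_terms` are all hypothetical, and the prefix-based distance is a simplifying assumption standing in for whatever measure the real system uses. The sketch finds known terms in an entry's text and, when a term has several candidate targets, picks the one whose subject class is closest to the class of the source entry.

```python
import re

# Hypothetical concept map: term -> candidate entries as (entry_id, subject_class).
# Entry names and MSC-style codes are illustrative, not taken from PlanetMath.
CONCEPT_MAP = {
    "graph": [("GraphTheoryGraph", "05C99"), ("GraphOfAFunction", "03E20")],
    "planar graph": [("PlanarGraph", "05C10")],
}


def class_distance(a, b):
    """Assumed distance: characters remaining after the shared prefix."""
    shared = 0
    for x, y in zip(a, b):
        if x != y:
            break
        shared += 1
    return max(len(a), len(b)) - shared


def link_terms(text, source_class, concept_map=CONCEPT_MAP):
    """Return (matched_text, entry_id) pairs for known terms found in text."""
    # Longest terms first, so "planar graph" is preferred over "graph".
    terms = sorted(concept_map, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(t) for t in terms) + r")\b", re.IGNORECASE
    )
    links = []
    for match in pattern.finditer(text):
        candidates = concept_map[match.group(1).lower()]
        # Disambiguate by choosing the candidate closest in subject class.
        best = min(candidates, key=lambda c: class_distance(c[1], source_class))
        links.append((match.group(1), best[0]))
    return links
```

Under these assumptions, an entry classified near graph theory links "graph" to the graph-theory entry, while an entry classified near set theory links the same word to the function-graph entry. No manual link markup is needed from the author in either case.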
Session
EDBT Demo Session 2: Demo Group 2 (Wednesday, March 25, 14:00–17:30)