Publications
Local, domain-independent heuristics for the FEIII challenge: Lessons and observations
Abstract
The recently concluded Financial Entity Identification and Information Integration (FEIII) competition is an example of a domain-specific entity linking challenge. Given a variety of datasets describing financial institutions, the goal of the competition was to link records that refer to the same underlying entity in a minimally supervised manner. In this paper, we present our solution to the challenge. Using local, domain-independent heuristics, namely thresholded block purging and a simple Jaccard matcher, we devised a solution that executes in less than a minute on the allotted tasks. Although the method did not achieve competitive precision, it was recall-friendly, suggesting that it is useful both as an easily implemented baseline and as a generic preprocessing step for more expensive, precision-friendly algorithms that are fine-tuned for specific domains.
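The two heuristics named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes token blocking (each whitespace-delimited token of a name acts as a blocking key), and the function names (`tokens`, `jaccard`, `link`), the parameters `max_block_size` and `min_similarity`, and their default values are all hypothetical choices made for illustration.

```python
from collections import defaultdict
from itertools import product


def tokens(name):
    """Lowercase a name and split it into a set of whitespace-delimited tokens."""
    return set(name.lower().split())


def jaccard(a, b):
    """Jaccard similarity of two sets: |a & b| / |a | b| (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def link(left, right, max_block_size=4, min_similarity=0.5):
    """Link names in `left` to names in `right` (both lists of strings).

    Assumed pipeline (illustrative only):
      1. Token blocking: each token is a blocking key.
      2. Thresholded block purging: drop blocks with more than
         `max_block_size` records, since very common tokens are uninformative.
      3. Jaccard matching: keep candidate pairs scoring >= `min_similarity`.
    """
    # Map each token to the record indices from each side that contain it.
    blocks = defaultdict(lambda: (set(), set()))
    for i, name in enumerate(left):
        for tok in tokens(name):
            blocks[tok][0].add(i)
    for j, name in enumerate(right):
        for tok in tokens(name):
            blocks[tok][1].add(j)

    # Collect cross-dataset candidate pairs from the blocks that survive purging.
    candidates = set()
    for left_ids, right_ids in blocks.values():
        if len(left_ids) + len(right_ids) > max_block_size:
            continue  # purge oversized block
        candidates.update(product(left_ids, right_ids))

    # Score each candidate pair and keep those above the similarity threshold.
    matches = []
    for i, j in sorted(candidates):
        score = jaccard(tokens(left[i]), tokens(right[j]))
        if score >= min_similarity:
            matches.append((i, j, score))
    return matches
```

Lowering `min_similarity` in this sketch admits more candidate pairs, which favors recall over precision; that trade-off is in the spirit of the recall-friendly behavior the abstract reports, although the thresholds here are arbitrary.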
- Date: June 26, 2016
- Authors: Mayank Kejriwal, Daniel P Miranker
- Book: Proceedings of the Second International Workshop on Data Science for Macro-Modeling
- Pages: 1-2