Lightweight Multilingual Entity Extraction and Linking
Title | Lightweight Multilingual Entity Extraction and Linking |
Publication Type | Conference Paper |
Year of Publication | 2017 |
Authors | Pappu, Aasish, Blanco, Roi, Mehdad, Yashar, Stent, Amanda, Thadani, Kapil |
Conference Name | Proceedings of the Tenth ACM International Conference on Web Search and Data Mining |
Publisher | ACM |
Conference Location | New York, NY, USA |
ISBN Number | 978-1-4503-4675-7 |
Keywords | clustering entities, composability, document processing, entity extraction, entity linking, Human Behavior, human factors, Metrics, natural language processing, pubcrawl, Scalability, text analytics, unsupervised learning |
Abstract | Text analytics systems often rely heavily on detecting and linking entity mentions in documents to knowledge bases for downstream applications such as sentiment analysis, question answering and recommender systems. A major challenge for this task is to be able to accurately detect entities in new languages with limited labeled resources. In this paper we present an accurate and lightweight, multilingual named entity recognition (NER) and linking (NEL) system. The contributions of this paper are three-fold: 1) Lightweight named entity recognition with competitive accuracy; 2) Candidate entity retrieval that uses search click-log data and entity embeddings to achieve high precision with a low memory footprint; and 3) efficient entity disambiguation. Our system achieves state-of-the-art performance on TAC KBP 2013 multilingual data and on English AIDA CONLL data. |
URL | https://dl.acm.org/citation.cfm?doid=3018661.3018724 |
DOI | 10.1145/3018661.3018724 |
Citation Key | pappu_lightweight_2017 |