Visible to the public Evaluation of Open Information Extraction Methods Using Reuters-21578 Database

TitleEvaluation of Open Information Extraction Methods Using Reuters-21578 Database
Publication TypeConference Paper
Year of Publication2018
AuthorsRodríguez, Juan M., Merlino, Hernán D., Pesado, Patricia, García-Martínez, Ramón
Conference NameProceedings of the 2Nd International Conference on Machine Learning and Soft Computing
Date PublishedFebruary 2018
PublisherACM
Conference LocationNew York, NY, USA
ISBN Number978-1-4503-6336-5
KeywordsAir gaps, clausIE, composability, Human Behavior, Knowledge extraction, Metrics, natural language processing, OIE, OLLIE, open information extraction, pubcrawl, resilience, Resiliency, reverb, self-supervised extraction, semantic relation extraction
Abstract

The following article shows the precision, the recall and the F1-measure for three knowledge extraction methods under Open Information Extraction paradigm. These methods are: ReVerb, OLLIE and ClausIE. For the calculation of these three measures, a representative sample of Reuters-21578 was used; 103 newswire texts were taken randomly from that database. A big discrepancy was observed, after analyzing the obtained results, between the expected and the observed precision for ClausIE. In order to save the observed gap in ClausIE precision, a simple improvement is proposed for the method. Although the correction improved the precision of Clausie, ReVerb turned out to be the most precise method; however ClausIE is the one with the better F1-measure.

URLhttps://dl.acm.org/doi/10.1145/3184066.3184099
DOI10.1145/3184066.3184099
Citation KeyrodriguezEvaluationOpenInformation2018