Visible to the public Learning Transferable Features For Open-Domain Question Answering

TitleLearning Transferable Features For Open-Domain Question Answering
Publication TypeConference Paper
Year of Publication2018
AuthorsZuin, Gianlucca, Chaimowicz, Luiz, Veloso, Adriano
Conference Name2018 International Joint Conference on Neural Networks (IJCNN)
ISBN Number978-1-5090-6014-6
KeywordsAdaptation models, Analogies and Transference, clustering algorithm, complementary data, Data models, Deep Networks, domain adaptation approaches, domain-adaptation, domain-specific features, domain-specific QA models, Human Behavior, human factors, information retrieval, Knowledge discovery, learning (artificial intelligence), Linear programming, natural language, natural language processing, open-domain QA models, open-domain question answering, open-domain Question-Answering models, pattern clustering, pubcrawl, question answering (information retrieval), Question-Answering, sentence-level QA models, single open-domain QA model, span-level QA benefits, Task Analysis, Training, training corpora, Training data, transfer learning, transferable feature learning, transferable features
Abstract

Corpora used to learn open-domain Question-Answering (QA) models are typically collected from a wide variety of topics or domains. Since QA requires understanding natural language, open-domain QA models generally need very large training corpora. A simple way to alleviate data demand is to restrict the domain covered by the QA model, leading thus to domain-specific QA models. While learning improved QA models for a specific domain is still challenging due to the lack of sufficient training data in the topic of interest, additional training data can be obtained from related topic domains. Thus, instead of learning a single open-domain QA model, we investigate domain adaptation approaches in order to create multiple improved domain-specific QA models. We demonstrate that this can be achieved by stratifying the source dataset, without the need of searching for complementary data unlike many other domain adaptation approaches. We propose a deep architecture that jointly exploits convolutional and recurrent networks for learning domain-specific features while transferring domain-shared features. That is, we use transferable features to enable model adaptation from multiple source domains. We consider different transference approaches designed to learn span-level and sentence-level QA models. We found that domain-adaptation greatly improves sentence-level QA performance, and span-level QA benefits from sentence information. Finally, we also show that a simple clustering algorithm may be employed when the topic domains are unknown and the resulting loss in accuracy is negligible.

URLhttps://ieeexplore.ieee.org/document/8489057
DOI10.1109/IJCNN.2018.8489057
Citation Keyzuin_learning_2018