Multi-Scale Deformable CNN for Answer Selection
Title | Multi-Scale Deformable CNN for Answer Selection |
Publication Type | Journal Article |
Year of Publication | 2019 |
Authors | Liu, Donglei, Niu, Zhendong, Zhang, Chunxia, Zhang, Jiadi |
Journal | IEEE Access |
Volume | 7 |
Pagination | 164986—164995 |
ISSN | 2169-3536 |
Keywords | Answer selection, answer selection task, automatic question answering system, compositionality, Computing Theory and Compositionality, convolution, convolutional neural nets, convolutional neural networks, deep neural networks, deformable convolution neural network, feature extraction, fixed-length convolutional kernels, Human Behavior, human factors, Kernel, Knowledge discovery, mine multiscale n-gram features, multiple deformable convolutional layers, multiscale deformable CNN, multiscale deformable convolutional neural network, n-gram models, natural language, natural language processing, nonconsecutive n-gram features, nonconsecutive words, pubcrawl, question answering, question answering (information retrieval), question answering systems, Semantics, sentence modeling, Task Analysis, text analysis, text features, traditional CNN, variable length n-gram features |
Abstract | The answer selection task is one of the most important issues within the automatic question answering system, and it aims to automatically find accurate answers to questions. Traditional methods for this task use manually generated features based on tf-idf and n-gram models to represent texts, and then select the right answers according to the similarity between the representations of questions and the candidate answers. Nowadays, many question answering systems adopt deep neural networks such as convolutional neural network (CNN) to generate the text features automatically, and obtained better performance than traditional methods. CNN can extract consecutive n-gram features with fixed length by sliding fixed-length convolutional kernels over the whole word sequence. However, due to the complex semantic compositionality of the natural language, there are many phrases with variable lengths and be composed of non-consecutive words in natural language, such as these phrases whose constituents are separated by other words within the same sentences. But the traditional CNN is unable to extract the variable length n-gram features and non-consecutive n-gram features. In this paper, we propose a multi-scale deformable convolutional neural network to capture the non-consecutive n-gram features by adding offset to the convolutional kernel, and also propose to stack multiple deformable convolutional layers to mine multi-scale n-gram features by the means of generating longer n-gram in higher layer. Furthermore, we apply the proposed model into the task of answer selection. Experimental results on public dataset demonstrate the effectiveness of our proposed model in answer selection. |
URL | https://ieeexplore.ieee.org/document/8896933 |
DOI | 10.1109/ACCESS.2019.2953219 |
Citation Key | liu_multi-scale_2019 |
- question answering (information retrieval)
- multiscale deformable convolutional neural network
- n-gram models
- natural language
- natural language processing
- nonconsecutive n-gram features
- nonconsecutive words
- pubcrawl
- question answering
- multiscale deformable CNN
- question answering systems
- Semantics
- sentence modeling
- Task Analysis
- text analysis
- text features
- traditional CNN
- variable length n-gram features
- convolutional neural networks
- Human behavior
- Human Factors
- Computing Theory and Compositionality
- Answer selection
- answer selection task
- automatic question answering system
- convolution
- convolutional neural nets
- Compositionality
- deep neural networks
- deformable convolution neural network
- feature extraction
- fixed-length convolutional kernels
- Kernel
- Knowledge Discovery
- mine multiscale n-gram features
- multiple deformable convolutional layers