Title | Adversarial Audio Detection Method Based on Transformer |
Publication Type | Conference Paper |
Year of Publication | 2022 |
Authors | Li, Yunchen, Luo, Da |
Conference Name | 2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE) |
Keywords | adversarial detection, Black Box Attacks, composability, feature extraction, machine learning, Metrics, Modeling, Noise measurement, Position Encoding, pubcrawl, Resiliency, security, Self-Attention, Speech recognition, Transformers |
Abstract | Speech recognition technology has been applied to all aspects of our daily life, but it faces many security issues. One of the major threats is the adversarial audio examples, which may tamper the recognition results of the acoustic speech recognition system (ASR). In this paper, we propose an adversarial detection framework to detect adversarial audio examples. The method is based on the transformer self-attention mechanism. Spectrogram features are extracted from the audio and divided into patches. Position information are embedded and then fed into transformer encoder. Experimental results show that the method achieves good performance with the detection accuracy of above 96.5% under the white-box attacks and blackbox attacks, and noisy circumstances. Even when detecting adversarial examples generated by the unknown attacks, it also achieves satisfactory results. |
DOI | 10.1109/MLISE57402.2022.00023 |
Citation Key | li_adversarial_2022 |