Visible to the public Adversarial Audio Detection Method Based on Transformer

TitleAdversarial Audio Detection Method Based on Transformer
Publication TypeConference Paper
Year of Publication2022
AuthorsLi, Yunchen, Luo, Da
Conference Name2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)
Keywordsadversarial detection, Black Box Attacks, composability, feature extraction, machine learning, Metrics, Modeling, Noise measurement, Position Encoding, pubcrawl, Resiliency, security, Self-Attention, Speech recognition, Transformers
AbstractSpeech recognition technology has been applied to all aspects of our daily life, but it faces many security issues. One of the major threats is the adversarial audio examples, which may tamper the recognition results of the acoustic speech recognition system (ASR). In this paper, we propose an adversarial detection framework to detect adversarial audio examples. The method is based on the transformer self-attention mechanism. Spectrogram features are extracted from the audio and divided into patches. Position information are embedded and then fed into transformer encoder. Experimental results show that the method achieves good performance with the detection accuracy of above 96.5% under the white-box attacks and blackbox attacks, and noisy circumstances. Even when detecting adversarial examples generated by the unknown attacks, it also achieves satisfactory results.
DOI10.1109/MLISE57402.2022.00023
Citation Keyli_adversarial_2022