Visible to the public Spoofed Voice Detection using Dense Features of STFT and MDCT Spectrograms

TitleSpoofed Voice Detection using Dense Features of STFT and MDCT Spectrograms
Publication TypeConference Paper
Year of Publication2021
AuthorsSaleem, Summra, Dilawari, Aniqa, Khan, Usman Ghani
Conference Name2021 International Conference on Artificial Intelligence (ICAI)
Keywordsattestation, authentication, CNN, composability, discrete cosine transforms, feature extraction, Forensics, Forgery, Fourier transforms, Hann window, Human Behavior, MDCT, pubcrawl, recognition, Resiliency, spectrum, Speech recognition, spoofed voices, STFT
AbstractAttestation of audio signals for recognition of forgery in voice is challenging task. In this research work, a deep convolutional neural network (CNN) is utilized to detect audio operations i.e. pitch shifted and amplitude varied signals. Short-time Fourier transform (STFT) and Modified Discrete Cosine Transform (MDCT) features are chosen for audio processing and their plotted patterns are fed to CNN. Experimental results show that our model can successfully distinguish tampered signals to facilitate the audio authentication on TIMIT dataset. Proposed CNN architecture can distinguish spoofed voices of shifting pitch with accuracy of 97.55% and of varying amplitude with accuracy of 98.85%.
DOI10.1109/ICAI52203.2021.9445259
Citation Keysaleem_spoofed_2021