Edit Detection in Speech Recordings via Instantaneous Electric Network Frequency Variations
Title | Edit Detection in Speech Recordings via Instantaneous Electric Network Frequency Variations |
Publication Type | Journal Article |
Year of Publication | 2014 |
Authors | Andrade Esquef, P.A., Apolinario, J.A., Biscainho, L.W.P. |
Journal | Information Forensics and Security, IEEE Transactions on |
Volume | 9 |
Pagination | 2314-2326 |
Date Published | Dec |
ISSN | 1556-6013 |
Keywords | Acoustic signal processing, Acoustical signal processing, audio edit events, audio recording, broadband background noise, detection criterion, digital forensics, distinct annotated database, edit detection, EER, electrical network frequency analysis, ENF analysis, equal error rate, forensic audio analysis, frequency estimation, instantaneous electric network frequency variations, instantaneous frequency, Noise measurement, nominal frequency, originally noisy database signals, Signal processing, signal processing chain, Spectral analysis, speech processing, speech recordings, voice activity detection |
Abstract | In this paper, an edit detection method for forensic audio analysis is proposed. It develops and improves a previous method through changes in the signal processing chain and a novel detection criterion. As with the original method, electrical network frequency (ENF) analysis is central to the novel edit detector, for it allows monitoring anomalous variations of the ENF related to audio edit events. Working in unsupervised manner, the edit detector compares the extent of ENF variations, centered at its nominal frequency, with a variable threshold that defines the upper limit for normal variations observed in unedited signals. The ENF variations caused by edits in the signal are likely to exceed the threshold providing a mechanism for their detection. The proposed method is evaluated in both qualitative and quantitative terms via two distinct annotated databases. Results are reported for originally noisy database signals as well as versions of them further degraded under controlled conditions. A comparative performance evaluation, in terms of equal error rate (EER) detection, reveals that, for one of the tested databases, an improvement from 7% to 4% EER is achieved, respectively, from the original to the new edit detection method. When the signals are amplitude clipped or corrupted by broadband background noise, the performance figures of the novel method follow the same profile of those of the original method. |
DOI | 10.1109/TIFS.2014.2363524 |
Citation Key | 6926817 |
- forensic audio analysis
- voice activity detection
- speech recordings
- speech processing
- Spectral analysis
- signal processing chain
- signal processing
- originally noisy database signals
- nominal frequency
- Noise measurement
- instantaneous frequency
- instantaneous electric network frequency variations
- frequency estimation
- Acoustic signal processing
- equal error rate
- ENF analysis
- electrical network frequency analysis
- EER
- edit detection
- distinct annotated database
- Digital Forensics
- detection criterion
- broadband background noise
- audio recording
- audio edit events
- Acoustical signal processing