A Dynamic Matching Algorithm for Audio Timestamp Identification Using the ENF Criterion
Title | A Dynamic Matching Algorithm for Audio Timestamp Identification Using the ENF Criterion |
Publication Type | Journal Article |
Year of Publication | 2014 |
Authors | Guang Hua, Goh, J., Thing, V.L.L. |
Journal | Information Forensics and Security, IEEE Transactions on |
Volume | 9 |
Pagination | 1045-1055 |
Date Published | July |
ISSN | 1556-6013 |
Keywords | audio authentication, audio forensics, audio recording, audio timestamp identification, autocorrection process, Correlation, correlation methods, dynamic matching, Electric network frequency (ENF), electric network frequency criterion, ENF criterion, Estimation, extracted ENF signal, Fourier transforms, frequency estimates, frequency estimation, frequency resolution problems, maximum correlation coefficient, mean square error methods, minimum mean squared error, MMSE, reference data, short recording durations, short-time Fourier transform, Signal resolution, Signal to noise ratio, signal-to-noise ratio, STFT, Time-frequency Analysis, timestamp identification, window size |
Abstract | The electric network frequency (ENF) criterion is a recently developed technique for audio timestamp identification, which involves the matching between extracted ENF signal and reference data. For nearly a decade, conventional matching criterion has been based on the minimum mean squared error (MMSE) or maximum correlation coefficient. However, the corresponding performance is highly limited by low signal-to-noise ratio, short recording durations, frequency resolution problems, and so on. This paper presents a threshold-based dynamic matching algorithm (DMA), which is capable of autocorrecting the noise affected frequency estimates. The threshold is chosen according to the frequency resolution determined by the short-time Fourier transform (STFT) window size. A penalty coefficient is introduced to monitor the autocorrection process and finally determine the estimated timestamp. It is then shown that the DMA generalizes the conventional MMSE method. By considering the mainlobe width in the STFT caused by limited frequency resolution, the DMA achieves improved identification accuracy and robustness against higher levels of noise and the offset problem. Synthetic performance analysis and practical experimental results are provided to illustrate the advantages of the DMA. |
DOI | 10.1109/TIFS.2014.2321228 |
Citation Key | 6808537 |
- frequency estimation
- window size
- timestamp identification
- Time-frequency Analysis
- STFT
- signal-to-noise ratio
- Signal to noise ratio
- Signal resolution
- short-time Fourier transform
- short recording durations
- reference data
- MMSE
- minimum mean squared error
- mean square error methods
- maximum correlation coefficient
- frequency resolution problems
- audio authentication
- frequency estimates
- Fourier transforms
- extracted ENF signal
- estimation
- ENF criterion
- electric network frequency criterion
- Electric network frequency (ENF)
- dynamic matching
- correlation methods
- Correlation
- autocorrection process
- audio timestamp identification
- audio recording
- audio forensics