Visible to the public A general framework for dictionary based audio fingerprinting

TitleA general framework for dictionary based audio fingerprinting
Publication TypeConference Paper
Year of Publication2014
AuthorsMoussallam, M., Daudet, L.
Conference NameAcoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Date PublishedMay
KeywordsAtomic clocks, Audio fingerprinting, audio signal processing, concurrent objectives, Dictionaries, dictionary based audio fingerprinting, Entropy, fingerprint identification, Fingerprint recognition, fingerprint-based audio recognition system, general framework, greedy algorithms, Pareto distribution, Pareto-like continuum, penalized sparse representation problem, probabilistic distribution, Robustness, Sparse Representation, sparse support, Speech, structured sparsity, suboptimal greedy algorithm, Time-frequency Analysis
Abstract

Fingerprint-based Audio recognition system must address concurrent objectives. Indeed, fingerprints must be both robust to distortions and discriminative while their dimension must remain to allow fast comparison. This paper proposes to restate these objectives as a penalized sparse representation problem. On top of this dictionary-based approach, we propose a structured sparsity model in the form of a probabilistic distribution for the sparse support. A practical suboptimal greedy algorithm is then presented and evaluated on robustness and recognition tasks. We show that some existing methods can be seen as particular cases of this algorithm and that the general framework allows to reach other points of a Pareto-like continuum.

DOI10.1109/ICASSP.2014.6854166
Citation Key6854166