AudioGest: Enabling Fine-grained Hand Gesture Detection by Decoding Echo Signal

Title: AudioGest: Enabling Fine-grained Hand Gesture Detection by Decoding Echo Signal
Publication Type: Conference Paper
Year of Publication: 2016
Authors: Ruan, Wenjie; Sheng, Quan Z.; Yang, Lei; Gu, Tao; Xu, Peipei; Shangguan, Longfei
Conference Name: Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing
Publisher: ACM
Conference Location: New York, NY, USA
ISBN Number: 978-1-4503-4461-6
Keywords: audio, Doppler effect, FFT, hand gestures, microphone, pubcrawl170201
Abstract: Hand gesture is becoming an increasingly popular means of interacting with consumer electronic devices, such as mobile phones, tablets and laptops. In this paper, we present AudioGest, a device-free gesture recognition system that can accurately sense the hand's in-air movement around the user's devices. Compared to the state-of-the-art, AudioGest is superior in using only one pair of built-in speaker and microphone, without any extra hardware or infrastructure support and with no training, to achieve fine-grained hand detection. Our system is able to accurately recognize various hand gestures and estimate the hand in-air time, as well as average moving speed and waving range. We achieve this by transforming the device into an active sonar system that transmits an inaudible audio signal and decodes the echoes of the hand at its microphone. We address various challenges including cleaning the noisy reflected sound signal, interpreting the echo spectrogram into hand gestures, decoding the Doppler frequency shifts into the hand waving speed and range, as well as being robust to the environmental motion and signal drifting. We implement the proof-of-concept prototype in three different electronic devices and extensively evaluate the system in four real-world scenarios using 3,900 hand gestures collected by five users for more than two weeks. Our results show that AudioGest can detect six hand gestures with an accuracy up to 96%, and by distinguishing the gesture attributions, it can provide up to 162 control commands for various applications.
URL: http://doi.acm.org/10.1145/2971648.2971736
DOI: 10.1145/2971648.2971736
Citation Key: ruan_audiogest:_2016
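
The abstract mentions decoding Doppler frequency shifts into hand waving speed. As a rough illustration of that idea only, and not the authors' implementation, the sketch below applies the standard small-velocity echo relation v = c * Δf / (2 * f0) to a synthetic tone. The 19 kHz carrier, the 44.1 kHz sample rate, the 343 m/s speed of sound, and the helper names `doppler_speed` and `peak_frequency` are all assumptions made for this example.

```python
import cmath
import math

SOUND_SPEED_M_S = 343.0  # assumed speed of sound in air at roughly 20 °C


def doppler_speed(shift_hz, carrier_hz, sound_speed=SOUND_SPEED_M_S):
    """Radial speed (m/s) of a reflector from the Doppler shift of its echo.

    Uses the two-way (echo) approximation, valid when the reflector moves
    much slower than sound. Positive means moving toward the device.
    """
    return sound_speed * shift_hz / (2.0 * carrier_hz)


def peak_frequency(samples, sample_rate, candidates_hz):
    """Return the candidate frequency with the largest single-bin DFT magnitude."""
    def magnitude(freq_hz):
        step = -2j * math.pi * freq_hz / sample_rate
        return abs(sum(s * cmath.exp(step * n) for n, s in enumerate(samples)))
    return max(candidates_hz, key=magnitude)


# Synthetic echo: a hypothetical 19 kHz carrier shifted up by 40 Hz.
sample_rate = 44100
carrier_hz = 19000.0
echo_hz = 19040.0
samples = [math.sin(2 * math.pi * echo_hz * n / sample_rate) for n in range(4410)]

# Scan a narrow band around the carrier in 10 Hz steps (0.1 s window).
candidates = [carrier_hz + 10.0 * k for k in range(-10, 11)]
estimated_hz = peak_frequency(samples, sample_rate, candidates)
speed = doppler_speed(estimated_hz - carrier_hz, carrier_hz)
print(f"peak {estimated_hz:.0f} Hz, speed {speed:.3f} m/s")
```

A real recording would also contain the much stronger direct speaker-to-microphone signal at the carrier frequency, plus noise and environmental motion, which is why the abstract lists signal cleaning and robustness among the challenges; this sketch skips all of that.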