EchoPrint: Two-Factor Authentication Using Acoustics and Vision on Smartphones
Title | EchoPrint: Two-Factor Authentication Using Acoustics and Vision on Smartphones |
Publication Type | Conference Paper |
Year of Publication | 2018 |
Authors | Zhou, Bing, Lohokare, Jay, Gao, Ruipeng, Ye, Fan |
Conference Name | Proceedings of the 24th Annual International Conference on Mobile Computing and Networking |
Publisher | ACM |
Conference Location | New York, NY, USA |
ISBN Number | 978-1-4503-5903-0 |
Keywords | Acoustics, authentication, Human Behavior, human factors, mobile sensing, pubcrawl, Two factor Authentication |
Abstract | User authentication on smartphones must satisfy both security and convenience, an inherently difficult balancing art. Apple's FaceID is arguably the latest of such efforts, at the cost of additional hardware (e.g., dot projector, flood illuminator and infrared camera). We propose a novel user authentication system EchoPrint, which leverages acoustics and vision for secure and convenient user authentication, without requiring any special hardware. EchoPrint actively emits almost inaudible acoustic signals from the earpiece speaker to "illuminate" the user's face and authenticates the user by the unique features extracted from the echoes bouncing off the 3D facial contour. To combat changes in phone-holding poses thus echoes, a Convolutional Neural Network (CNN) is trained to extract reliable acoustic features, which are further combined with visual facial landmark locations to feed a binary Support Vector Machine (SVM) classifier for final authentication. Because the echo features depend on 3D facial geometries, EchoPrint is not easily spoofed by images or videos like 2D visual face recognition systems. It needs only commodity hardware, thus avoiding the extra costs of special sensors in solutions like FaceID. Experiments with 62 volunteers and non-human objects such as images, photos, and sculptures show that EchoPrint achieves 93.75% balanced accuracy and 93.50% F-score, while the average precision is 98.05%, and no image/video based attack is observed to succeed in spoofing. |
URL | https://dl.acm.org/citation.cfm?doid=3241539.3241575 |
DOI | 10.1145/3241539.3241575 |
Citation Key | zhou_echoprint:_2018 |