Audio-CAPTCHA with distinction between random phoneme sequences and words spoken by multi-speaker

Title: Audio-CAPTCHA with distinction between random phoneme sequences and words spoken by multi-speaker
Publication Type: Conference Paper
Year of Publication: 2017
Authors: Yamaguchi, M., Kikuchi, H.
Conference Name: 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC)
Keywords: CAPTCHA, captchas, composability, distortion, Human Behavior, human beings, human factors, Proposals, pubcrawl, Semantics, Speech, Speech recognition, usability
Abstract: Audio-CAPTCHA prevents malicious bots from attacking Web services while providing Web accessibility for visually impaired persons. Most conventional methods add statistical noise to distort the sounds and require users to remember and spell the words, which is difficult and laborious for humans. In this paper, we instead exploit the difficulty of speaker-independent recognition for ASR machines rather than distortion with statistical noise. Our scheme synthesizes varied voices by changing the speed, pitch, and native language of the speakers. Moreover, we employ semantic identification problems that distinguish random phoneme sequences from meaningful words, releasing users from remembering and spelling words and thereby improving human accuracy and usability. We also evaluated our scheme in several experiments.
DOI: 10.1109/SMC.2017.8123098
Citation Key: yamaguchi_audio-captcha_2017
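
The abstract describes two ideas: synthesizing each prompt with varied speed, pitch, and speaker language, and asking the listener whether the prompt is a meaningful word or a random phoneme sequence. Below is a minimal sketch of that challenge-generation step, not the authors' implementation; the word list, phoneme inventory, and parameter ranges are illustrative assumptions, and the actual text-to-speech synthesis is left to an external engine.

    # Minimal sketch (hypothetical, not the paper's code): build one challenge item
    # that is either a meaningful word or a random phoneme sequence of similar
    # length, so the listener must judge whether the synthesized audio "means something".
    import random

    # Illustrative word list and phoneme inventory; the paper's actual stimuli differ.
    WORDS = ["apple", "window", "garden", "music", "river"]
    PHONEMES = ["ka", "to", "mi", "ru", "se", "no", "ba", "chi"]

    def make_challenge(rng: random.Random):
        """Return (text_to_synthesize, is_meaningful_word)."""
        if rng.random() < 0.5:
            return rng.choice(WORDS), True
        # Random phoneme sequence: pronounceable but, with high probability, meaningless.
        length = rng.randint(2, 4)
        return "".join(rng.choice(PHONEMES) for _ in range(length)), False

    if __name__ == "__main__":
        rng = random.Random(0)
        text, is_word = make_challenge(rng)
        # Per-item synthesis parameters, varied as the abstract describes; passing them
        # to a real TTS engine (speed, pitch, speaker's native language) is omitted here.
        params = {"speed": rng.uniform(0.8, 1.2), "pitch_shift": rng.uniform(-2.0, 2.0)}
        print(text, is_word, params)

The expected response from a human is the single judgment "word" or "not a word", rather than a transcription, which is the usability gain the abstract claims over spell-back audio CAPTCHAs.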