Visible to the public Denoising and Verification Cross-Layer Ensemble Against Black-box Adversarial Attacks

TitleDenoising and Verification Cross-Layer Ensemble Against Black-box Adversarial Attacks
Publication TypeConference Paper
Year of Publication2019
AuthorsChow, Ka-Ho, Wei, Wenqi, Wu, Yanzhao, Liu, Ling
Conference Name2019 IEEE International Conference on Big Data (Big Data)
Keywordsadversarial deep learning, adversarial examples, adversarial inputs, benign inputs, black-box adversarial attacks, composability, Cross Layer Security, cross-layer model diversity ensemble framework, deep neural networks, defense success rates, defense-attack arms race, DNNs, ensemble defense, ensemble diversity, learning (artificial intelligence), machine learning tasks, Manifolds, MODEF, neural nets, Neural networks, noise reduction, Predictive models, pubcrawl, representative attacks, Resiliency, Robustness, security of data, supervised model verification ensemble, Testing, Training, unsupervised model, verification cross-layer ensemble
AbstractDeep neural networks (DNNs) have demonstrated impressive performance on many challenging machine learning tasks. However, DNNs are vulnerable to adversarial inputs generated by adding maliciously crafted perturbations to the benign inputs. As a growing number of attacks have been reported to generate adversarial inputs of varying sophistication, the defense-attack arms race has been accelerated. In this paper, we present MODEF, a cross-layer model diversity ensemble framework. MODEF intelligently combines unsupervised model denoising ensemble with supervised model verification ensemble by quantifying model diversity, aiming to boost the robustness of the target model against adversarial examples. Evaluated using eleven representative attacks on popular benchmark datasets, we show that MODEF achieves remarkable defense success rates, compared with existing defense methods, and provides a superior capability of repairing adversarial inputs and making correct predictions with high accuracy in the presence of black-box attacks.
DOI10.1109/BigData47090.2019.9006090
Citation Keychow_denoising_2019