On the Investigation of Essential Diversities for Deep Learning Testing Criteria
Title | On the Investigation of Essential Diversities for Deep Learning Testing Criteria |
Publication Type | Conference Paper |
Year of Publication | 2019 |
Authors | Zhang, Z., Xie, X. |
Conference Name | 2019 IEEE 19th International Conference on Software Quality, Reliability and Security (QRS) |
Date Published | jul |
Keywords | Deep Learning, deep learning models, deep learning systems, Deep Learning testing, deep learning testing criteria, defined metrics, erroneous behaviors, essential diversities, essential metrics, fault detection, fault detection ability, fault diagnosis, image retrieval, learning (artificial intelligence), Measurement, metamorphic testing, Metrics, metrics testing, neuron activities, Neurons, program testing, pubcrawl, reliability, system robustness, Task Analysis, test diversities, test suites, Testing, testing criteria |
Abstract | Recent years, more and more testing criteria for deep learning systems has been proposed to ensure system robustness and reliability. These criteria were defined based on different perspectives of diversity. However, there lacks comprehensive investigation on what are the most essential diversities that should be considered by a testing criteria for deep learning systems. Therefore, in this paper, we conduct an empirical study to investigate the relation between test diversities and erroneous behaviors of deep learning models. We define five metrics to reflect diversities in neuron activities, and leverage metamorphic testing to detect erroneous behaviors. We investigate the correlation between metrics and erroneous behaviors. We also go further step to measure the quality of test suites under the guidance of defined metrics. Our results provided comprehensive insights on the essential diversities for testing criteria to exhibit good fault detection ability. |
DOI | 10.1109/QRS.2019.00056 |
Citation Key | zhang_investigation_2019 |
- Measurement
- testing criteria
- testing
- test suites
- test diversities
- Task Analysis
- system robustness
- Reliability
- pubcrawl
- program testing
- Neurons
- neuron activities
- Metrics
- metamorphic testing
- metrics testing
- deep learning
- learning (artificial intelligence)
- image retrieval
- fault diagnosis
- fault detection ability
- fault detection
- essential metrics
- essential diversities
- erroneous behaviors
- defined metrics
- deep learning testing criteria
- Deep Learning testing
- deep learning systems
- deep learning models