Cross-Entropy: A New Metric for Software Defect Prediction
Title | Cross-Entropy: A New Metric for Software Defect Prediction |
Publication Type | Conference Paper |
Year of Publication | 2018 |
Authors | Zhang, Xian, Ben, Kerong, Zeng, Jie |
Conference Name | 2018 IEEE International Conference on Software Quality, Reliability and Security (QRS) |
Date Published | jul |
ISBN Number | 978-1-5386-7757-5 |
Keywords | code metric, code naturalness, Computer bugs, cross-entropy, Deep Learning, DefectLearner, Entropy, language model, Measurement, Metrics, natural language processing, Predictive models, predictive security metrics, program debugging, pubcrawl, recurrent neural nets, recurrent neural network language model, Semantics, Software, software bugs, software defect prediction, software metrics, software quality, software quality assurance, software repository, source code, source code (software), Task Analysis |
Abstract | Defect prediction is an active topic in software quality assurance, which can help developers find potential bugs and make better use of resources. To improve prediction performance, this paper introduces cross-entropy, one common measure for natural language, as a new code metric into defect prediction tasks and proposes a framework called DefectLearner for this process. We first build a recurrent neural network language model to learn regularities in source code from software repository. Based on the trained model, the cross-entropy of each component can be calculated. To evaluate the discrimination for defect-proneness, cross-entropy is compared with 20 widely used metrics on 12 open-source projects. The experimental results show that cross-entropy metric is more discriminative than 50% of the traditional metrics. Besides, we combine cross-entropy with traditional metric suites together for accurate defect prediction. With cross-entropy added, the performance of prediction models is improved by an average of 2.8% in F1-score. |
URL | https://ieeexplore.ieee.org/document/8424963 |
DOI | 10.1109/QRS.2018.00025 |
Citation Key | zhang_cross-entropy:_2018 |
- pubcrawl
- Task Analysis
- source code (software)
- source code
- software repository
- software quality assurance
- software quality
- software metrics
- software defect prediction
- software bugs
- Software
- Semantics
- recurrent neural network language model
- recurrent neural nets
- code metric
- program debugging
- predictive security metrics
- Predictive models
- natural language processing
- Metrics
- Measurement
- language model
- Entropy
- DefectLearner
- deep learning
- cross-entropy
- Computer bugs
- code naturalness