Software defect prediction based on non-linear manifold learning and hybrid deep learning techniques

Kun Zhu, Nana Zhang, Qing Zhang, Shi Ying*, Xu Wang

*Corresponding author for this work

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

Software defect prediction plays a very important role in software quality assurance, which aims to inspect as many potentially defect-prone software modules as possible. However, the performance of the prediction model is susceptible to high dimensionality of the dataset that contains irrelevant and redundant features. In addition, software metrics for software defect prediction are almost entirely traditional features compared to the deep semantic feature representation from deep learning techniques. To address these two issues, we propose the following two solutions in this paper: (1) We leverage a novel non-linear manifold learning method - SOINN Landmark Isomap (SL-Isomap) to extract the representative features by selecting automatically the reasonable number and position of landmarks, which can reveal the complex intrinsic structure hidden behind the defect data. (2) We propose a novel defect prediction model named DLDD based on hybrid deep learning techniques, which leverages denoising autoencoder to learn true input features that are not contaminated by noise, and utilizes deep neural network to learn the abstract deep semantic features. We combine the squared error loss function of denoising autoencoder with the cross entropy loss function of deep neural network to achieve the best prediction performance by adjusting a hyperparameter. We compare the SL-Isomap with seven state-of-the-art feature extraction methods and compare the DLDD model with six baseline models across 20 open source software projects. The experimental results verify that the superiority of SL-Isomap and DLDD on four evaluation indicators.

Original languageEnglish
Pages (from-to)1467-1486
Number of pages20
JournalComputers, Materials and Continua
Volume65
Issue number2
DOIs
Publication statusPublished - 20 Aug 2020

Funding

Funding Statement: This work is supported in part by the National Science Foundation of China (Grant Nos. 61672392, 61373038), and in part by the National Key Research and Development Program of China (Grant No. 2016YFC1202204).

FundersFunder number
National Natural Science Foundation of China61672392, 61373038
National Natural Science Foundation of China
National Key Research and Development Program of China2016YFC1202204
National Key Research and Development Program of China

    Keywords

    • Deep learning
    • Deep neural network
    • Denoising autoencoder
    • Loss function
    • Non-linear manifold learning
    • Software defect prediction

    Fingerprint

    Dive into the research topics of 'Software defect prediction based on non-linear manifold learning and hybrid deep learning techniques'. Together they form a unique fingerprint.

    Cite this