Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique

Duplicate bug report detection is a process of finding a duplicate bug report in the bug tracking system. This process is essential to avoid unnecessary work and rediscovery. In typical bug tracking systems, more than thousands of duplicate bug reports are reported every day. In turn, human cost, ef...

Full description

Bibliographic Details
Main Authors: Ashima Kukkar, Rajni Mohana, Yugal Kumar, Anand Nayyar, Muhammad Bilal, Kyung-Sup Kwak
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9235309/
id doaj-63765ec59e4c4396b5e4d6d7b4cd066f
record_format Article
spelling doaj-63765ec59e4c4396b5e4d6d7b4cd066f2021-03-30T04:29:54ZengIEEEIEEE Access2169-35362020-01-01820074920076310.1109/ACCESS.2020.30330459235309Duplicate Bug Report Detection and Classification System Based on Deep Learning TechniqueAshima Kukkar0Rajni Mohana1Yugal Kumar2Anand Nayyar3https://orcid.org/0000-0002-9821-6146Muhammad Bilal4https://orcid.org/0000-0003-4221-0877Kyung-Sup Kwak5https://orcid.org/0000-0002-9559-4352Department of Computer Science, Jaypee University of Information Technology, Wakanghat, IndiaDepartment of Computer Science, Jaypee University of Information Technology, Wakanghat, IndiaDepartment of Computer Science, Jaypee University of Information Technology, Wakanghat, IndiaGraduate School, Duy Tan University, Da Nang, VietnamComputer and Electronics Systems Engineering Department, Hankuk University of Foreign Studies, Seoul, South KoreaInformation and Communication Engineering, Inha University, Incheon, South KoreaDuplicate bug report detection is a process of finding a duplicate bug report in the bug tracking system. This process is essential to avoid unnecessary work and rediscovery. In typical bug tracking systems, more than thousands of duplicate bug reports are reported every day. In turn, human cost, effort and time are increased. This makes it an important problem in the software management process. The solution is to automate the duplicate bug report detection system for reducing the manual effort, thus the productivity of triager's and developer's is increased. It also speeds up the process of software management as a result software maintenance cost is also reduced. However, existing systems are not quite accurate yet, in spite of these systems used various machine learning approaches. In this work, an automatic bug report detection and classification model is proposed using deep learning technique. The proposed system has three modules i.e. Preprocessing, Deep Learning Model and Duplicate Bug report Detection and Classification. Further, the proposed model used Convolutional Neural Network based deep learning model to extract relevant feature. These relevant features are used to determine the similar features of bug reports. Hence, the bug reports similarity is computers through these similar features. The performance of the proposed system is evaluated on six publicly available datasets using six performance metrics. It is noticed that the proposed system outperforms the existing systems by achieving an accuracy rate in the range of 85% to 99 % and recall@k rate in between 79%-94%. Moreover, the effectiveness of the proposed system is also measured on the cross training datasets of same and different domain. The proposed system achieves a good high accuracy rate for same domain data sets and low accuracy rate for different domain datasets.https://ieeexplore.ieee.org/document/9235309/Duplicate bug report detectionSiamese networksnatural language processingdeep learningbug tracking systemsoftware maintenance
collection DOAJ
language English
format Article
sources DOAJ
author Ashima Kukkar
Rajni Mohana
Yugal Kumar
Anand Nayyar
Muhammad Bilal
Kyung-Sup Kwak
spellingShingle Ashima Kukkar
Rajni Mohana
Yugal Kumar
Anand Nayyar
Muhammad Bilal
Kyung-Sup Kwak
Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique
IEEE Access
Duplicate bug report detection
Siamese networks
natural language processing
deep learning
bug tracking system
software maintenance
author_facet Ashima Kukkar
Rajni Mohana
Yugal Kumar
Anand Nayyar
Muhammad Bilal
Kyung-Sup Kwak
author_sort Ashima Kukkar
title Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique
title_short Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique
title_full Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique
title_fullStr Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique
title_full_unstemmed Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique
title_sort duplicate bug report detection and classification system based on deep learning technique
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Duplicate bug report detection is a process of finding a duplicate bug report in the bug tracking system. This process is essential to avoid unnecessary work and rediscovery. In typical bug tracking systems, more than thousands of duplicate bug reports are reported every day. In turn, human cost, effort and time are increased. This makes it an important problem in the software management process. The solution is to automate the duplicate bug report detection system for reducing the manual effort, thus the productivity of triager's and developer's is increased. It also speeds up the process of software management as a result software maintenance cost is also reduced. However, existing systems are not quite accurate yet, in spite of these systems used various machine learning approaches. In this work, an automatic bug report detection and classification model is proposed using deep learning technique. The proposed system has three modules i.e. Preprocessing, Deep Learning Model and Duplicate Bug report Detection and Classification. Further, the proposed model used Convolutional Neural Network based deep learning model to extract relevant feature. These relevant features are used to determine the similar features of bug reports. Hence, the bug reports similarity is computers through these similar features. The performance of the proposed system is evaluated on six publicly available datasets using six performance metrics. It is noticed that the proposed system outperforms the existing systems by achieving an accuracy rate in the range of 85% to 99 % and recall@k rate in between 79%-94%. Moreover, the effectiveness of the proposed system is also measured on the cross training datasets of same and different domain. The proposed system achieves a good high accuracy rate for same domain data sets and low accuracy rate for different domain datasets.
topic Duplicate bug report detection
Siamese networks
natural language processing
deep learning
bug tracking system
software maintenance
url https://ieeexplore.ieee.org/document/9235309/
work_keys_str_mv AT ashimakukkar duplicatebugreportdetectionandclassificationsystembasedondeeplearningtechnique
AT rajnimohana duplicatebugreportdetectionandclassificationsystembasedondeeplearningtechnique
AT yugalkumar duplicatebugreportdetectionandclassificationsystembasedondeeplearningtechnique
AT anandnayyar duplicatebugreportdetectionandclassificationsystembasedondeeplearningtechnique
AT muhammadbilal duplicatebugreportdetectionandclassificationsystembasedondeeplearningtechnique
AT kyungsupkwak duplicatebugreportdetectionandclassificationsystembasedondeeplearningtechnique
_version_ 1724181651823525888