Performance Evaluation of Some Machine Learning Algorithms in NASA Defect Prediction Data Sets

Küçük Resim Yok

Tarih

2020

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Ieee

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Araştırma projeleri

Organizasyon Birimleri

Dergi sayısı

Özet

The main purpose of machine learning is to model the systems making predictions by using some mathematical and operational features on the data with computers [1].Today, there are many studies on machine learning in all areas of the software world. Software Defect Prediction is a sub-branch that progresses rapidly in machine learning. In this study, five of machine learning classification algorithms were conducted on with PYTHON programming language on defect prediction data sets which are JM1, KC1, CM1, PC1 in the PROMISE repository. These data sets are created within the scope of the publicly available NASA institution's Metric Data Program. The accuracy, recall, precision and F-measure and support values of the algorithms on the data are compared. When the results are examined in terms of the accuracy of machine learning algorithms, the accuracy rates of the algorithms are quite high in all 4 data sets. The highest success rates were obtained from the classification algorithms applied in 4 data sets in CM1 and PC1 data sets. In 4 data sets, the highest success rates were seen with Random Forest algorithm.

Açıklama

5th International Conference on Computer Science and Engineering (UBMK) -- SEP 09-11, 2020 -- Diyarbakir, TURKEY

Anahtar Kelimeler

Software Defect Prediction, Mccabe, Halstead, Random Forest, Adaboost, Naive Bayes, Decision Tree

Kaynak

2020 5th International Conference on Computer Science And Engineering (Ubmk)

WoS Q Değeri

N/A

Scopus Q Değeri

N/A

Cilt

Sayı

Künye