Prioritization of software development demands with text mining techniques

Küçük Resim Yok

Tarih

2019

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Pamukkale Univ

Erişim Hakkı

info:eu-repo/semantics/openAccess

Araştırma projeleri

Organizasyon Birimleri

Dergi sayısı

Özet

In corporations, software issues and software change demands are forwarded to the Information Technology (IT) unit via a demand management system. The priority information in this system has critical importance to the IT unit. However, the priority decision that is left to the individuals who create the demand records may not always be realistic. For instance, a non-critical and low-priority demand may be created with the highest priority, and this may lead to faulty planning and eventually to customer dissatisfaction. In this work, internal customer demands were classified using text mining techniques and their priorities were predicted. The system was trained and tested with the records extracted from the demand management system of a corporation. After cleaning and preprocessing the raw textual demand data, TF-IDF (Term Frequency - Inverse Document Frequency) weighting scheme was used when creating the document-term matrix. Several classification algorithms were tested on the data set generated, and the highest performance was obtained by Sequential Minimal Optimization algorithm with 54.1% F-Score. In addition, on the dataset made balanced with oversampling technique, the highest performance was achieved by Random Forest algorithm with 74.5% F-Score.

Açıklama

Anahtar Kelimeler

Software Engineering, Demand Prioritization, Machine Learning, Text Classification, Random Forest

Kaynak

Pamukkale University Journal of Engineering Sciences-Pamukkale Universitesi Muhendislik Bilimleri Dergisi

WoS Q Değeri

N/A

Scopus Q Değeri

Cilt

25

Sayı

5

Künye