Solving Complexity Dataset in e-Ticketing using Machine Learning to Determine Optimum Feature

Authors

  • Siti Zulaikha Mohd Jamaludin School of Mathematical Sciences, Universiti Sains Malaysia, 11800 USM, Gelugor, Pulau Pinang, Malaysia
  • Majid Khan Majahar Ali School of Mathematical Sciences, Universiti Sains Malaysia, 11800 USM, Gelugor, Pulau Pinang, Malaysia
  • Eric Shiung Wong Vun School of Mathematical Sciences, Universiti Sains Malaysia, 11800 USM, Gelugor, Pulau Pinang, Malaysia
  • Mohd Tahir Ismail School of Mathematical Sciences, Universiti Sains Malaysia, 11800 USM, Gelugor, Pulau Pinang, Malaysia
  • Noor Farizah Ibrahim School of Computer Sciences, Universiti Sains Malaysia, 11800 USM, Gelugor, Pulau Pinang, Malaysia

DOI:

https://doi.org/10.11113/mjfas.v19n6.2921

Keywords:

e-Ticketing, Classification, Machine Learning, Accuracy, F1-score

Abstract

e-ticketing is one of the common applications used in technical support in Information Technology (IT) and has been used worldwide in any field of company. The benefits of e-ticketing can reduce the human efforts, increase the sufficiency of system and provides the benefits and efficiency to the customers. Also, e-ticketing ticketing system can enhance worker safety, improving productivity, increasing project efficiency and cause the good impact on the performance of the business in terms of profitability. The main objective of this study is defining the model performance in each important feature by analysing the complex dataset by using logistic regression as a Machine Learning (ML) algorithm. In evaluation the performance of classifier, the dataset is injected to Python programming and split into 90% as training set and 10% for the testing set. From the analysis, the study found that only 3 out of 11 independent features in dataset that are relevant chosen to proceed the ML analysis. From the result, the accuracy for sct_short_description, sct_cmdb_ci, and sct_assignment_group is 41.65%, 48.77% and 96.49%, respectively. It showed that the accuracy’s result for the sct_assignment_group resulted that the model is very good accuracy and indicate that the model is well performing. Meanwhile, the value of F1-score is 96.11% in each feature. This result indicates that the model has a good balance of precision and recall in its binary classification predictions. Hence, the study considers the sct_assignment_group as a best features to proceed the analysis. The future study will consider dealing the combination of complexity features by implementing more analysis on ML such as Support Vector Machines and Naïve Bayes.

References

Aglibar, K. D., & Rodelas, N. (2022). Impact of Critical and Auto Ticket: Analysis for Management and Workers Productivity in using a Ticketing System. International Journal of Computing Sciences Research, 6, 988-1004.

Al Shalabi, L., Najjar, M., & Al Kayed, A. (2006). A framework to deal with missing data in data sets. Journal of Computer Science, 2(9), 740-745.

Al-Hawari, F., & Barham, H. (2021). A machine learning based help desk system for IT service management. Journal of King Saud University-Computer and Information Sciences, 33(6), 702-718.

Bhanot, N., Ahuja, J., Kidwai, H. I., Nayan, A., & Bhatti, R. S. (2022). A sustainable economic revival plan for post-COVID-19 using machine learning approach–a case study in developing economy context. Benchmarking: An International Journal, (ahead-of-print).

Chang, W., Ji, X., Wang, L., Liu, H., Zhang, Y., Chen, B., & Zhou, S. (2021, October). A Machine-Learning method of predicting vital capacity plateau value for ventilatory pump failure based on data mining. In Healthcare (Vol. 9, No. 10, p. 1306). Multidisciplinary Digital Publishing Institute.

Chola, C., Muaad, A. Y., Bin Heyat, M. B., Benifa, J. V., Naji, W. R., Hemachandran, K., ... & Kim, T. S. (2022). BCNet: A Deep Learning Computer-Aided Diagnosis Framework for Human Peripheral Blood Cell Identification. Diagnostics, 12(11), 2815.

Chowdhury, M. Z. I., Leung, A. A., Walker, R. L., Sikdar, K. C., O’Beirne, M., Quan, H., & Turin, T. C. (2023). A comparison of machine learning algorithms and traditional regression-based statistical modeling for predicting hypertension incidence in a Canadian population. Scientific Reports, 13(1), 13.

Dedeturk, B. K., & Akay, B. (2020). Spam filtering using a logistic regression model trained by an artificial bee colony algorithm. Applied Soft Computing, 91, 106229.

DeVries, Z., Locke, E., Hoda, M., Moravek, D., Phan, K., Stratton, A., ... & Phan, P. (2021). Using a national surgical database to predict complications following posterior lumbar surgery and comparing the area under the curve and F1-score for the assessment of prognostic capability. The Spine Journal, 21(7), 1135-1142.

Gohil, F., & Kumar, M. V. (2019). Ticketing system. International Journal of Trend in Scientific Research and Development, 3(4), 155-156.

Gupta, M., Asadullah, A., Padmanabhuni, S., & Serebrenik, A. (2018). Reducing user input requests to improve IT support ticket resolution process. Empirical Software Engineering, 23(3), 1664-1703.

Hassan, M. A., & Ali, A. (2018). Logistic regression and gradient descent algorithms for big data classification. Journal of Big Data, 5(1), 1-19.

Hojski, D., Hazemali, D., & Lep, M. (2022). The Analysis of the Effects of a Fare Free Public Transport Travel Demand Based on E-Ticketing. Sustainability, 14(10), 5878.

Jamaludin, S. Z. M., Sa’ari, N. S., Kasihmuddin, M. S. M., Marsani, M. F., Zamri, N. E., Azhar, S. A., ... & Mansor, M. A. (2022). Artificial Bee Colony for Logic Mining in Credit Scoring. Malaysian Journal of Fundamental and Applied Sciences, 18(6), 654-673.

Jayasurya, G. G., Kumar, S., Singh, B. K., & Kumar, V. (2021). Analysis of public sentiment on COVID-19 vaccination using twitter. IEEE Transactions on Computational Social Systems, 9(4), 1101-1111.

Kasihmuddin, M. S. M., Jamaludin, S. Z. M., Mansor, M. A., Wahab, H. A., & Ghadzi, S. M. S. (2022). Supervised learning perspective in logic mining. Mathematics, 10(6), 915.

Li, F., & Sharma, A. (2022). Missing Data Filling Algorithm for Big Data-Based Map-Reduce Technology. International Journal of e-Collaboration (IJeC), 18(2), 1-11.

Malviya, A., & Dwivedi, R. K. (2022). Detecting Deceptive News in Social Media Using Supervised Machine Learning Techniques. In IOT with Smart Systems: Proceedings of ICTIS 2022, Volume 2 (pp. 239-250). Singapore: Springer Nature Singapore.

Mayes, E., Gehlbach, J. A., Jeziorczak, P. M., & Wooldridge, A. R. (2023). Machine Learning to Operationalize Team Cognition: A Case Study of Patient Handoffs. Human Factors in Healthcare, 100036.

Mordensky, S. P., Lipor, J. J., DeAngelo, J., Burns, E. R., & Lindsey, C. R. (2023). When less is more: How increasing the complexity of machine learning strategies for geothermal energy assessments may not lead toward better estimates. Geothermics, 110, 102662.

Nusinovici, S., Tham, Y. C., Yan, M. Y. C., Ting, D. S. W., Li, J., Sabanayagam, C., ... & Cheng, C. Y. (2020). Logistic regression was as good as machine learning for predicting major chronic diseases. Journal of Clinical Epidemiology, 122, 56-69.

Paramesh, S. P., & Shreedhara, K. S. (2019). Automated IT service desk systems using machine learning techniques. In Data Analytics and Learning (pp. 331-346). Springer, Singapore.

Penone, C., Davidson, A. D., Shoemaker, K. T., Di Marco, M., Rondinini, C., Brooks, T. M., ... & Costa, G. C. (2014). Imputation of missing data in life‐history trait datasets: Which approach performs the best?. Methods in Ecology and Evolution, 5(9), 961-970.

Qorib, M., Oladunni, T., Denis, M., Ososanya, E., & Cotae, P. (2023). COVID-19 vaccine hesitancy: Text mining, sentiment analysis and machine learning on COVID-19 vaccination Twitter dataset. Expert Systems with Applications, 212, 118715.

Robertson, G., Zhang, S., & Bogus, S. M. (2022). Challenges of Implementing E-Ticketing for Rural Transportation Construction Projects. In Construction Research Congress 2022 (pp. 453-462).

Rusli, A., Suryadibrata, A., Nusantara, S. B., & Young, J. C. (2020). A Comparison of Traditional Machine Learning Approaches for Supervised Feedback Classification in Bahasa Indonesia. International Journal of New Media Technology, 7(1), 28-32.

Sawhney, R., Malik, A., Sharma, S., & Narayan, V. (2023). A comparative assessment of artificial intelligence models used for early prediction and evaluation of chronic kidney disease. Decision Analytics Journal, 100169.

Schade, P., & Schuhmacher, M. C. (2023). Predicting entrepreneurial activity using machine learning. Journal of Business Venturing Insights, 19, e00357.

Singh, A., & Kumar, R. (2020, February). Heart disease prediction using machine learning algorithms. In 2020 international conference on electrical and electronics engineering (ICE3) (pp. 452-457). IEEE.

Singh, H., Sharma, V., & Singh, D. (2022). Comparative analysis of proficiencies of various textures and geometric features in breast mass classification using k-nearest neighbor. Visual Computing for Industry, Biomedicine, and Art, 5, 1-19.

Stojanov, Z., Dobrilovic, D., & Jevtic, V. (2011, September). Identifying properties of software change request process: Qualitative investigation in very small software companies. In 2011 IEEE 9th International Symposium on Intelligent Systems and Informatics (pp. 47-52). IEEE.

Tripathi, A., Patel, D., Sturgill, R., & Dadi, G. B. (2022). Analysis of E-Ticketing Technology for Inspection Performance and Practicality on Asphalt Paving Operations. Transportation Research Record: 03611981221083308.

Utama, A. A. G. S., Astuti, P. P. D., Hikmawati, E. E., & Setyowati, Y. (2021, November). Design E-Ticketing System to Increase Ticket Sales in Banyuwangi Branch New Star Cineplex. In International Conference on Management, Business, and Technology (ICOMBEST 2021) (pp. 120-126). Atlantis Press.

Xinzhou, H. (2015). The development trend of mobile Internet and its impact on the global economy. In 2015 International Conference on Social Science and Technology Education (pp. 598-600). Atlantis Press.

Zafeiropoulos, N., Mavrogiorgou, A., Kleftakis, S., Mavrogiorgos, K., Kiourtis, A., & Kyriazis, D. (2023). Interpretable Stroke Risk Prediction Using Machine Learning Algorithms. In Intelligent Sustainable Systems: Selected Papers of WorldS4 2022, Volume 2 (pp. 647-656). Singapore: Springer Nature Singapore.

Zheng, L., Wen, L., Lei, W., & Ning, Z. (2021). Added value of systemic inflammation markers in predicting pulmonary infection in stroke patients: A retrospective study by machine learning analysis. Medicine, 100(52).

Downloads

Published

04-12-2023