K-Means Clustering and Multilayer Perceptron for Categorizing Student Business Groups

Miftahul Walid, Norfiah Lailatin Nispi Sahbaniya, Hozairi Hozairi, Fajar Baskoro, Arya Yudhi Wijaya

Abstract


The research conducted in this study was driven by the East Java provincial government's requirement to assess the transaction levels of the Student Business Group (KUS) in the SMA Double Track program. These transaction levels are a basis for allocating supplementary financial aid to each business group. The system's primary objective is to assist the provincial government of East Java in making well-informed choices pertaining to the distribution of supplementary capital to the KUS. The classification technique employed in this study is the multilayer perceptron. However, the K-Means Clustering method is utilised to generate target data due to the limited availability during the classification process, which involves dividing the transaction level attributes into three distinct groups: (0) low transactions, (1) medium transactions, and (2) high transactions. The clustering process encompasses three distinct features: (1) income, (2) spending, and (3) profit. These three traits will be utilized as input data throughout the categorization procedure. The classification procedure employing the Multilayer Perceptron technique involved processing a dataset including 1383 data points. The training data constituted 80% of the dataset, while the remaining 20% was allocated for testing. In order to evaluate the efficacy of the constructed model, the training error was assessed using K-Fold cross-validation, yielding an average accuracy score of 0.92. In the present study, the categorization technique yielded an accuracy of 0.96. This model aims to classify scenarios when the dataset lacks prior target data.

Full Text:

PDF

References


A. Yulikah, M. A. Faizin, and A. E. Sujianto, "Implementation of Islamic Entrepreneurship Concept in Double Track SMA Program," Indones. Econ. Rev., vol. 1, no. 2, pp. 98–108, 2021.

J. Yu, Y. Qiao, N. Shu, K. Sun, S. Zhou, and J. Yang, "Neural Network Based Transaction Classification System for Chinese Transaction Behavior Analysis," Proc. - 2019 IEEE Int. Congr. Big Data, BigData Congr. 2019 - Part 2019 IEEE World Congr. Serv., pp. 64–71, 2019.

T. Hu, X. Liu, T. Chen, X. Zhang, and X. Huang, "Transaction-based classification and detection approach for Ethereum smart contract," vol. 58, no. May 2020, 2021.

B. Karunachandra, N. Putera, S. R. Wijaya, D. Suryani, J. Wesley, and Y. Purnama, "On the benefits of machine learning classification in cashback fraud detection," Procedia Comput. Sci., vol. 216, no. 2022, pp. 364–369, 2023.

R. Saxena, D. Arora, and V. Nagar, "Classifying Transactional Addresses using Supervised Learning Approaches over Ethereum Blockchain," Procedia Comput. Sci., vol. 218, no. 2022, pp. 2018–2025, 2023.

R. Sukumaran, "Improved Customer Transaction Classification using Semi-Supervised Knowledge Distillation," 2021.

A. Mardanshahi, V. Nasir, S. Kazemirad, and M. M. Shokrieh, "Detection and classification of matrix cracking in laminated composites using guided wave propagation and artificial neural networks," Compos. Struct., vol. 246, no. April, p. 112403, 2020.

U. Orhan, M. Hekim, and M. Ozer, "EEG signals classification using the K-means clustering and a multilayer perceptron neural network model," Expert Syst. Appl., vol. 38, no. 10, pp. 13475–13481, 2011.

Y. Rong and Y. Liu, "Staged text clustering algorithm based on K-means and hierarchical agglomeration clustering," Proc. 2020 IEEE Int. Conf. Artif. Intell. Comput. Appl. ICAICA 2020, pp. 124–127, 2020.

T. Gupta and S. P. Panda, "Clustering Validation of CLARA and K-Means Using Silhouette DUNN Measures on Iris Dataset," Proc. Int. Conf. Mach. Learn. Big Data, Cloud Parallel Comput. Trends, Prespectives Prospect. Com. 2019, pp. 10–13, 2019.

P. Sharath Chander, J. Soundarya, and R. Priyadharsini, Brain Tumour Detection and Classification Using K-Means Clustering and SVM Classifier. Springer Singapore, 2020.

A. Iqbal and S. Aftab, "A classification framework for software defect prediction using multi-filter feature selection technique and MLP," Int. J. Mod. Educ. Comput. Sci., vol. 12, no. 1, pp. 18–25, 2020.

R. Sharma, M. Kim, and A. Gupta, "Motor imagery classification in brain-machine interface with machine learning algorithms: Classical approach to multilayer perceptron model," Biomed. Signal Process. Control, vol. 71, no. PA, p. 103101, 2022.

H. Shi and M. Xu, "A Data Classification Method Using Genetic Algorithm and K-Means Algorithm with Optimizing Initial Cluster Center," 2018 IEEE Int. Conf. Comput. Commun. Eng. Technol. CCET 2018, pp. 224–228, 2018.

N. Sapkota, A. Alsadoon, P. W. C. Prasad, A. Elchouemi, and A. K. Singh, "Data Summarization Using Clustering and Classification: Spectral Clustering Combined with k-Means Using NFPH," Proc. Int. Conf. Mach. Learn. Big Data, Cloud Parallel Comput. Trends, Prespectives Prospect. Com. 2019, pp. 146–151, 2019.

G. Zhou, H. Moayedi, and L. K. Foong, "Teaching–learning-based metaheuristic scheme for modifying neural computing in appraising energy performance of building," Eng. Comput., vol. 37, no. 4, pp. 3037–3048, 2021.

S. Brownfield and J. Zhou, "Sentiment Analysis of Amazon Product Reviews," Adv. Intell. Syst. Comput., vol. 1295, no. 1, pp. 739–750, 2020.

J. Naskath, G. Sivakamasundari, and A. A. S. Begum, "A Study on Different Deep Learning Algorithms Used in Deep Neural Nets: MLP SOM and DBN," Wirel. Pers. Commun., vol. 128, no. 4, pp. 2913–2936, 2023.

M. Y. Chuttur and Y. Parianen, "A Comparison of Machine Learning Models to Prioritise Emails using Emotion Analysis for Customer Service Excellence," Knowl. Eng. Data Sci., vol. 5, no. 1, p. 41, 2022.

S. Talatian Azad, G. Ahmadi, and A. Rezaeipanah, "An intelligent ensemble classification method based on multilayer perceptron neural network and evolutionary algorithms for breast cancer diagnosis," J. Exp. Theor. Artif. Intell., vol. 34, no. 6, pp. 949–969, 2022.

I. Tolstikhin et al., "MLP-Mixer: An all-MLP Architecture for Vision," Adv. Neural Inf. Process. Syst., vol. 29, no. NeurIPS, pp. 24261–24272, 2021.

M. Wang, Y. Lu, and J. Qin, "A dynamic MLP-based DDoS attack detection method using feature selection and feedback," Comput. Secur., vol. 88, p. 101645, 2020.

N. Salankar, P. Mishra, and L. Garg, "Emotion recognition from EEG signals using empirical mode decomposition and second-order difference plot," Biomed. Signal Process. Control, vol. 65, no. August 2020, p. 102389, 2021.

A. Kurani, P. Doshi, A. Vakharia, and M. Shah, "A Comprehensive Comparative Study of Artificial Neural Network (ANN) and Support Vector Machines (SVM) on Stock Forecasting," Ann. Data Sci., vol. 10, no. 1, pp. 183–208, 2023.

T. S. Bressan, M. Kehl de Souza, T. J. Girelli, and F. C. Junior, "Evaluation of machine learning methods for lithology classification using geophysical data," Comput. Geosci., vol. 139, p. 104475, 2020.

P. Benz, S. Ham, C. Zhang, A. Karjauv, and I. S. Kweon, "Adversarial Robustness Comparison of Vision Transformer and MLP-Mixer to CNNs," 2021.

J. Hwang, J. Lee, and K. S. Lee, "A deep learning-based method for grip strength prediction: Comparison of multilayer perceptron and polynomial regression approaches," PLoS One, vol. 16, no. 2 February, pp. 1–12, 2021.

M. B. Tamam, M. Walid, J. Freitas, and A. Bernardo, "Classification of Sign Language in Real Time Using Convolutional Neural Network," vol. 6, no. 1, pp. 39–46, 2023.

X. S. Tan, Z. Yang, Y. Benlimane, and E. Liu, "Using Classification with K-means Clustering to Investigate Transaction Anomaly," pp. 171–174, 2020.

C. F. Yang, G. J. Liu, and C. G. Yan, “A k-means-based and no-super-parametric Improvement of AdaBoost and its Application to Transaction Fraud Detection,” in 2020 IEEE International Conference on Networking, Sensing and Control (ICNSC), Oct. 2020, pp. 1–5.

T. Amarasinghe and N. Krishnarajah, "Critical Analysis of Machine Learning Based Approaches for Fraud Detection in Financial Transactions," pp. 12–17, 2018.

A. R. Khan, M. Harouni, and R. Abbasi, "Brain tumor segmentation using K-means clustering and deep learning with synthetic data augmentation for classification," no. February 2020, pp. 1–11, 2021.

C. Usha Kumari, S. Jeevan Prasad, and G. Mounika, "Leaf disease detection: Feature extraction with k-means clustering and classification with ANN," Proc. 3rd Int. Conf. Comput. Methodol. Commun. ICCMC 2019, no. Iccmc, pp. 1095–1098, 2019.

M. H. Santoso, D. A. Larasati, U. Medan, and A. Sumatera, "Wayang Image Classification Using MLP Method and GLCM Feature Extraction," J. Comput. Sci. Inf. Technol. Telecommun. Eng., vol. 1, no. 2, pp. 111–120, 2020.

I. T. Um, J. H. Ra, and M. H. Kim, "Comparison of clustering methods for MLP-based speaker verification," Proc. - Int. Conf. Pattern Recognit., vol. 15, no. 2, pp. 475–478, 2000.

S. Fekri-Ershad, "Bark texture classification using improved local ternary patterns and multilayer neural network," Expert Syst. Appl., vol. 158, p. 113509, 2020.




DOI: http://dx.doi.org/10.17977/um018v6i12023p69-78

Refbacks

  • There are currently no refbacks.


Copyright (c) 2023 Knowledge Engineering and Data Science

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Flag Counter

Creative Commons License


This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

View My Stats