Behaviorally Interpretable Transactional Features for Customer Segmentation Using K-Means in Grocery Retail
DOI:
https://doi.org/10.29408/edumatic.v10i1.34163Keywords:
behavioral segmentation, customer segmentation, k-means clustering, transactional feature construction, unsupervised learningAbstract
Customer segmentation based on transactional data is widely used to understand purchasing behavior in retail. However, many existing studies tend to emphasize algorithm performance, with limited discussion on how transactional variables represent actual customer behavior. This study adopts a quantitative approach using transactional sales data from a grocery retail store (Toko Solo Latri), consisting of 10,000 item-level records collected during June 2025. The analysis follows the CRISP-DM framework, covering data understanding, preparation, modeling, and evaluation stages. Customer behavior is represented through several aggregated variables, including transaction frequency, total items purchased, and product diversity. The K-Means clustering algorithm is applied to group customers into meaningful segments. The number of clusters is determined using the Elbow Method and further evaluated using Silhouette analysis. The results reveal three distinct customer segments with different levels of purchase intensity and product diversity. The Silhouette Score of 0.464 indicates a moderate clustering structure. In addition, one-way ANOVA shows significant differences across the observed variables, with large effect sizes (η² ranging from 0.736 to 0.822). These findings suggest that constructing behavior-based transactional features can improve the interpretability of customer segmentation results.
References
Anitha, P., & Patil, M. M. (2022). RFM model for customer purchase behavior using K-Means algorithm. Journal of King Saud University - Computer and Information Sciences, 34(5), 1785–1792. https://doi.org/10.1016/j.jksuci.2019.12.011
Ashraf, A., Rayed, C. A., Awad, N. A., & Sabry, H. M. (2025). ScienceDirect A Framework for Customer Segmentation to Improve Marketing Strategies Using Machine Learning. Procedia Computer Science, 260, 616–625. https://doi.org/10.1016/j.procs.2025.03.240
Dhandayudam, P., & Krishnamurthi, I. (2013). Customer Behavior Analysis Using Rough Set Approach. Journal of Theoretical and Applied Electronic Commerce Research, 8(2), 21–33. https://doi.org/10.4067/S0718-18762013000200003
Dianti, A. R., Mualifah, T., & Dirgantara, I. M. B. (2024). Artificial Intelligence for Marketing: Systematic Literature Review. Research Horizon, 04(06). https://doi.org//10.54518/rh.4.6.2024.393
Firdausi, A. A., Hartanti, D., & Sari, A. A. (2025). A Recommendation System Using the Content-Based Filtering Method for Batrisyia Herbal Face Care Products. Journal Of Information Systems And Computer Engineering, 10(2), 232–237. https://doi.org/10.51876/simtek.v10i2.1569
Guney, S., Peker, S., & Turhan, C. (2024). A Combined Approach for Customer Profiling in Video on Demand Services using Clustering and Association Rule Mining. IEEE Access, 1–11. https://doi.org/10.1109/ACCESS.2020.2992064
Hsiang, A., Chen, L., & Gunawan, S. (2023). applied sciences Enhancing Retail Transactions : A Data Driven Recommendation Using Modified RFM Analysis and Association Rules Mining. Applied Sciences, 13(18). https://doi.org/10.3390/app131810057
Lee, Y. U., Chung, S. H., & Park, J. Y. (2024). Online Review Analysis from a Customer Behavior Observation Perspective for Product Development. Sustainability Article, 16(9), 1–18. https://doi.org/10.3390/su16093550
Lin, R., Chuang, W., Chuang, C., & Chang, W. (2021). Applied Big Data Analysis to Build Customer Product Recommendation Model. Sustainability, 13(9). https://doi.org/doi.org/10.3390/su13094985
Mahfuza, R., Islam, N., & Emon, A. F. (2022). LRFMV : An efficient customer segmentation model for superstores. 1–29. https://doi.org/10.1371/journal.pone.0279262
Plank, A., & Koll, O. (2026). Who seeks variety ? Profiles and behaviors of consumers with varying brand choice concentration. Journal of Retailing, xxxx, 1–24. https://doi.org/10.1016/j.jretai.2026.01.004
Riswanto, A. L., Ha, S., Lee, S., & Kwon, M. (2024). Online Reviews Meet Visual Attention : A Study on Consumer Patterns in Advertising , Analyzing Customer Satisfaction , Visual Engagement , and Purchase Intention. Journal of Theoretical and Applied Electronic Commerce Research, 19(4), 3102–3122. https://doi.org/10.3390/jtaer19040150
Sari, A. A., Pramono, P., Saputra, I. T., & Prakoso, A. D. (2024). Optimalisasi Proses Digitalisasi UMKM melalui Aplikasi Marketplace berbasis Design Thinking. Edumatic: Jurnal Pendidikan Informatika, 8(2), 535–544. https://doi.org/10.29408/edumatic.v8i2.27702
Smaili, M. Y., & Hachimi, H. (2023). New RFM-D classification model for improving customer analysis and response prediction. Ain Shams Engineering Journal, 14(12), 102254. https://doi.org/10.1016/j.asej.2023.102254
Stylianou, T., & Pantelidou, A. (2025). A machine learning approach to consumer behavior in supermarket analytics. Decision Analytics Journal, 16(January), 100600. https://doi.org/10.1016/j.dajour.2025.100600
Suh, Y. (2025). Discovering customer segments through interaction behaviors for home appliance business. In Journal of Big Data. Springer International Publishing. https://doi.org/10.1186/s40537-025-01111-y
Tabianan, K., & Velu, S. (2022). K-Means Clustering Approach for Intelligent Customer Segmentation Using Customer Purchase Behavior Data. Sustainability (Switzerland), 14(12), 1–15. https://doi.org/10.3390/su14127243
Tang, J. (2025). Unlocking Retail Insights : Predictive Modeling and Customer Segmentation Through Data Analytics. Journal of Theoretical and Applied Electronic Commerce Research, 20(2), 1–20. https://doi.org/10.3390/jtaer20020059
Turkmen, B. (2022). Customer segmentation with machine learning for online retail industry. The European Journal of Social and Behavioural Sciences, 31(2). https://doi.org/10.15405/ejsbs.316
Ufeli, C. P., Sattar, M. U., & Hasan, R. (2025). Enhancing Customer Segmentation Through Factor Analysis of Mixed Data (FAMD)-Based Approach Using K-Means and Hierarchical Clustering Algorithms. Information, 16(6), 1–25. https://doi.org/10.3390/info16060441
Wang, S., Sun, L., & Yu, Y. (2024). A dynamic customer segmentation approach by combining LRFMS and multivariate time series clustering. Scientific Reports, 1–18. https://doi.org/10.1038/s41598-024-68621-2
Zhang, Y. (2022). Variety-Seeking Behavior in Consumption : A Literature Review and Future Research Directions. Frontiers in Psychology, 18(6). https://doi.org/10.3389/fpsyg.2022.874444
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Rendy Muhammad Aprizal, Oktalia Kumala Sari, Arif Bramantoro

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All articles in this journal are the sole responsibility of the authors. Edumatic: Jurnal Pendidikan Informatika can be accessed free of charge, in accordance with the Creative Commons license used.

This work is licensed under a Lisensi a Creative Commons Attribution-ShareAlike 4.0 International License.


