Behaviorally Interpretable Transactional Features for Customer Segmentation Using K-Means in Grocery Retail

Authors

DOI:

https://doi.org/10.29408/edumatic.v10i1.34163

Keywords:

behavioral segmentation, customer segmentation, k-means clustering, transactional feature construction, unsupervised learning

Abstract

Customer segmentation based on transactional data is widely used to understand purchasing behavior in retail. However, many existing studies tend to emphasize algorithm performance, with limited discussion on how transactional variables represent actual customer behavior. This study adopts a quantitative approach using transactional sales data from a grocery retail store (Toko Solo Latri), consisting of 10,000 item-level records collected during June 2025. The analysis follows the CRISP-DM framework, covering data understanding, preparation, modeling, and evaluation stages. Customer behavior is represented through several aggregated variables, including transaction frequency, total items purchased, and product diversity. The K-Means clustering algorithm is applied to group customers into meaningful segments. The number of clusters is determined using the Elbow Method and further evaluated using Silhouette analysis. The results reveal three distinct customer segments with different levels of purchase intensity and product diversity. The Silhouette Score of 0.464 indicates a moderate clustering structure. In addition, one-way ANOVA shows significant differences across the observed variables, with large effect sizes (η² ranging from 0.736 to 0.822). These findings suggest that constructing behavior-based transactional features can improve the interpretability of customer segmentation results.

References

Anitha, P., & Patil, M. M. (2022). RFM model for customer purchase behavior using K-Means algorithm. Journal of King Saud University - Computer and Information Sciences, 34(5), 1785–1792. https://doi.org/10.1016/j.jksuci.2019.12.011

Ashraf, A., Rayed, C. A., Awad, N. A., & Sabry, H. M. (2025). ScienceDirect A Framework for Customer Segmentation to Improve Marketing Strategies Using Machine Learning. Procedia Computer Science, 260, 616–625. https://doi.org/10.1016/j.procs.2025.03.240

Dhandayudam, P., & Krishnamurthi, I. (2013). Customer Behavior Analysis Using Rough Set Approach. Journal of Theoretical and Applied Electronic Commerce Research, 8(2), 21–33. https://doi.org/10.4067/S0718-18762013000200003

Dianti, A. R., Mualifah, T., & Dirgantara, I. M. B. (2024). Artificial Intelligence for Marketing: Systematic Literature Review. Research Horizon, 04(06). https://doi.org//10.54518/rh.4.6.2024.393

Firdausi, A. A., Hartanti, D., & Sari, A. A. (2025). A Recommendation System Using the Content-Based Filtering Method for Batrisyia Herbal Face Care Products. Journal Of Information Systems And Computer Engineering, 10(2), 232–237. https://doi.org/10.51876/simtek.v10i2.1569

Guney, S., Peker, S., & Turhan, C. (2024). A Combined Approach for Customer Profiling in Video on Demand Services using Clustering and Association Rule Mining. IEEE Access, 1–11. https://doi.org/10.1109/ACCESS.2020.2992064

Hsiang, A., Chen, L., & Gunawan, S. (2023). applied sciences Enhancing Retail Transactions : A Data Driven Recommendation Using Modified RFM Analysis and Association Rules Mining. Applied Sciences, 13(18). https://doi.org/10.3390/app131810057

Lee, Y. U., Chung, S. H., & Park, J. Y. (2024). Online Review Analysis from a Customer Behavior Observation Perspective for Product Development. Sustainability Article, 16(9), 1–18. https://doi.org/10.3390/su16093550

Lin, R., Chuang, W., Chuang, C., & Chang, W. (2021). Applied Big Data Analysis to Build Customer Product Recommendation Model. Sustainability, 13(9). https://doi.org/doi.org/10.3390/su13094985

Mahfuza, R., Islam, N., & Emon, A. F. (2022). LRFMV : An efficient customer segmentation model for superstores. 1–29. https://doi.org/10.1371/journal.pone.0279262

Plank, A., & Koll, O. (2026). Who seeks variety ? Profiles and behaviors of consumers with varying brand choice concentration. Journal of Retailing, xxxx, 1–24. https://doi.org/10.1016/j.jretai.2026.01.004

Riswanto, A. L., Ha, S., Lee, S., & Kwon, M. (2024). Online Reviews Meet Visual Attention : A Study on Consumer Patterns in Advertising , Analyzing Customer Satisfaction , Visual Engagement , and Purchase Intention. Journal of Theoretical and Applied Electronic Commerce Research, 19(4), 3102–3122. https://doi.org/10.3390/jtaer19040150

Sari, A. A., Pramono, P., Saputra, I. T., & Prakoso, A. D. (2024). Optimalisasi Proses Digitalisasi UMKM melalui Aplikasi Marketplace berbasis Design Thinking. Edumatic: Jurnal Pendidikan Informatika, 8(2), 535–544. https://doi.org/10.29408/edumatic.v8i2.27702

Smaili, M. Y., & Hachimi, H. (2023). New RFM-D classification model for improving customer analysis and response prediction. Ain Shams Engineering Journal, 14(12), 102254. https://doi.org/10.1016/j.asej.2023.102254

Stylianou, T., & Pantelidou, A. (2025). A machine learning approach to consumer behavior in supermarket analytics. Decision Analytics Journal, 16(January), 100600. https://doi.org/10.1016/j.dajour.2025.100600

Suh, Y. (2025). Discovering customer segments through interaction behaviors for home appliance business. In Journal of Big Data. Springer International Publishing. https://doi.org/10.1186/s40537-025-01111-y

Tabianan, K., & Velu, S. (2022). K-Means Clustering Approach for Intelligent Customer Segmentation Using Customer Purchase Behavior Data. Sustainability (Switzerland), 14(12), 1–15. https://doi.org/10.3390/su14127243

Tang, J. (2025). Unlocking Retail Insights : Predictive Modeling and Customer Segmentation Through Data Analytics. Journal of Theoretical and Applied Electronic Commerce Research, 20(2), 1–20. https://doi.org/10.3390/jtaer20020059

Turkmen, B. (2022). Customer segmentation with machine learning for online retail industry. The European Journal of Social and Behavioural Sciences, 31(2). https://doi.org/10.15405/ejsbs.316

Ufeli, C. P., Sattar, M. U., & Hasan, R. (2025). Enhancing Customer Segmentation Through Factor Analysis of Mixed Data (FAMD)-Based Approach Using K-Means and Hierarchical Clustering Algorithms. Information, 16(6), 1–25. https://doi.org/10.3390/info16060441

Wang, S., Sun, L., & Yu, Y. (2024). A dynamic customer segmentation approach by combining LRFMS and multivariate time series clustering. Scientific Reports, 1–18. https://doi.org/10.1038/s41598-024-68621-2

Zhang, Y. (2022). Variety-Seeking Behavior in Consumption : A Literature Review and Future Research Directions. Frontiers in Psychology, 18(6). https://doi.org/10.3389/fpsyg.2022.874444

Downloads

Published

2026-04-21

How to Cite

Aprizal, R. M., Sari, O. K., & Bramantoro, A. (2026). Behaviorally Interpretable Transactional Features for Customer Segmentation Using K-Means in Grocery Retail. Edumatic: Jurnal Pendidikan Informatika, 10(1), 160–169. https://doi.org/10.29408/edumatic.v10i1.34163