E-Commerce Customer Segmentation Using K-Means Algorithm Based on Purchasing Characteristics and Customer Satisfaction
Abstract
This study addressed the existing research gap in e-commerce customer segmentation by integrating both behavioral metrics, specifically purchasing characteristics, and emotional metrics, which represented customer satisfaction levels. The primary objective was to establish a highly granular and representative customer typology that traditional transaction-only models fail to capture. To achieve this, a quantitative data mining approach was implemented using a dataset of 450 customer records, which underwent a crucial preprocessing phase using Min-Max Normalization to balance heterogeneous value ranges. The optimal number of clusters was determined using the Elbow Method, and the segmentation was executed through the K-Means Clustering algorithm. The empirical findings revealed that the dataset successfully partitioned into three distinct, non-overlapping behavioral archetypes: Cluster 0 representing high-intensity transactional users with a satisfaction gap, Cluster 1 representing at-risk or dissatisfied customers, and Cluster 2 representing satisfied advocates. The mathematical reliability and strong cohesion of these clusters were rigorously verified by a robust Silhouette Coefficient of 0.62. Ultimately, this research concluded that data-normalized pipelines could successfully transform raw customer analytics into actionable Customer Relationship Management (CRM) strategies, thereby providing e-commerce companies with a reliable foundation to optimize marketing efficiency, mitigate churn risks, and enhance overall profitability.
Full Text:
PDFReferences
Fauzan, R. M., & Alfian, G. (2024). Segmentasi pelanggan e-commerce menggunakan fitur recency, frequency, monetary (RFM) dan algoritma klasterisasi K-Means. Jurnal Informatika Sunan Kalijaga (JISKA), 9(3), 170–177. https://doi.org/10.14421/jiska.2024.9.3.170-177
Géron, A. (2023). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems (3rd ed.). O'Reilly Media.
Han, J., Kamber, M., & Pei, J. (2012). Data mining: Concepts and techniques (3rd ed.). Morgan Kaufmann.
Han, J., Kamber, M., & Pei, J. (2022). Data mining: Concepts and techniques (4th ed.). Morgan Kaufmann.
James, G., Witten, D., Hastie, T., & Tibshirani, R. (2021). An Introduction to Statistical Learning: with Applications in R (2nd ed.). Springer.
Kelleher, J. D., Tierney, B., & Namee, B. M. (2020). Fundamentals of machine learning for predictive data analytics (2nd ed.). MIT Press.
Kumar, V., & Rajan, B. (2022). Customer granularity and data-driven marketing automation: Optimizing K-Means clustering metrics. Journal of Marketing Analytics, 10(3), 215–229. https://doi.org/10.1057/s41270-022-00156-3
Sarker, I. H. (2021). Machine learning: Algorithms, real-world applications and research directions. SN Computer Science, 2(3), 160. https://doi.org/10.1007/s42979-021-00592-x
Shmueli, G., Bruce, P. C., Gedeck, P., Patel, N. R., & Lichtendahl, K. C. (2023). Data mining for business analytics: Concepts, techniques, and applications in Python (6th ed.). Wiley.
Siagian, R., Sirait, P., & Halima, A. (2021). E-commerce customer segmentation using K-Means algorithm and LRFM model. Journal of Informatics and Telecommunication Engineering, 5(1), 180–189.
Simanjuntak, W., Hombing, N. M. B., & Wijaya, A. (2025). Analisis segmentasi pelanggan menggunakan K-Means untuk personalisasi penawaran produk dalam konteks e-commerce. Jurnal Sains dan Teknologi Informasi.
Sinaga, K. P., & Yang, M. S. (2020). Unsupervised K-means clustering algorithm. IEEE Access, 8, 80716–80727. https://doi.org/10.1109/ACCESS.2020.2988767
Smith, A. J., & Thompson, M. R. (2021). Digital customer behavior: The dynamics of spurious loyalty in modern e-commerce formats. Routledge.
Tan, P.-N., Steinbach, M., Karpatne, A., & Kumar, V. (2019). Introduction to data mining (2nd ed.). Pearson.
Warianta, D. T., Astagina, P., Julianto, R., & Arini, F. Y. (2025). Optimasi K-Means menggunakan algoritma Firefly untuk segmentasi pelanggan pada e-commerce. Jurnal Fasilkom.
Wedel, M., & Kannan, P. K. (2016). Marketing analytics for data-rich environments. Journal of Marketing, 80(6), 97–121. https://doi.org/10.1509/jm.15.0413
Yuan, C., & Yang, H. (2019). Research on K-value selection method of K-means clustering algorithm. J, 2(2), 226-235. https://doi.org/10.3390/j2020016
Zhao, L., & Larry, P. (2023). Re-evaluating the service-profit chain in the digital platform economy: Satisfaction as a critical threshold. International Journal of Electronic Commerce, 27(2), 184–205. https://doi.org/10.1080/10864415.2023.2198341
Article Metrics
Abstract view : 1 timesPDF – 1 times
Refbacks
- There are currently no refbacks.
JURNAL Excellent:International Journal of computational Intelligence and Sustainable Innovation by Universitas Dharmawangsa Medan is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.




