Abstract
In this study, we aim to predict customer purchase behavior using various machine learning models to better understand customer tendencies and enhance marketing strategies. We use a dataset containing demographic and behavioral data, including age, gender, annual income, number of purchases, product category, time spent on the website, loyalty program membership, and discounts availed. Our analysis involves data preprocessing, exploratory data analysis (EDA), and feature engineering. We then train and evaluate six different machine learning models: Logistic Regression, Random Forest, Gradient Boosting, Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and XGBoost. The models are assessed using metrics such as accuracy, precision, recall, F1-score, and ROC AUC. Results indicate that ensemble models, specifically Random Forest and Gradient Boosting, outperform the other models in terms of accuracy and ROC AUC. The study concludes that ensemble models are highly effective for predicting customer purchase behavior, providing valuable insights for businesses to tailor their marketing efforts. Future research could explore additional features, more advanced models, and real-time prediction capabilities.
References
Agarwal, V., S. Taware, S. A. Yadav, D. Gangodkar, A. L. N. Rao, et al., 2022 Customer - churn prediction using machine learning. In 2022 2nd International Conference on Technological Advancements in Computational Sciences (ICTACS), pp. 893–899, IEEE.
Alfian, G., M. Q. H. Octava, F. M. Hilmy, R. A. Nurhaliza, Y. M. Saputra, et al., 2023 Customer shopping behavior analysis using rfid and machine learning models. Information 14: 551.
Chaudhary, R., S. Chaudhary, A. Singh, A. Bhanu, S. Chandela, et al., 2024 Artificial intelligence-based digital marketing for discovering shopping possibilities and enhancing customer experience. In 2024 2nd International Conference on Disruptive Technologies (ICDT), pp. 413–417, IEEE.
Deniz, E., H. Erbay, and M. Co¸sar, 2022 Multi-label classification of e-commerce customer reviews via machine learning. Axioms 11: 436.
Krishna, R. D., M. Mahadev, S. Hariprasad, S. Abhishek, and T. Anjali, 2023 Cultivating customer purchase intent: Leveraging machine learning for precise predictions. In 2023 12th International Conference on System Modeling & Advancement in Research Trends (SMART), pp. 374–379, IEEE.
Krishnamoorthy, R., K. P. Kaliyamurthie, B. S. H. S. Ahamed, N. Harathi, and R. S. Selvan, 2023 Multi objective evaluator model development for analyzing customer behavior. In 2023 3rd International Conference on Advancement in Electronics & Communication Engineering (AECE), pp. 640–645, IEEE.
Kumar, M. M., A. S. Venkat, M. V. N. Balaji, C. H. N. Kumar, S. Srithar, et al., 2023 Driving e-commerce success with advanced machine learning: Customer purchase pattern insights. In 2023 International Conference on Sustainable Communication Networks and Application (ICSCNA), pp. 1196–1203, IEEE.
Rana, M. N. U., S. A. Al Shiam, S. A. Shochona, M. R. Islam, M. Asrafuzzaman, et al., 2024 Revolutionizing banking decisionmaking: A deep learning approach to predicting customer behavior. Journal of Business and Management Studies 6: 21–27.
Thabet, M., M. Messaadia, and M. Kumar, 2023 Distributed machine learning for predicting customer behavior while preserving privacy. In 2023 International Conference on Decision Aid Sciences and Applications (DASA), pp. 613–617, IEEE.
Wen, Z., W. Lin, and H. Liu, 2023 Machine-learning-based approach for anonymous online customer purchase intentions using clickstream data. Systems 11: 255.

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.