Using Logistic Regression Model to Predict the Success of Bank Telemarketing
International Journal on Data Science and Technology
Volume 4, Issue 1, March 2018, Pages: 35-41
Received: Jun. 20, 2018; Published: Jun. 21, 2018
Views 1239      Downloads 275
Yiyan Jiang, School of Data Science, Zhejiang University of Finance and Economics, Hangzhou, China
Article Tools
Follow on us
Term deposit is always an essential business of a bank and a good market campaign plays an essential role in financial selling. Nowadays, the telephone marketing, which can assist consulting institution to extract potential clients, has been one of the most general marketing campaigns. Previous research shows that data mining has gradually stood out on the era of Big Data and has been incorporated to deal with massive data precisely. The purpose of this study is to predict the success of bank telemarketing to select the best consumer set. A relationship is observed between success and other factors through constructing logistic regression model. To validate the effectiveness of prediction, some basic classification models have been compared in this study, including Bayes, Support Vector Machine, Neural Network and Decision Tree. As a result, the prediction accuracy and the area under ROC curve prove the logistic regression model performs best in classifying than other models. All of the experiments are implemented by R language software. And the experimental results can provide some suggestions and instructions towards the management of the bank.
Term Deposit, Data Mining, Prediction, Logistic Regression Model, R Language
To cite this article
Yiyan Jiang, Using Logistic Regression Model to Predict the Success of Bank Telemarketing, International Journal on Data Science and Technology. Vol. 4, No. 1, 2018, pp. 35-41. doi: 10.11648/j.ijdst.20180401.15
S. J. Press and S. Wilson, “Choosing between logistic regression and discriminant analysis,” Journal of the American Statistical Association, 1978, vol. 73, pp. 699-705.
G. A. Morgan, J. J. Vaske, J. A. Gliner and R. J. Harmon, “Logistic regression and discriminant analysis: use and Interpretation,” Journal of the American Academy of Child and Adolescent Psychiatry, 2003, vol. 42, pp. 994-997.
S. Lisawadi, M. K. A. Shah and S. E. Ahmed, “Model selection and post estimation based on a pretest for logistic regression models,” Journal of Statistical Computation and Simulation, 2016, vol. 86, pp. 3495-3511.
A. E. Ades, M. Sculpher, A. Sutton and K. Abrams, et al., “Bayesian methods for evidence synthesis in cost-effectiveness analysis,” Pharmacoeonomics, vol. 24, pp. 1-19.
W. Dumouchel, “Bayesian data mining in large frequency tables with an application to the FDA spontaneous reporting System,” American Statistician, 1999, vol. 53, pp. 177-190.
K. J. Friston, V. Litvak, A. Oswal, A. Razi, K. E. Stephan, B. C. M. van Wijk, G. Ziegler and P. Zeidman, “Bayesian model reduction and empirical Bayes for group (DCM) studies,” Neuroimage, 2016, vol. 128, pp. 413-431.
G. M. Foody and A. Mathur, “A relative evaluation of multiclass image classification by support vector machines,” IEEE Transactions on Geoscience and Remote Sensing, 2004, vol. 42, pp. 1335-1343.
A. M. Andrew, “An introduction to support vector machines and other kernel‐based learning methods,” Kybernetes, 2002, vol. 32, pp. 1-28.
L. V. Utkin, A. I. Chekh, and Y. A. Zhuk, “Binary classification SVM-based algorithms with interval-valued training data using triangular and Epanechnikov kernels,” Neural Networks, 2016, vol. 80, pp. 53-66.
H. S. Hippert, C. E. Pedreira and R. C. Souza, “Neural networks for short-term load forecasting: a review and evaluation,” IEEE Transactions on Power Systems, 2001, vol. 16, pp. 44-55.
S. R. Presnell and F. E. Cohen, “Artificial neural networks for pattern recognition in biochemical sequences”, Annual Review of Biophysics and Biomolecular Structure, 1993, vol. 22, pp. 283-298.
L. Wang, B. Yang, Y. Chen, X. Zhang and J. Orchard, “Improving neural-network classifiers using nearest neighbor partitioning,” IEEE Transactions on Neural Networks and Learning Systems, 2017, vol. 28, pp. 2255-2267.
M. Núñez, “The use of background knowledge in decision tree induction,” Machine Learning, 1991, vol. 6, pp. 231-250.
S. K. Murthy, “Automatic construction of decision trees from data: a multi-disciplinary survey,” Data Mining and Knowledge Discovery, 1998, vol. 2, pp. 345-389.
R. Wang, S. Kwong, X. Z. Wang and Q. Jiang, “Segment based decision tree induction with continuous valued attributes,” IEEE Transactions on Cybernetics, 2017, vol. 45, pp. 1262-1275.
M. S. Chen, J. Han and P. S. Yu, “Data mining: an overview from a database perspective,” IEEE Transactions on Knowledge and Data Engineering, 2002, vol. 8, pp. 866-883.
S. Moro, P. Cortez and P. Rita, “A data-driven approach to predict the success of bank telemarketing,” Decision Support Systems, 2014, vol. 62, pp. 22-31.
S. Moro, P. Cortez and P. Rita, “Using customer lifetime value and neural networks to improve the prediction of bank deposit subscription in telemarketing campaigns,” Neural Computing and Applications, 2015, vol. 26, pp. 131-139.
T. M. P. D. John, P. S. P. D. Theodore and L. E. P. D. Terry, “Supply chain management and its relationship to logistics, marketing, production, and operations management,” Journal of Business Logistics, 2008, vol. 29, pp. 31–46.
M. Strano and B. M. Colosimo, “Logistic regression analysis for experimental determination of forming limit diagrams,” International Journal of Machine Tools and Manufacture, 2006, vol. 46, pp. 673–682.
Y. Yorozu, M. Hirano, K. Oka and Y. Tagawa, “Electron spectroscopy studies on magneto-optical media and plastic substrate interface,” IEEE Translation Journal on Magnetics in Japan, 1987, vol. 2, pp. 740–741.
Science Publishing Group
1 Rockefeller Plaza,
10th and 11th Floors,
New York, NY 10020
Tel: (001)347-983-5186