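Supervised by: Chinese Academy of Sciences
Sponsored by: Chinese Society of Optimization, Overall Planning and Economic Mathematics
   Institutes of Science and Development, Chinese Academy of Sciences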

Chinese Journal of Management Science, 2015, Vol. 23, Issue 10: 162-169. doi: 10.16381/j.cnki.issn1003-207x.2015.10.019

• Articles •

Customer Targeting Model Based on Improved GMDH

XIAO Jin1, TANG Jing2,3, LIU Dun-hu4, XIE Ling1, WANG Shou-yang5   

    1. Business School, Sichuan University, Chengdu 610064, China;
    2. School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190, China;
    3. Research Center on Fictitious Economy & Data Science, Chinese Academy of Sciences, Beijing 100190, China;
    4. Management Faculty, Chengdu University of Information Technology, Chengdu 610225, China;
    5. Academy of Mathematics and System Sciences, Chinese Academy of Sciences, Beijing 100190, China
  • Received: 2014-10-30  Revised: 2015-01-08  Online: 2015-10-20  Published: 2015-10-24

Abstract: In recent years, database marketing has become a major topic in customer relationship management (CRM), and customer targeting modeling is one of its most important problems. Customer targeting is essentially a binary classification problem: customers are divided into those who respond to a company's marketing activities and those who do not. This study combines group method of data handling (GMDH) neural networks, a resampling technique, and the Logistic regression classification algorithm to construct a customer targeting model, LogGMDH-Logistic. The model consists of three phases: (1) To address the highly imbalanced class distribution of the training set in customer targeting modeling, a new resampling method (hybrid sampling) is proposed to balance it; (2) To select key features from the large number of characteristics describing customers, a new feature selection algorithm, Log-GMDH, is presented. It improves the traditional GMDH neural network in two respects, the transfer function and the external criterion. For the transfer function, it replaces the linear transfer function of the traditional GMDH neural network with the non-linear Logistic regression function; for the external criterion, it replaces the regularization criterion of the traditional GMDH neural network with the hit rate, which is better suited to customer targeting modeling; (3) The training set is mapped onto the selected feature subset, a Logistic regression classifier is trained on it, and the response probability of potential customers is predicted.
The experiment is carried out on a customer targeting dataset of a car insurance company from the CoIL2000 prediction competition, and the results show that the LogGMDH-Logistic model is superior to some existing customer targeting models in both performance and interpretability. CRM involves many customer classification problems similar to customer targeting modeling, such as customer churn prediction and customer credit scoring. The proposed model can therefore also be applied to these problems and is expected to achieve satisfactory classification performance.
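
To make two of the ideas in the abstract concrete, the following is a minimal Python sketch of a hybrid resampling step and a hit-rate criterion. It is not the authors' implementation: the abstract does not specify how hybrid sampling is carried out or which cutoff the hit rate uses. The sketch assumes that hybrid sampling means undersampling the majority class and oversampling the minority class to a common size, and that the hit rate is the share of actual responders among the top-scored fraction of customers. The names `hybrid_sample`, `hit_rate`, `target_per_class`, and `top_fraction` are hypothetical.

```python
import random


def hybrid_sample(rows, labels, target_per_class, seed=0):
    """Balance a binary (0/1) training set to target_per_class rows per class.

    Assumption: the larger class is undersampled without replacement and the
    smaller class is oversampled with replacement. The paper's exact scheme
    is not given in the abstract.
    """
    rng = random.Random(seed)
    chosen = []
    for cls in (0, 1):
        idx = [i for i, y in enumerate(labels) if y == cls]
        if not idx:
            raise ValueError(f"class {cls} has no examples")
        if len(idx) >= target_per_class:
            # Undersample: draw distinct rows from the class.
            chosen.extend(rng.sample(idx, target_per_class))
        else:
            # Oversample: keep every row, then add random duplicates.
            chosen.extend(idx + rng.choices(idx, k=target_per_class - len(idx)))
    rng.shuffle(chosen)
    return [rows[i] for i in chosen], [labels[i] for i in chosen]


def hit_rate(scores, labels, top_fraction=0.2):
    """Share of true responders among the top_fraction highest-scored customers.

    Assumption: top_fraction=0.2 is an illustrative default, not a value
    taken from the paper.
    """
    k = max(1, int(len(scores) * top_fraction))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sum(labels[i] for i in ranked[:k]) / k
```

In a pipeline like the one described, `hybrid_sample` would be applied to the training set only, and `hit_rate` would score candidate models on held-out data using the response probabilities predicted by the Logistic regression classifier.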

Key words: customer targeting, GMDH neural network, feature selection, hybrid sampling, logistic regression
