Hybridized model selection with Gifi system for categorical data using the genetic algorithm and information complexity

Electronic Commerce Research and Applications(2023)

引用 1|浏览2
暂无评分
摘要
In the cross-disciplinary fields of social and behavioral sciences, biology, e-commerce, econometrics, medical data mining, and in engineering applications the available data are mostly composed of many categorical, continuous, and mixed data types with both categorical and continuous variables. Modeling such data structures creates many challenges and difficulties in terms of the underlying probability distributional assumptions to model. This paper proposes a novel categorical regression (CATREG) model using optimal scaling technique in Gifi system to resolve the current existing problem by transforming the categorical data to a continuous data and then performing the analysis of the data in the new transformed Gifi space. Such transformation preserves the scaling properties of the original variables without loss of any information and mapping is one-to-one and onto, unlike the kernel mapping in feature space in machine learning. We introduce a hybridized model selection via the information complexity (ICOMP) criterion along with the genetic algorithm (GA) in CATREG model and provide interpretable results. Two real numerical examples are provided to study the effects of the cell phone usage on the sleep patterns of individuals, and a second example is based on building a predictive model of e-commerce for new car market. In both of these numerical examples subset selection of the best predictor variables are determined to build an optimal predictive model. Our results show the efficiency and the versatility of the proposed new approach.
更多
查看译文
关键词
Optimal scaling,Gifi system,Genetic algorithm,Information complexity,Sleep patterns,E-commerce,New car market
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要