Distance Metrics And Clustering Methods For Mixed-Type Data

INTERNATIONAL STATISTICAL REVIEW(2019)

引用 35|浏览53
暂无评分
摘要
In spite of the abundance of clustering techniques and algorithms, clustering mixed interval (continuous) and categorical (nominal and/or ordinal) scale data remain a challenging problem. In order to identify the most effective approaches for clustering mixed-type data, we use both theoretical and empirical analyses to present a critical review of the strengths and weaknesses of the methods identified in the literature. Guidelines on approaches to use under different scenarios are provided, along with potential directions for future research.
更多
查看译文
关键词
Discretisation, dummy coding, Gower's distance, k-means clustering, machine learning, Mahalanobis distance, mixture model, multivariate data analysis, unsupervised learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要