Abstract: Many data mining algorithms depend on a good distance function to be successful. Among the many distance functions available, the Heterogeneous Euclidean-Overlap Metric (HEOM) is the simplest yet effective one for applications with both continuous and nominal attributes. To improve its generalization performance, we present an improved HEOM based on correlation weighting, which we call the correlation weighted Heterogeneous Euclidean-Overlap Metric (CWHEOM). For discrete and continuous class problems, CWHEOM applies different correlation functions to estimate the correlation between the attribute variables and the class variable. Experiments on 36 discrete class data sets and 36 continuous class data sets validate its effectiveness.
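To make the idea concrete, the following is a minimal sketch of an attribute-weighted HEOM, not the authors' implementation. It uses the standard HEOM per-attribute distances (overlap for nominal attributes, range-normalized difference for continuous ones, and 1 when a value is missing). The abstract does not specify the correlation functions or how the weights enter the metric, so the `weights` argument, its placement inside the square root, and the function name `weighted_heom` are all assumptions made for illustration.

```python
import math


def weighted_heom(x, y, nominal, ranges, weights=None):
    """Attribute-weighted HEOM distance between instances x and y.

    nominal: set of indices of nominal attributes.
    ranges:  dict mapping each continuous attribute index to (max - min).
    weights: optional per-attribute weights. ASSUMPTION: these stand in for
             the correlation-based weights, which the abstract does not define.
    """
    if weights is None:
        # With unit weights this reduces to plain HEOM.
        weights = [1.0] * len(x)

    total = 0.0
    for a, (xa, ya) in enumerate(zip(x, y)):
        if xa is None or ya is None:
            d = 1.0  # missing value: maximal per-attribute distance
        elif a in nominal:
            d = 0.0 if xa == ya else 1.0  # overlap metric
        else:
            r = ranges[a]
            d = abs(xa - ya) / r if r > 0 else 0.0  # range-normalized difference
        # ASSUMPTION: each weight multiplies the squared per-attribute distance.
        total += weights[a] * d * d
    return math.sqrt(total)


if __name__ == "__main__":
    x = ("red", 2.0)
    y = ("blue", 4.0)
    print(weighted_heom(x, y, nominal={0}, ranges={1: 4.0}))
    print(weighted_heom(x, y, nominal={0}, ranges={1: 4.0}, weights=[0.5, 2.0]))
```

In this sketch, an attribute with a larger weight contributes more to the distance, which is the general effect a correlation-based weighting would aim for when an attribute is strongly related to the class variable.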
Publication Year: 2011
Publication Date: 2011-01-01
Language: en
Type: article
Indexed In: ['crossref']
Access and Citation
Cited By Count: 1