Title: CREDOS: Classification using Ripple Down Structure (A Case for Rare Classes)
Abstract:Previous chapter Next chapter Full AccessProceedings Proceedings of the 2004 SIAM International Conference on Data Mining (SDM)CREDOS: Classification using Ripple Down Structure (A Case for Rare Class...Previous chapter Next chapter Full AccessProceedings Proceedings of the 2004 SIAM International Conference on Data Mining (SDM)CREDOS: Classification using Ripple Down Structure (A Case for Rare Classes)Mahesh V. Joshi and Vipin KumarMahesh V. Joshi and Vipin Kumarpp.321 - 332Chapter DOI:https://doi.org/10.1137/1.9781611972740.30PDFBibTexSections ToolsAdd to favoritesExport CitationTrack CitationsEmail SectionsAboutAbstract Ripple down rules (RDRs) are commonly used by the expert systems community because they make knowledge bases easy to use and efficient to maintain. We observe that RDRs offer a unique tree-based representation that generalizes the decision tree and disjunctive normal form (DNF) rule-based models, and specializes a generic form of the PNrule model. In this paper, we explore their use for learning predictive classifier models. Such models require to have a generalization capability, most commonly achieved with the help of pruning methods. Existing RDR induction algorithms are developed to build an initial knowledge base that will be used and modified by humans to explain every case correctly. They do not look at RDR as a predictive model, and hence offer very little measures against over-fitting. Existing pruning strategies developed by the data mining community cannot be directly used for pruning a RDR structure because of the uniqueness of the structure and the prediction process. In this paper, we propose a novel induction algorithm CREDOS. The key characteristic of CREDOS is its generic pruning framework. We provide a specific instantiation of it based on the minimum description length (MDL) principle. Using real-world datasets requiring prediction of rare classes, we compare CREDOS to other state-of-the-art algorithms. It exhibits significantly better or comparable performance, especially in predicting a wide variety of rarely occurring events. Previous chapter Next chapter RelatedDetails Published:2004ISBN:978-0-89871-568-2eISBN:978-1-61197-274-0 https://doi.org/10.1137/1.9781611972740Book Series Name:ProceedingsBook Code:PR117Book Pages:xiv + 537Read More
Publication Year: 2004
Publication Date: 2004-04-22
Language: en
Type: article
Indexed In: ['crossref']
Access and Citation
Cited By Count: 19
AI Researcher Chatbot
Get quick answers to your questions about the article from our AI researcher chatbot