Abstract
In many application domains, there is a need for learning algorithms that can effectively exploit attribute value taxonomies (AVT) - hierarchical groupings of attribute values - to learn compact, comprehensible, and accurate classifiers from data - including data that are partially specified. This paper describes AVT-NBL, a natural generalization of the Naïve Bayes learner (NBL), for learning classifiers from AVT and data. Our experimental results show that AVT-NBL is able to generate classifiers that are substantially more compact and more accurate than those produced by NBL on a broad range of data sets with different percentages of partially specified values. We also show that AVT-NBL is more efficient in its use of training data: AVT-NBL produces classifiers that outperform those produced by NBL using substantially fewer training examples.
Original language | English (US) |
---|---|
Title of host publication | Proceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004 |
Publisher | IEEE Computer Society |
Pages | 289-296 |
Number of pages | 8 |
ISBN (Print) | 0769521428, 9780769521428 |
DOIs | |
State | Published - 2004 |
Event | Proceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004 - Brighton, United Kingdom Duration: Nov 1 2004 → Nov 4 2004 |
Publication series
Name | Proceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004 |
---|
Other
Other | Proceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004 |
---|---|
Country/Territory | United Kingdom |
City | Brighton |
Period | 11/1/04 → 11/4/04 |
All Science Journal Classification (ASJC) codes
- General Engineering