TY - GEN
T1 - Error-driven generalist+experts (EDGE)
T2 - 17th ACM Conference on Information and Knowledge Management, CIKM'08
AU - Huang, Jian
AU - Madani, Omid
AU - Giles, C. Lee
PY - 2008
Y1 - 2008
N2 - We introduce a multi-stage ensemble framework, Error-Driven Generalist+Expert or Edge, for improved classica-tion on large-scale text categorization problems. Edgerst trains a generalist, capable of classifying under all classes, to deliver a reasonably accurate initial category ranking given an instance. Edge then computes a confusion graph for the generalist and allocates the learning resources to train experts on relatively small groups of classes that tend to be systematically confused with one another by the generalist. The experts' votes, when invoked on a given instance, yield a reranking of the classes, thereby correcting the errors of the generalist. Our evaluations showcase the improved classification and ranking performance on several large-scale text categorization datasets. Edge is in particular effcient when the underlying learners are effcient. Our study of confusion graphs is also of independent interest.
AB - We introduce a multi-stage ensemble framework, Error-Driven Generalist+Expert or Edge, for improved classica-tion on large-scale text categorization problems. Edgerst trains a generalist, capable of classifying under all classes, to deliver a reasonably accurate initial category ranking given an instance. Edge then computes a confusion graph for the generalist and allocates the learning resources to train experts on relatively small groups of classes that tend to be systematically confused with one another by the generalist. The experts' votes, when invoked on a given instance, yield a reranking of the classes, thereby correcting the errors of the generalist. Our evaluations showcase the improved classification and ranking performance on several large-scale text categorization datasets. Edge is in particular effcient when the underlying learners are effcient. Our study of confusion graphs is also of independent interest.
UR - http://www.scopus.com/inward/record.url?scp=70349250367&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70349250367&partnerID=8YFLogxK
U2 - 10.1145/1458082.1458097
DO - 10.1145/1458082.1458097
M3 - Conference contribution
AN - SCOPUS:70349250367
SN - 9781595939913
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 83
EP - 92
BT - Proceedings of the 17th ACM Conference on Information and Knowledge Management, CIKM'08
Y2 - 26 October 2008 through 30 October 2008
ER -