Transductive methods for distributed ensemble classification

David J. Miller, Siddharth Pal

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

We consider ensemble classification for the case when there is no common labeled training data for designing the function which aggregates individual classifier decisions. We dub this problem distributed ensemble classification, addressing e.g. when individual classifiers are trained on different (e.g. proprietary, legacy) databases or operate (perhaps remotely) on different sensing modalities. Typically, fixed, principled (untrained) rules of classifier combination such as voting methods are used in this case for aggregating decisions. Alternatively, we take a transductive approach, optimizing the combining rule for an objective function measured on the unlabeled batch of test data. We propose specific maximum likelihood (ML) objectives that are shown to yield well-known forms of aggregation, albeit with iterative, EM-based adjustment to account for possible mismatch between the class priors used by individual classifiers and those reflected in the new data batch. We also propose an information-theoretic method which outperforms the ML methods and addresses some problem instances where the ML methods are not applicable. On benchmark data from the UC Irvine machine learning repository, all our methods give improvements in accuracy over the use of fixed rules when there is prior mismatch.

Original languageEnglish (US)
Title of host publication2006 IEEE Conference on Information Sciences and Systems, CISS 2006 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1605-1610
Number of pages6
ISBN (Print)1424403502, 9781424403509
DOIs
StatePublished - Jan 1 2006
Event2006 40th Annual Conference on Information Sciences and Systems, CISS 2006 - Princeton, NJ, United States
Duration: Mar 22 2006Mar 24 2006

Publication series

Name2006 IEEE Conference on Information Sciences and Systems, CISS 2006 - Proceedings

Other

Other2006 40th Annual Conference on Information Sciences and Systems, CISS 2006
Country/TerritoryUnited States
CityPrinceton, NJ
Period3/22/063/24/06

All Science Journal Classification (ASJC) codes

  • General Computer Science

Fingerprint

Dive into the research topics of 'Transductive methods for distributed ensemble classification'. Together they form a unique fingerprint.

Cite this