TY - GEN
T1 - Semisupervised domain adaptation for mixture model based classifiers
AU - Raghuram, Jayaram
AU - Miller, David Jonathan
AU - Kesidis, George
PY - 2012
Y1 - 2012
N2 - This paper introduces a method for mixture model-based classifier domain adaptation, wherein one has adequate labeled training data for one (source) domain, very scarce labeled data for another (target) domain, and where the discrepancy between the source and target domain class-conditional distributions is not too great. Starting from the source domain classifier parameters, the method maximizes the likelihood of target domain data, while constrained to agree as much as possible with the target domain label information. This is achieved via an expectation maximization (EM) algorithm, where the joint distribution of the latent variables in the E-Step is parametrically constrained, in order to ensure space-partitioning implications are gleaned from the labeled target domain samples. Experiments on publicly available Internet packet-flow traffic data from different temporal and spatial domains demonstrate significant gains in classification performance compared to 1. direct porting of the source domain classifier; 2. semisupervised learning using only the target domain data; and 3. extension of an existing unsupervised domain adaptation method.
AB - This paper introduces a method for mixture model-based classifier domain adaptation, wherein one has adequate labeled training data for one (source) domain, very scarce labeled data for another (target) domain, and where the discrepancy between the source and target domain class-conditional distributions is not too great. Starting from the source domain classifier parameters, the method maximizes the likelihood of target domain data, while constrained to agree as much as possible with the target domain label information. This is achieved via an expectation maximization (EM) algorithm, where the joint distribution of the latent variables in the E-Step is parametrically constrained, in order to ensure space-partitioning implications are gleaned from the labeled target domain samples. Experiments on publicly available Internet packet-flow traffic data from different temporal and spatial domains demonstrate significant gains in classification performance compared to 1. direct porting of the source domain classifier; 2. semisupervised learning using only the target domain data; and 3. extension of an existing unsupervised domain adaptation method.
UR - http://www.scopus.com/inward/record.url?scp=84868570400&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84868570400&partnerID=8YFLogxK
U2 - 10.1109/CISS.2012.6310708
DO - 10.1109/CISS.2012.6310708
M3 - Conference contribution
AN - SCOPUS:84868570400
SN - 9781467331401
T3 - 2012 46th Annual Conference on Information Sciences and Systems, CISS 2012
BT - 2012 46th Annual Conference on Information Sciences and Systems, CISS 2012
T2 - 2012 46th Annual Conference on Information Sciences and Systems, CISS 2012
Y2 - 21 March 2012 through 23 March 2012
ER -