TY - GEN
T1 - IKNN
T2 - 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2007
AU - Yang, Song
AU - Jian, Huang
AU - Ding, Zhou
AU - Hongyuan, Zha
AU - Giles, C. Lee
PY - 2007
Y1 - 2007
N2 - The K-nearest neighbor (KNN) decision rule has been a ubiquitous classification tool with good scalability. Past experience has shown that the optimal choice of K depends upon the data, making it laborious to tune the parameter for different applications. We introduce a new metric that measures the informativeness of objects to be classified. When applied as a query-based distance metric to measure the closeness between objects, two novel KNN procedures, Locally Informative-KNN (LI-KNN) and Globally Informative-KNN (GI-KNN), are proposed. By selecting a subset of most informative objects from neighborhoods, our methods exhibit stability to the change of input parameters, number of neighbors(K) and informative points (I). Experiments on UCI benchmark data and diverse real-world data sets indicate that our approaches are application-independent and can generally outperform several popular KNN extensions, as well as SVM and Boosting methods.
AB - The K-nearest neighbor (KNN) decision rule has been a ubiquitous classification tool with good scalability. Past experience has shown that the optimal choice of K depends upon the data, making it laborious to tune the parameter for different applications. We introduce a new metric that measures the informativeness of objects to be classified. When applied as a query-based distance metric to measure the closeness between objects, two novel KNN procedures, Locally Informative-KNN (LI-KNN) and Globally Informative-KNN (GI-KNN), are proposed. By selecting a subset of most informative objects from neighborhoods, our methods exhibit stability to the change of input parameters, number of neighbors(K) and informative points (I). Experiments on UCI benchmark data and diverse real-world data sets indicate that our approaches are application-independent and can generally outperform several popular KNN extensions, as well as SVM and Boosting methods.
UR - http://www.scopus.com/inward/record.url?scp=38049101112&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38049101112&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:38049101112
SN - 9783540749752
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 248
EP - 264
BT - Knowledge Discovery in Database
Y2 - 17 September 2007 through 21 September 2007
ER -