TY - GEN
T1 - Web page clustering using harmony search optimization
AU - Forsati, Rana
AU - Mahdavi, Mehrdad
AU - Kangavari, Mohammadreza
AU - Safarkhani, Banafsheh
PY - 2008
Y1 - 2008
N2 - Clustering has become an increasingly important task in modern application domains. Targeting useful and relevant information on the World Wide Web is a topical and highly complicated research area. Clustering techniques have been applied to categorize documents on web and extracting knowledge from the web. In this paper we propose novel clustering algorithms based on Harmony Search (HS) optimization method that deals with web document clustering. By modeling clustering as an optimization problem, first, we propose a pure HS based clustering algorithm that finds near global optimal clusters within a reasonable time. Then we hybridize K-means and harmony clustering to achieve better clustering. Experimental results on five different data sets reveal that the proposed algorithms can find better clusters when compared to similar methods and the quality of clusters is comparable. Also proposed algorithms converge to the best known optimum faster than other methods.
AB - Clustering has become an increasingly important task in modern application domains. Targeting useful and relevant information on the World Wide Web is a topical and highly complicated research area. Clustering techniques have been applied to categorize documents on web and extracting knowledge from the web. In this paper we propose novel clustering algorithms based on Harmony Search (HS) optimization method that deals with web document clustering. By modeling clustering as an optimization problem, first, we propose a pure HS based clustering algorithm that finds near global optimal clusters within a reasonable time. Then we hybridize K-means and harmony clustering to achieve better clustering. Experimental results on five different data sets reveal that the proposed algorithms can find better clusters when compared to similar methods and the quality of clusters is comparable. Also proposed algorithms converge to the best known optimum faster than other methods.
UR - http://www.scopus.com/inward/record.url?scp=51849105280&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=51849105280&partnerID=8YFLogxK
U2 - 10.1109/CCECE.2008.4564812
DO - 10.1109/CCECE.2008.4564812
M3 - Conference contribution
AN - SCOPUS:51849105280
SN - 9781424416431
T3 - Canadian Conference on Electrical and Computer Engineering
SP - 1601
EP - 1604
BT - IEEE Canadian Conference on Electrical and Computer Engineering, Proceedings, CCECE 2008
T2 - IEEE Canadian Conference on Electrical and Computer Engineering, CCECE 2008
Y2 - 4 May 2008 through 7 May 2008
ER -