PENS: An algorithm for density-based clustering in peer-to-peer systems

Mei Li, Guanling Lee, Wang-chien Lee, Anand Sivasubramaniam

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Scopus citations

Abstract

Huge amounts of data are available in large-scale networks of autonomous data sources dispersed over a vide area. Data mining is an essential technology for obtaining hidden and valuable knowledge from these networked data sources. In this paper, we investigate clustering, one of the most important data mining tasks, in one of such networked computing environments, i.e., peer-to-peer (P2P) systems. The lack of a central control and the sheer large size of P2P systems make the existing clustering techniques not applicable here. We propose a fully distributed clustering algorithm, called Peer dENsity-based cluStering (PENS), which overcomes the challenge raised in performing clustering in peer-to-peer environments, i.e., cluster assembly. The main idea of PENS is hierarchical cluster assembly, which enables peers to collaborate in forming a global clustering model without requiring a central control or message flooding. The complexity analysis of the algorithm demonstrates that PENS can discover clusters and noise efficiently in P2P systems.

Original languageEnglish (US)
Title of host publicationProceedings of the 1st International Conference on Scalable Information Systems, InfoScale '06
DOIs
StatePublished - 2006
Event1st International Conference on Scalable Information Systems, InfoScale '06 - Hong Kong, China
Duration: May 30 2006Jun 1 2006

Publication series

NameACM International Conference Proceeding Series
Volume152

Other

Other1st International Conference on Scalable Information Systems, InfoScale '06
Country/TerritoryChina
CityHong Kong
Period5/30/066/1/06

All Science Journal Classification (ASJC) codes

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'PENS: An algorithm for density-based clustering in peer-to-peer systems'. Together they form a unique fingerprint.

Cite this