Detecting Change in Data Streams

Daniel Kifer, Shai Ben-David, Johannes Gehrke

Research output: Chapter in Book/Report/Conference proceedingChapter

643 Scopus citations

Abstract

This chapter presents a novel method for the detection and estimation of change in a data stream. To provide statistical guarantees on the reliability of detected changes, this method provides meaningful descriptions and quantification of these changes. Detecting changes in a data stream is an important area of research with many applications. The approach assumes that the points in the stream are independently generated, but otherwise makes no assumptions on the nature of the generating distribution. Thus, these techniques work for both continuous and discrete data. A meta-algorithm for change detection in streaming data is also described. The meta-algorithm reduces the problem from the streaming data scenario to the problem of comparing two sample sets. In order to compare the various statistics for nonparametric change detection, it is necessary to use simulated data, so that the changes in generating distributions are known.

Original languageEnglish (US)
Title of host publicationProceedings 2004 VLDB Conference
Subtitle of host publicationThe 30th International Conference on Very Large Databases (VLDB)
PublisherElsevier
Pages180-191
Number of pages12
ISBN (Electronic)9780120884698
DOIs
StatePublished - Jan 1 2004

All Science Journal Classification (ASJC) codes

  • General Computer Science

Fingerprint

Dive into the research topics of 'Detecting Change in Data Streams'. Together they form a unique fingerprint.

Cite this