TY - JOUR

T1 - Weak limit theorems for univariate k-mean clustering under a nonregular condition

AU - Serinko, Regis J.

AU - Babu, Gutti Jogesh

N1 - Funding Information:
Received November 28, 1990; revised September 27, 1991. AMS 1980 classification numbers: 6OFO5, 62E20. Key words and phrases: Bahadur’s representation, singular maximum likelihood, double exponential distribution. * Research supported in part by NSA Grant MDA 904-90-H-1001.

PY - 1992/5

Y1 - 1992/5

N2 - A set of n points sampled from a common distribution F, is partitioned into k ≥ 2 groups that maximize the between group sum of squares. The asymptotic normality of the vector of probabilities of lying in each group and the vector of group means is known under the condition that a particular function, depending on F, has a nonsingular Hessian. This condition is not met by the double exponential distribution with k = 2. However, in this case it is shown that limiting distribution for the probability is b sign(W) √|W| and for the two means it is ai sign(W) √|W|, where W ∼ N(0,1) and b, a1, and a2 are constants. The rate of convergence in n 1 4 and the joint asymptotic disstribution for the two means is concentrated on the line x = y. A general theory is then developed for distributions with singular Hessians. It is shown that the projection of the probability vector onto some sequence of subspaces will have normal limiting distribution and that the rate of convergence is n 1 2. Further, a sufficient condition is given to assure that the probability vector and vector of group means have limiting distributions, and the possible limiting distributions under this condition are characterized. The convergence is slower than n 1 2.

AB - A set of n points sampled from a common distribution F, is partitioned into k ≥ 2 groups that maximize the between group sum of squares. The asymptotic normality of the vector of probabilities of lying in each group and the vector of group means is known under the condition that a particular function, depending on F, has a nonsingular Hessian. This condition is not met by the double exponential distribution with k = 2. However, in this case it is shown that limiting distribution for the probability is b sign(W) √|W| and for the two means it is ai sign(W) √|W|, where W ∼ N(0,1) and b, a1, and a2 are constants. The rate of convergence in n 1 4 and the joint asymptotic disstribution for the two means is concentrated on the line x = y. A general theory is then developed for distributions with singular Hessians. It is shown that the projection of the probability vector onto some sequence of subspaces will have normal limiting distribution and that the rate of convergence is n 1 2. Further, a sufficient condition is given to assure that the probability vector and vector of group means have limiting distributions, and the possible limiting distributions under this condition are characterized. The convergence is slower than n 1 2.

UR - http://www.scopus.com/inward/record.url?scp=38249011761&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=38249011761&partnerID=8YFLogxK

U2 - 10.1016/0047-259X(92)90070-V

DO - 10.1016/0047-259X(92)90070-V

M3 - Article

AN - SCOPUS:38249011761

SN - 0047-259X

VL - 41

SP - 273

EP - 296

JO - Journal of Multivariate Analysis

JF - Journal of Multivariate Analysis

IS - 2

ER -