Unsupervised learning

Cluster analysis

W. Stuetzle and R. Nugent
A generalized single linkage method for estimating the cluster tree of a density.
Journal of Computational and Graphical Statistics, 2009 (to appear)
PDF   Online supplement

A. Murua, W. Stuetzle, J. Tantrum, and S. Sieberts
Model based document classification and clustering.
International Journal of Tomography & Statistics, Vol. 8, No. W08, 2008, pp. 1--24.
PDF

A. Murua, L. Stanberry, and W. Stuetzle
On Potts model clustering, kernel k-means, and density estimation.
Journal of Computational and Graphical Statistics, Vol. 17, No. 4, 2008, pp. 629--658.
PDF

W. Stuetzle
Estimating the cluster tree of a density by analyzing the minimal spanning tree of a sample.
Journal of Classification, Vol. 20, No. 5, 2003, pp. 25-47.
PDF

J. Tantrum, Alejandro Murua, and W. Stuetzle
Hierarchical model-based clustering of large datasets through Fractionation and Refractionation.
Joint work with Proceedings of the 8th International Conference on Knowledge Discovery and Data Mining (KDD02), 2002, pp. 183--190.
PDF

J. Tantrum, Alejandro Murua, and W. Stuetzle
Assessment and pruning of hierarchical model-based clustering.
Proceedings of the 9th International Conference on Knowledge Discovery and Data Mining (KDD03), 2003, pp. 197 -- 205.
PDF

 

Principal curves and nonlinear principal components

T. Duchamp and W. Stuetzle
Extremal properties of principal curves in the plane.
Annals of Statistics, Vol. 24, No. 4, 1996, pp. 1511 - 1520.
PDF

T. Duchamp and W. Stuetzle
Geometric properties of principal curves in the plane.
In Robust Statistics, Data Analysis, ad Computer Intensive Methods, Helmut Rieder, ed, Springer Lecture Notes in Statistics No. 109, 1995.
PDF

A. Buja, D. Donnell, and W. Stuetzle
Analysis of additive dependencies and concurvities using smallest additive principal components
Discussion paper, Annals of Statistics, Vol. 22, 1994, pp. 1635--1673.
PDF

T. Hastie and W. Stuetzle
Principal curves

Journal of the American Statistical Association, Vol. 84, 1989, pp. 502-516.
PDF

 

 

 

 

 

 

 

 

 

Talks on machine learning

Unsupervised learning: Estimating the cluster tree of a density from the minimal spanning tree of a sample. Powerpoint presentation

Unsupervised learning: Statistical and computational perspectives. Powerpoint presentation

What are the effects of "Bagging"? Some experimental and theoretical results. Powerpoint presentation

Generalized single linkage clustering.  Powerpoint presentation