Advanced Clustering Algorithms
Kind of a misnomer in my opinion
What distinguishes this algorithm is the kind of clusters it finds
Other patterns can be fit using the Expectation Maximization algorithm
A centroid and a radius
Can do fun things like
Overlapping clusters
Explicitly treating points as outliers
Can assess with same approaches as before
Distortion
BIC
Much slower to create than k-means
Can be overkill for many problems
Conducts dimensionality reduction and clustering simultaneously
Like support vector machines
Mathematically equivalent to K-means clustering on a non-linear dimension-reduced space
Each data point starts as its own cluster
Two clusters are combined if the resulting fit is better
Continue until no more clusters can be combined
Repeatedly adds new point
A method for calculating difference between sequences when length is not the same
Can conduct statistical analysis of latent classes
Can be used to model changes in membership over time
Requires tons and tons of data
Very slow
Tends to find smaller number of latent classes than cluster analysis
Which one you choose depends on what the data look like
And what kind of patterns you want to find