Machine learning/Unsupervised Learning
Appearance
Unsupervised learning is a type of machine learning algorithm used to draw inferences from datasets consisting of input data without labeled responses.
The most common unsupervised learning method is cluster analysis, which is used for exploratory data analysis to find hidden patterns or grouping in data.
- What other methods are there besides cluster analysis?
Clustering
[edit | edit source]Problem statement:
- Given some data, , identify groups (clusters) of data points and assign each point to one of the groups (clusters)
- The clusters are identified using a measure of similarity which is defined upon metrics such as Euclidean or probabilistic distance.
- Points inside each cluster has a higher measure of similarity than data in any other cluster
Method | Comment |
---|---|
K-means clustering |
|
Hierarchical clustering | Builds a multilevel hierarchy of clusters by creating a cluster tree |
Gaussian mixture models | Models clusters as a mixture of multivariate normal density components |
Self-organizing maps | Uses neural networks that learn the topology and distribution of the data |