
Unsupervised Learning

  • Outline

    • Unsupervised learning
    • Dimension reduction

      • Principal component analysis (PCA)
      • T-Distributed Stochastic Neighbor Embedding (t-SNE)
    • Clustering

      • K-means
      • Hierarchical clustering

Dimension Reduction

This is the same idea as the Data Compression topic in the Coursera course.

  • Why dimension reduction?

    • Compress data while preserving useful information
    • Data visualization

Principal Component Analysis (PCA)

Just taking notes for now; I'll come back and review this when I get to the Coursera material!!!

  • PCA
    • Projects high-dimensional points onto a lower-dimensional space, hoping that the low-dimensional representation preserves the structure the points had in the high dimension!

PCA in general

  • Perform PCA by computing the eigenvectors corresponding to the k largest eigenvalues of the data's covariance matrix
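A minimal NumPy sketch of that recipe (the `pca` function name and interface here are just for illustration):

```python
import numpy as np

def pca(X, k):
    """Project X (n samples x d features) onto its top-k principal components."""
    # Center the data so the covariance is computed around the mean
    X_centered = X - X.mean(axis=0)
    # Covariance matrix of the features (d x d)
    cov = np.cov(X_centered, rowvar=False)
    # eigh handles symmetric matrices; eigenvalues come back in ascending order
    eigvals, eigvecs = np.linalg.eigh(cov)
    # Keep the eigenvectors of the k largest eigenvalues
    top_k = np.argsort(eigvals)[::-1][:k]
    W = eigvecs[:, top_k]   # d x k projection matrix
    return X_centered @ W   # n x k low-dimensional representation
```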

PCA on MNIST

(figure: 2-D PCA projection of MNIST digits)
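As a lightweight stand-in for the full MNIST experiment, here is a sketch using scikit-learn's PCA on its small built-in digits dataset; the picture it produces is analogous to the figure above:

```python
import matplotlib.pyplot as plt
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

# Small 8x8 digits dataset as a lightweight stand-in for MNIST
digits = load_digits()

# Linear projection onto the top two principal components
X_2d = PCA(n_components=2).fit_transform(digits.data)

# Color each projected point by its digit label
plt.scatter(X_2d[:, 0], X_2d[:, 1], c=digits.target, cmap='tab10', s=5)
plt.colorbar(label='digit')
plt.title('PCA projection of the digits dataset')
plt.show()
```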

t-SNE (t-Distributed Stochastic Neighbor Embedding)

  • Goal: find locations in low dimensions such that the distances between points are preserved

  • t-SNE allows non-linear transforms from the original data points to the new data points

(figure: t-SNE illustration)

Why different similarity measures?

  • Crowding problem: SNE uses the same Gaussian similarity measure in both the high- and low-dimensional spaces, so the distances between distant data points in the low-dimensional space end up not large enough; t-SNE fixes this with a heavy-tailed similarity in the low-dimensional space
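Concretely, t-SNE keeps a Gaussian similarity in the high-dimensional space but uses a Student-t distribution with one degree of freedom in the low-dimensional space. A quick sketch comparing the two unnormalized kernels shows why the heavy tail helps:

```python
import numpy as np

d = np.linspace(0.0, 5.0, 6)      # pairwise distances in the embedding

gaussian = np.exp(-d**2)          # SNE: same Gaussian kernel in both spaces
student_t = 1.0 / (1.0 + d**2)    # t-SNE: Student-t with 1 degree of freedom

# The Student-t tail decays much more slowly, so matching a small
# similarity forces moderately distant points to be placed far apart,
# relieving the crowding problem.
for dist, g, t in zip(d, gaussian, student_t):
    print(f"d = {dist:.0f}:  gaussian = {g:.6f}  student-t = {t:.6f}")
```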


t-SNE on MNIST

(figure: 2-D t-SNE embedding of MNIST digits)
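The matching sketch for this figure, again with the small digits dataset standing in for MNIST (perplexity roughly sets the neighborhood size each point considers; 30 is just a common default):

```python
import matplotlib.pyplot as plt
from sklearn.datasets import load_digits
from sklearn.manifold import TSNE

digits = load_digits()

# Non-linear embedding into 2-D; perplexity controls the effective
# number of neighbors each point considers
X_2d = TSNE(n_components=2, perplexity=30, init='pca').fit_transform(digits.data)

plt.scatter(X_2d[:, 0], X_2d[:, 1], c=digits.target, cmap='tab10', s=5)
plt.colorbar(label='digit')
plt.title('t-SNE embedding of the digits dataset')
plt.show()
```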

Summary

  • Both PCA and t-SNE project data points into a low-dimensional space

    • PCA allows only linear projection
    • t-SNE allows non-linear projection
  • Advantages of PCA

    • Interpretability
    • Can project new data points (see the sketch below)
  • Advantages of t-SNE

    • Visualization
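To illustrate the "can project new data points" point: a fitted PCA is a linear map that can be reused on unseen data, whereas scikit-learn's TSNE has no transform() for out-of-sample points. A small sketch with synthetic data:

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X_train = rng.normal(size=(100, 10))   # synthetic training data
X_new = rng.normal(size=(5, 10))       # unseen points

pca = PCA(n_components=2).fit(X_train)
# The learned projection matrix applies directly to new points
print(pca.transform(X_new).shape)      # (5, 2)

# By contrast, sklearn's TSNE only offers fit_transform(): the embedding
# is optimized for the points it was fit on and cannot map new ones.
```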

Clustering

Hierarchical clustering

  • Types of hierarchical clustering

    • Agglomerative (bottom-up)
      • Start with each data point as its own cluster
      • Merge the two closest clusters until only one cluster is left (see the sketch below)
    • Divisive (top-down)
      • Start with one cluster containing all the data
      • At each step, split a cluster until each cluster contains a single data point
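A minimal sketch of the bottom-up variant with SciPy on synthetic 2-D points (the Ward linkage used here is just one common choice of cluster distance):

```python
import numpy as np
import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, fcluster, dendrogram

rng = np.random.default_rng(0)
X = rng.normal(size=(20, 2))   # synthetic 2-D points

# Agglomerative: start from singleton clusters and repeatedly merge
# the two closest clusters; Z records the full merge tree
Z = linkage(X, method='ward')

# Cutting the tree at different levels gives different clusterings
labels = fcluster(Z, t=3, criterion='maxclust')
print(labels)

dendrogram(Z)                  # visualize the merge hierarchy
plt.show()
```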

Agglomerative example

(figure: agglomerative clustering example)

Different clustering results

  • Cutting the dendrogram at different heights yields different numbers of clusters

(figure: different clustering results)

Summary

  • Clustering
    • k-means and hierarchical clustering