
Unsupervised clustering of the CIFAR-10 dataset using autoencoder bottlenecks and k-means clustering.


theopfr/unsupervised-image-clustering-using-autoencoders


Unsupervised image classification by k-means clustering of the bottleneck-layer outputs of an autoencoder

| procedure:

Encode images from the CIFAR-10 dataset (only the plane, dog and truck classes) into autoencoder bottleneck vectors and try to group those vectors using k-means clustering.

| autoencoder:

(figure: autoencoder architecture)

| process:

The autoencoder was trained to reconstruct images from 3 classes of the CIFAR-10 dataset (dogs, planes and trucks). After training, the training set was passed through the model again in evaluation mode and the bottleneck-layer outputs were saved.
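The extraction step can be sketched as follows. This is a minimal PyTorch sketch, not the repository's code: the `ConvAutoencoder` layers, the bottleneck size of 64, the `extract_bottlenecks` helper and the random stand-in images are all assumptions made for illustration, and the training loop is omitted.

```python
import torch
from torch import nn


class ConvAutoencoder(nn.Module):
    """Hypothetical architecture; the repository's actual layers may differ."""

    def __init__(self, bottleneck_dim: int = 64):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, stride=2, padding=1),   # 32x32 -> 16x16
            nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, stride=2, padding=1),  # 16x16 -> 8x8
            nn.ReLU(),
            nn.Flatten(),
            nn.Linear(32 * 8 * 8, bottleneck_dim),                  # bottleneck vector
        )
        self.decoder = nn.Sequential(
            nn.Linear(bottleneck_dim, 32 * 8 * 8),
            nn.ReLU(),
            nn.Unflatten(1, (32, 8, 8)),
            nn.ConvTranspose2d(32, 16, kernel_size=4, stride=2, padding=1),  # 8x8 -> 16x16
            nn.ReLU(),
            nn.ConvTranspose2d(16, 3, kernel_size=4, stride=2, padding=1),   # 16x16 -> 32x32
            nn.Sigmoid(),
        )

    def forward(self, x):
        z = self.encoder(x)
        return self.decoder(z), z


def extract_bottlenecks(model, images, batch_size=256):
    """Run images through the model in evaluation mode and collect bottleneck vectors."""
    model.eval()
    codes = []
    with torch.no_grad():
        for start in range(0, images.shape[0], batch_size):
            _, z = model(images[start:start + batch_size])
            codes.append(z)
    return torch.cat(codes)


torch.manual_seed(0)
images = torch.rand(10, 3, 32, 32)  # random stand-in for CIFAR-10 images (assumption)
model = ConvAutoencoder(bottleneck_dim=64)
reconstruction, _ = model(images)   # same shape as the input images
bottlenecks = extract_bottlenecks(model, images, batch_size=4)
```

Here `bottlenecks` holds one 64-dimensional vector per image; these vectors, rather than the raw pixels, are what gets clustered.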

K-means clustering was applied to the bottleneck data points with K=3, since the number of classes was fixed at three.
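To illustrate the clustering step, here is a minimal NumPy sketch of k-means (Lloyd's algorithm) with K=3. It is not the implementation used in this repository; the `kmeans` function, its parameters and the synthetic 8-dimensional "bottleneck" vectors are assumptions for the example.

```python
import numpy as np


def assign(points, centroids):
    """Return the index of the nearest centroid for every point."""
    distances = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
    return distances.argmin(axis=1)


def kmeans(points, k, n_iters=100, seed=0):
    """Plain Lloyd's algorithm: alternate assignment and centroid update."""
    rng = np.random.default_rng(seed)
    # Initialise the centroids with k distinct data points.
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iters):
        assignments = assign(points, centroids)
        new_centroids = np.array([
            # Keep the old centroid if a cluster ends up empty.
            points[assignments == j].mean(axis=0) if np.any(assignments == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return assign(points, centroids), centroids


# Synthetic stand-in for saved bottleneck vectors: three blobs in 8 dimensions.
rng = np.random.default_rng(0)
centers = np.array([[0.0] * 8, [10.0] * 8, [-10.0] * 8])
bottlenecks = np.concatenate([c + rng.normal(size=(50, 8)) for c in centers])

assignments, centroids = kmeans(bottlenecks, k=3)
```

Each entry of `assignments` is a cluster index in {0, 1, 2}. The indices carry no class meaning by themselves, so a cluster only becomes a "plane" cluster once its members are inspected.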

| results:

Reconstructed images (figures): plane_reconstruction, dog_reconstruction, truck_reconstruction

To measure the performance of the clustering, you can calculate the entropy of the class labels within each cluster. Ideally every cluster contains just one class, so the better the clustering, the lower the entropy.
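As a sketch of this measure, the function below computes the Shannon entropy (in bits) of the true class labels inside one cluster. The name `cluster_entropy` and the toy label lists are assumptions for illustration; the repository may compute the entropy differently, for example with another logarithm base.

```python
import math
from collections import Counter


def cluster_entropy(labels):
    """Shannon entropy in bits of the class labels found in a single cluster."""
    counts = Counter(labels)
    total = len(labels)
    # Sum p * log2(1 / p) over the classes present in the cluster.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


pure_cluster = ["plane"] * 10                # only one class
mixed_cluster = ["dog"] * 5 + ["truck"] * 5  # two classes in equal parts

print(cluster_entropy(pure_cluster))   # 0.0
print(cluster_entropy(mixed_cluster))  # 1.0
```

With three classes the entropy ranges from 0 (a pure cluster) to log2(3), about 1.585 bits, for a cluster containing all three classes in equal proportion.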

example clusters:

Cluster images (figures): plane_cluster, dog_cluster, truck_cluster

  • the first image shows a cluster containing mainly planes (lower entropy)
  • the second image shows a cluster in which dogs are the most common class (higher entropy)
  • the third image shows a cluster in which trucks are the most common class (higher entropy)

After training several autoencoders and clustering their bottlenecks, it seemed that images were clustered mostly by their colors and less by the objects they show. This would explain why the "plane" cluster is more accurate: plane images often have a bright blue or white background.
