chunyang-w/mini-ViT

Mini ViT

This is a minimal Vision Transformer (ViT) implementation written from scratch for demonstration and educational purposes, with step-by-step code annotations.

image

(Figure from "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale" by Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, and Neil Houlsby)

🛒🛒🛒 Implementation:

🎆🎆🎆 Demonstration on the MNIST dataset:

image
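
The paper's title refers to treating fixed-size image patches as the "words" fed to a Transformer. As a rough illustration of that first step only, here is a minimal plain-Python sketch of splitting an MNIST-sized image into flattened, non-overlapping patches. The function name `split_into_patches`, the 7x7 patch size, and the synthetic 28x28 input are assumptions made for this example; they are not taken from this repository's code.

```python
def split_into_patches(image, patch_size):
    """Split a 2D image (a list of rows) into flattened, non-overlapping patches.

    Patches are returned in row-major order; each patch is a flat list of
    patch_size * patch_size pixel values.
    """
    height = len(image)
    width = len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image dimensions must be divisible by patch_size")

    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            # Flatten one patch_size x patch_size window into a single list.
            patch = [
                image[row][col]
                for row in range(top, top + patch_size)
                for col in range(left, left + patch_size)
            ]
            patches.append(patch)
    return patches


# Hypothetical input: a synthetic 28x28 "image" whose pixel value encodes its position.
image = [[row * 28 + col for col in range(28)] for row in range(28)]
patches = split_into_patches(image, patch_size=7)
print(len(patches), len(patches[0]))
```

In a full ViT, each flattened patch would then be linearly projected to an embedding and combined with a position embedding before entering the Transformer encoder; that part is omitted here.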
