Fine-Tune ViT for Image Classification with 🤗 Transformers

Nate Raw's avatar


Open In Colab

Just as transformers-based models have revolutionized NLP, we’re now seeing an explosion of papers applying them to all sorts of other domains. One of the most revolutionary of these was the Vision Transformer (ViT), which was introduced in June 2021 by a team of researchers at Google Brain.

This paper explored how

 

 

 

To finish reading, please visit source site