Open-sourcing Knowledge Distillation Code and Weights of SD-Small and SD-Tiny

Harish Prabhala's avatar
Yatharth Gupta's avatar

In recent times, the AI community has witnessed a remarkable surge in the development of larger and more performant language models, such as Falcon 40B, LLaMa-2 70B, Falcon 40B, MPT 30B, and in the

 

 

 

To finish reading, please visit source site