Open-sourcing Knowledge Distillation Code and Weights of SD-Small and SD-Tiny

In recent times, the AI community has witnessed a remarkable surge in the development of larger and more performant language models, such as Falcon 40B, LLaMa-2 70B, Falcon 40B, MPT 30B, and in the