MTEB: Massive Text Embedding Benchmark

MTEB is a massive benchmark for measuring the performance of text embedding models on diverse embedding tasks. The ๐Ÿฅ‡ leaderboard provides a holistic view of the best text embedding models out there on a variety of tasks. The ๐Ÿ“ paper gives background on the tasks and datasets in MTEB and analyzes leaderboard results! The ๐Ÿ’ป Github    

Read more

From PyTorch DDP to Accelerate to Trainer, mastery of distributed training with ease

This tutorial assumes you have a basic understanding of PyTorch and how to train a simple model. It will showcase training on multiple GPUs through a process called Distributed Data Parallelism (DDP) through three different levels of increasing abstraction: Native PyTorch DDP through the pytorch.distributed module Utilizing ๐Ÿค— Accelerate’s light wrapper around pytorch.distributed that also helps ensure the code can be run    

Read more

Evaluating Language Model Bias with ๐Ÿค— Evaluate

While the size and capabilities of large language models have drastically increased over the past couple of years, so too has the concern around biases imprinted into these models and their training data. In fact, many popular language models have been found to be biased against specific religions and genders, which can result in the promotion of discriminatory ideas and the perpetuation of harms against marginalized groups. To help the community explore these kinds of biases and strengthen our understanding […]

Read more

Training Stable Diffusion with Dreambooth using ๐Ÿงจ Diffusers

Dreambooth is a technique to teach new concepts to Stable Diffusion using a specialized form of fine-tuning. Some people have been using it with a few of their photos to place themselves in fantastic situations, while others are using it to incorporate new styles. ๐Ÿงจ Diffusers provides a Dreambooth training script. It doesn’t take long to train, but it’s hard to select the right set of hyperparameters and it’s easy to overfit. We conducted a lot of experiments to analyze […]

Read more
1 12 13 14 15 16 70