Large Language Models: A New Moore’s Law?
A few days ago, Microsoft and NVIDIA introduced Megatron-Turing NLG 530B, a Transformer-based model hailed as “the world’s largest and most powerful generative language model.” This is an impressive show of Machine Learning engineering, no doubt about it. Yet, should we be excited about this mega-model trend? I, for one, am not. Here’s why.
Read moreCourse Launch Community Event
We are excited to share that after a lot of work from the Hugging Face team, part 2 of the Hugging Face Course will be released on November 15th! Part 1 focused on teaching you how to use a pretrained model, fine-tune it on a text classification task then upload the result to the Model Hub. Part 2
Read moreScaling up BERT-like model Inference on modern CPU – Part 2
As we detailed in our previous blog post, Intel Xeon CPUs provide a set of
Read moreFine-tuning XLS-R for Multi-Lingual ASR with 🤗 Transformers
New (11/2021): This blog post has been updated to feature XLSR’s successor, called XLS-R. Wav2Vec2 is a pretrained model for Automatic Speech Recognition (ASR) and was released in September 2020 by Alexei Baevski, Michael Auli, and Alex Conneau. Soon after the superior performance of Wav2Vec2 was demonstrated on one
Read moreIntroducing the 🤗 Data Measurements Tool: an Interactive Tool for Looking at Datasets
tl;dr: We made a tool you can use online to build, measure, and compare datasets. Click to access the 🤗 Data Measurements Tool here. As developers of a fast-growing unified repository for Machine Learning datasets (Lhoest et al. 2021), the 🤗 Hugging Face team has been working on supporting good practices for dataset documentation (McMillan-Major et al., 2021). While static (if evolving) documentation represents a necessary first step in this direction, getting a good sense of what is actually in […]
Read moreGetting Started with Hugging Face Transformers for IPUs with Optimum
Transformer models have proven to be extremely efficient on a wide range of machine learning tasks, such as natural language processing, audio processing, and computer vision. However, the prediction speed of these large models can make them impractical for
Read moreIntroducing Snowball Fight ☃️, our First ML-Agents Environment
We’re excited to share our first custom Deep Reinforcement Learning environment: Snowball Fight 1vs1 🎉. Snowball Fight is a game made with Unity ML-Agents, where you shoot snowballs against a Deep Reinforcement Learning agent. The game is hosted on Hugging Face Spaces. 👉 You can play it online here In this post, we’ll
Read more