March 13, 2026 huggingface

Fine-Tune Wav2Vec2 for English ASR with 🤗 Transformers

Wav2Vec2 is a pretrained model for Automatic Speech Recognition (ASR) and was released in September 2020 by Alexei Baevski, Michael Auli, and Alex Conneau. Using a novel contrastive pretraining objective, Wav2Vec2 learns powerful speech representations from more than 50.000 hours of unlabeled speech. Similar, to BERT’s masked language

March 13, 2026 huggingface

My Journey to a serverless transformers pipeline on Google Cloud

March 13, 2026 huggingface

The Partnership: Amazon SageMaker and Hugging Face

Look at these smiles! Today, we announce a strategic partnership between Hugging Face and Amazon to make it easier for companies to leverage State of the Art Machine Learning models, and ship cutting-edge NLP features faster. Through this partnership, Hugging Face is leveraging Amazon Web Services as its Preferred Cloud Provider to deliver services to its

March 13, 2026 huggingface

Understanding BigBird’s Block Sparse Attention

Transformer-based models have shown to be very useful for many NLP tasks. However, a major limitation of transformers-based models is its O(n2)O(n^2)O(n2) time & memory complexity (where nn

March 13, 2026 huggingface

Distributed Training: Train BART/T5 for Summarization using 🤗 Transformers and Amazon SageMaker

March 13, 2026 huggingface

Introducing 🤗 Accelerate

Run your raw PyTorch training scripts on any kind of device. Most high-level libraries above PyTorch provide support for distributed training and mixed precision, but the abstraction they introduce require a user to learn a new API if they want to customize the underlying training loop. 🤗 Accelerate was created for PyTorch users who like to have full control over their training

March 13, 2026 huggingface

Scaling up BERT-like model Inference on modern CPU – Part 1

Back in October 2019, my colleague Lysandre Debut published a comprehensive (at the time) inference performance benchmarking blog (1). Since then, 🤗 transformers (2) welcomed a tremendous number of new architectures and thousands of new models were added to the 🤗 hub (3) which now counts more than 9,000 of them as of first quarter of 2021.

March 13, 2026 huggingface

Using & Mixing Hugging Face Models with Gradio 2.0

Cross-posted from the Gradio blog. The Hugging Face Model Hub has more than 10,000 machine learning models submitted by users. You’ll find all kinds of natural language processing models that, for example, translate between Finnish

March 13, 2026 huggingface

Few-shot learning in practice: GPT-Neo and the 🤗 Accelerated Inference API

In many Machine Learning applications, the amount of available labeled data is a barrier to producing a high-performing model. The latest developments in NLP show that you can overcome this limitation by providing a few examples at inference time with a large language model – a technique known as Few-Shot Learning. In this blog post, we’ll explain what Few-Shot Learning is, and

March 13, 2026 huggingface

Sentence Transformers in the Hugging Face Hub

Over the past few weeks, we’ve built collaborations with many Open Source frameworks in the machine learning ecosystem. One that gets us particularly excited is Sentence Transformers. Sentence Transformers is a framework for sentence, paragraph and image

« 1 … 7 8 9 10 11 … 78 »