Supercharged Searching on the Hugging Face Hub

The huggingface_hub library is a lightweight interface that provides a programmatic approach to exploring the hosting endpoints Hugging Face provides: models, datasets, and Spaces. Up until now, searching on the Hub through this interface was tricky to pull off, and there were many aspects of it a user had to “just know” and get    

Read more

Making automatic speech recognition work on large files with Wav2Vec2 in πŸ€— Transformers

Tl;dr: This post explains how to use the specificities of the Connectionist Temporal Classification (CTC) architecture in order to achieve very good quality automatic speech recognition (ASR) even on arbitrarily long files or during live inference. Wav2Vec2 is a popular pre-trained model for speech recognition. Released in September 2020 by Meta AI Research, the novel architecture catalyzed progress in    

Read more

Getting Started with Sentiment Analysis using Python

Sentiment analysis is the automated process of tagging data according to their sentiment, such as positive, negative and neutral. Sentiment analysis allows companies to analyze data at scale, detect insights and automate processes. In the past, sentiment analysis used to be limited to researchers, machine learning engineers or data scientists with experience in natural language processing. However, the AI community has built    

Read more

BERT 101 πŸ€— State Of The Art NLP Model Explained

BERT, short for Bidirectional Encoder Representations from Transformers, is a Machine Learning (ML) model for natural language processing. It was developed in 2018 by researchers at Google AI Language and serves as a swiss army knife solution to 11+ of the most common language tasks, such as sentiment analysis and named entity recognition. Language has historically been difficult for computers to β€˜understand’. Sure, computers can collect, store, and read text inputs but they lack basic language context. So, along came […]

Read more

Image search with πŸ€— datasets

πŸ€— datasets is a library that makes it easy to access and share datasets. It also makes it easy to process data efficiently — including working with data which doesn’t fit into memory. When datasets was first launched, it was associated mostly with text data. However, recently, datasets has added increased    

Read more

Accelerate BERT inference with Hugging Face Transformers and AWS Inferentia

notebook: sagemaker/18_inferentia_inference The adoption of BERT and Transformers continues to grow. Transformer-based models are now not only achieving state-of-the-art performance in Natural Language Processing but also for Computer Vision, Speech, and Time-Series. πŸ’¬Β πŸ–ΌΒ πŸŽ€Β β³ Companies are now slowly moving from the experimentation and research phase to the production phase in    

Read more
1 7 8 9 10 11 1,069