huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning

TL;DR: After five years of development, huggingface_hub has reached v1.0 – a milestone that marks the library’s maturity as the Python package powering 200,000 dependent libraries and providing core functionality for accessing over 2 million public models, 0.5 million public datasets, and 1 million public Spaces. This release introduces breaking changes designed to support the next decade of open machine learning, driven by a global community of almost 300 contributors and millions of users. 🚀 We highly recommend upgrading to […]


Streaming datasets: 100x More Efficient

We boosted load_dataset('dataset', streaming=True), which streams datasets without downloading them, with one line of code! Start training on multi-TB datasets immediately, with no complex setup, no downloading, no “disk out of space” errors, and no 429 “stop requesting!” errors. It’s super fast, outrunning our local SSDs when training on 64xH100 with 256 workers downloading data. We’ve improved streaming to deliver 100x fewer requests → 10× faster data resolution → 2x samples/sec → 0 worker crashes at 256 concurrent workers. Loading data, especially at the terabyte scale, […]
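As a rough illustration of the one-line streaming call mentioned above, here is a minimal sketch. It assumes the `datasets` package is installed and the Hub is reachable; the helper names `take_examples` and `stream_first_examples` are hypothetical, not part of the library, and the repo id in the usage note is a placeholder.

```python
from itertools import islice


def take_examples(stream, n):
    """Return the first n items of any iterable without consuming the rest."""
    return list(islice(stream, n))


def stream_first_examples(repo_id, split="train", n=3):
    """Stream a Hub dataset and return its first n examples.

    Hypothetical helper: assumes `datasets` is installed and the Hub is reachable.
    """
    # Imported lazily so take_examples stays usable without `datasets` installed.
    from datasets import load_dataset

    # streaming=True yields examples lazily instead of downloading the dataset first.
    ds = load_dataset(repo_id, split=split, streaming=True)
    return take_examples(ds, n)
```

Calling `stream_first_examples("user/dataset")` with a real repo id in place of the placeholder would fetch only the data needed for the first few examples.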


How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac for Healthcare

Simulation has long been a cornerstone of medical imaging, where it helps address the data gap. In healthcare robotics, however, it has often been too slow, siloed, or difficult to translate into real-world systems. That’s now changing. With new advances in GPU-accelerated simulation and digital twins, developers can design, test, and validate robotic workflows entirely in virtual environments – reducing prototyping time from months to days, […]
