Ideas: Language technologies for everyone with Kalika Bali

[MUSIC FADES]

I’m excited to be live in the booth today with Kalika Bali, a principal researcher at Microsoft Research India. Kalika is working on language technologies that she hopes will bring the benefits of generative AI to under-resourced and underserved language communities around the world. Kalika, it’s a pleasure to speak with you today. Welcome to Ideas!

KALIKA BALI: Thank you. Thank you, Gretchen. Thank you for having me.

HUIZINGA: So before we dive in on the big ideas […]

Pydantic: Simplifying Data Validation in Python

Pydantic’s primary way of defining data schemas is through models. A Pydantic model is an object, similar to a Python dataclass, that defines and stores data about an entity with annotated fields. Unlike dataclasses, Pydantic’s focus is centered around automatic data parsing, validation, and serialization. The best way to understand this is to create your own models, and that’s what you’ll do next.

Working With Pydantic BaseModels

Suppose you’re building an application used by a human resources department to manage […]
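
To make the contrast with dataclasses concrete, here is a minimal standard-library sketch. It does not use Pydantic itself: the `PlainEmployee` and `CheckedEmployee` classes and their fields are hypothetical, chosen to fit the human resources scenario above. The first class shows that a dataclass stores whatever it is given, and the second shows the parsing and validation you would have to write by hand, which is the kind of work a Pydantic model is designed to do automatically from the field annotations.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class PlainEmployee:
    # Annotations are not enforced: a dataclass stores whatever it receives.
    name: str
    salary: float
    date_of_birth: date


@dataclass
class CheckedEmployee:
    name: str
    salary: float
    date_of_birth: date

    def __post_init__(self) -> None:
        # Hand-written parsing: coerce incoming values to the annotated types.
        self.salary = float(self.salary)
        if isinstance(self.date_of_birth, str):
            self.date_of_birth = date.fromisoformat(self.date_of_birth)
        # Hand-written validation: reject values that make no sense.
        if self.salary < 0:
            raise ValueError("salary must be non-negative")


# The plain dataclass silently accepts a salary that is not a number.
plain = PlainEmployee(name="Ada", salary="not a number", date_of_birth="1990-01-01")

# The checked version converts strings to a float and a date, or raises.
checked = CheckedEmployee(name="Ada", salary="52000", date_of_birth="1990-01-01")
print(type(plain.salary).__name__, type(checked.salary).__name__)
```

Every new field in `CheckedEmployee` needs its own conversion and check in `__post_init__`, which is the repetitive part that a validation library aims to remove.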

Generating QR Codes With Python

From restaurant e-menus to airline boarding passes, QR codes have numerous applications that impact your day-to-day life and enrich the user’s experience. Wouldn’t it be great to make them look good, too? With the help of this video course, you’ll learn how to use Python to generate beautiful QR codes for your personal use case. In its most basic format, a QR code contains black squares and dots on a white background, with information that any smartphone or device with […]

A Technical Introduction to Stable Diffusion

The introduction of GPT-3, and particularly its chatbot form, ChatGPT, proved to be a monumental moment in the AI landscape, marking the onset of the generative AI (GenAI) revolution. Although earlier models existed in the image generation space, it was the GenAI wave that caught everyone’s attention. Stable Diffusion is a member of the GenAI family for image generation. It is known for being customizable, freely available to run on your own hardware, and actively improved. It […]

Highlights from Machine Translation and Multilinguality in March 2024

Did Translation Models Get More Robust Without Anyone Even Noticing?

Folks from Lisbon study how robust the newest MT systems are against source-side noise. Machine translation using large models, whether translation-specific ones such as NLLB or LLMs (such as Tower or GPT-3.5), is much more robust to both synthetic noise (whose nice feature is that you can check the translation quality at different noise levels) and real-world noisy data from social networks.

Tracing the Roots of Facts in […]
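
The point about synthetic noise in the first highlight above, that it lets you check quality at different noise levels, can be sketched in a few lines. This is only an illustration under assumptions: the `add_char_noise` function is hypothetical, and adjacent-letter swapping is one simple kind of typo noise picked for the example, not necessarily the noise used in the study. The `noise_level` parameter is the per-position probability of a swap, so the same source sentence can be corrupted to a controlled degree before it is sent to a translation system.

```python
import random


def add_char_noise(text: str, noise_level: float, seed: int = 0) -> str:
    """Swap adjacent letters with probability `noise_level` per position."""
    if not 0.0 <= noise_level <= 1.0:
        raise ValueError("noise_level must be in [0, 1]")
    rng = random.Random(seed)  # fixed seed keeps the corruption reproducible
    chars = list(text)
    i = 0
    while i < len(chars) - 1:
        both_letters = chars[i].isalpha() and chars[i + 1].isalpha()
        if both_letters and rng.random() < noise_level:
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
            i += 2  # skip the swapped pair so each letter moves at most once
        else:
            i += 1
    return "".join(chars)


source = "machine translation should survive a few typos"
for level in (0.0, 0.1, 0.3):
    # Each noisy variant would be translated and scored separately.
    print(level, add_char_noise(source, level))
```

Because only the noise level changes between runs, any drop in translation quality can be attributed to the amount of noise rather than to a different input sentence.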

Python News: What’s New From March 2024

While many people went hunting for Easter eggs, the Python community stayed active through March 2024. The free-threaded Python project reached a new milestone, and you can now experiment with disabling the GIL in your interpreter. The Python Software Foundation does a great job supporting the language with limited resources. They’ve now announced a new position that will support users of PyPI. NumPy is an old workhorse in the data science space. The library is getting a big facelift, and […]
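
If you want to see whether the interpreter you are running is a free-threaded build, a small check like the following may help. It is a sketch under assumptions rather than part of the news item: it relies on the `Py_GIL_DISABLED` build variable and the `sys._is_gil_enabled()` function that arrived with Python 3.13, falls back to "GIL enabled" on interpreters where they are missing, and the `gil_status` helper name is made up for this example.

```python
import sys
import sysconfig


def gil_status() -> str:
    """Describe whether this interpreter is running with the GIL."""
    # Py_GIL_DISABLED is 1 in free-threaded builds; it is 0 or None otherwise.
    free_threaded_build = bool(sysconfig.get_config_var("Py_GIL_DISABLED"))
    # sys._is_gil_enabled() exists from Python 3.13 on; assume the GIL is
    # enabled when the function is not available.
    gil_enabled = getattr(sys, "_is_gil_enabled", lambda: True)()
    if not free_threaded_build:
        return "standard build, GIL enabled"
    return "free-threaded build, GIL " + ("enabled" if gil_enabled else "disabled")


print(gil_status())
```

The two checks are kept separate because a free-threaded build can still run with the GIL turned back on at runtime.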

GGUF vs GGML: Understanding the Differences

Introduction

The year 2023 was a landmark year for the large language model (LLM) ecosystem, which saw a large number of proprietary and open-source models become accessible to the public at large. It also saw LLMs published in the GGML format for CPUs, making them accessible to people who lacked powerful GPUs. Soon after, in August 2023, another format called GGUF was released for CPU-based LLMs. With such rapid changes in the ecosystem, many people, especially beginners, are confused […]

Install and Execute Python Applications Using pipx

A straightforward way to distribute desktop and command-line applications written in Python is to publish them on the Python Package Index (PyPI), which hosts hundreds of thousands of third-party packages. Many of these packages include runnable scripts, but using them requires decent familiarity with the Python ecosystem. With pipx, you can safely install and execute such applications without affecting your global Python interpreter. To help you get to grips with pipx, you can download the supplemental materials, which include a […]

Research Focus: Week of April 1, 2024

Welcome to Research Focus, a series of blog posts that highlights notable publications, events, code/datasets, new hires and other milestones from across the research community at Microsoft.

NEW RESEARCH

In the same way that tools can help people complete tasks beyond their innate abilities, tools are essential for large language models (LLMs) to acquire up-to-date information and take consequential actions in external environments. Existing work on tool-augmented LLMs primarily focuses on the broad coverage of tools and the flexibility of […]

Brief Introduction to Diffusion Models for Image Generation

The advance of generative machine learning models makes computers capable of creative work. In the scope of drawing pictures, there are a few notable models that allow you to convert a textual description into an array of pixels. The most powerful models today are part of the family of diffusion models. In this post, you will learn how this kind of model works and how you can control its output. Let’s get started. Brief Introduction to Diffusion Models for Image […]
