Improve ChatGPT with Knowledge Graphs

ChatGPT has shown impressive capabilities in processing and generating human-like text. However, it is not without its imperfections. A primary concern is the model’s propensity to produce either inaccurate or obsolete answers, often called “hallucinations.” The New York Times recently highlighted this issue in their article, “Here’s What Happens When Your Lawyer Uses ChatGPT.” It presents a lawsuit where a lawyer leaned heavily on ChatGPT to assist in preparing a court filing for a client suing an airline. The model […]

Read more

4-bit LLM Quantization with GPTQ

Recent advancements in weight quantization allow us to run massive large language models on consumer hardware, like a LLaMA-30B model on an RTX 3090 GPU. This is possible thanks to novel 4-bit quantization techniques with minimal performance degradation, like GPTQ, GGML, and NF4. In the previous article, we introduced naïve 8-bit quantization techniques and the excellent LLM.int8(). In this article, we will explore the popular GPTQ algorithm to understand how it works and implement it using the AutoGPTQ library. You […]

Read more

A Beginner’s Guide to LLM Fine-Tuning

The growing interest in Large Language Models (LLMs) has led to a surge in tools and wrappers designed to streamline their training process. Popular options include FastChat from LMSYS (used to train Vicuna) and Hugging Face’s transformers/trl libraries (used in my previous article). In addition, each big LLM project, like WizardLM, tends to have its own training script, inspired by the original Alpaca implementation. In this article, we will use Axolotl, a tool created by the OpenAccess AI Collective. We […]

Read more

ExLlamaV2: The Fastest Library to Run LLMs

Quantizing Large Language Models (LLMs) is the most popular approach to reduce the size of these models and speed up inference. Among these techniques, GPTQ delivers amazing performance on GPUs. Compared to unquantized models, this method uses almost 3 times less VRAM while providing a similar level of accuracy and faster generation. It became so popular that it has recently been directly integrated into the transformers library. ExLlamaV2 is a library designed to squeeze even more performance out of GPTQ. […]

Read more

Future of OTT: Exploring AI’s Impact on Streaming

Video streaming platforms are embracing artificial intelligence (AI) tools to enhance content recommendations, tailoring the viewing experience for individual users and simplifying content discovery. Industry experts anticipate AI’s expansion into scripting, dubbing, and even allowing users to participate directly in content streaming. Manish Kalra, Chief Business Officer at ZEE5 India, highlights the active use of AI in content recommendation, personalization, cross-device compatibility, and audience analytics across over-the-top (OTT) platforms. AI’s impact extends to social media marketing through creative content and […]

Read more

Using Python for Data Analysis

Data analysis is a broad term that covers a wide range of techniques that enable you to reveal any insights and relationships that may exist within raw data. As you might expect, Python lends itself readily to data analysis. Once Python has analyzed your data, you can then use your findings to make good business decisions, improve procedures, and even make informed predictions based on what you’ve discovered. Before you start, you should familiarize yourself with Jupyter Notebook, a popular […]

Read more

IMF Analysis Warns: Artificial Intelligence to Impact 40% of Jobs

The International Monetary Fund (IMF) has released a new analysis predicting that artificial intelligence (AI) is poised to influence almost 40% of all jobs globally. Kristalina Georgieva, the Managing Director of the IMF, emphasizes that, in most scenarios, AI is likely to exacerbate overall inequality. Ms. Georgieva calls for proactive measures from policymakers to address this “troubling trend” and prevent technology from intensifying social tensions. According to the IMF report, advanced economies will experience a higher impact, with AI affecting […]

Read more
1 112 113 114 115 116 988