Learn how to Build and Deploy a Chatbot in Minutes using Rasa (IPL Case Study!)

Introduction Have you ever been stuck at work while a pulsating cricket match was going on? You need to meet a deadline but you just can’t concentrate because your favorite team is locked in a fierce battle for a playoff spot. Sounds familiar? I’ve been in this situation a lot in my professional career and checking my phone every 5 minutes was not really an option! Being a data scientist, I looked at this challenge from the lens of an […]

Read more

DataHack Radio #24: Exploring and Designing Chatbots with RASA’s Justina Petraitytė

Introduction Chatbots are the most common application of Natural Language Processing (NLP). Organizations are scrambling to integrate chatbots into their daily functions to enhance and personalize our experience. As a data science professional, I’m always curious about how these chatbots are built. Rasa is one such open source framework that we can leverage to build our own chatbots. So we are delighted to have Rasa’s data scientist and Head of Developer Relations, Justina Petraitytė, on our DataHack Radio podcast! Justina […]

Read more

Knowledge Graph – A Powerful Data Science Technique to Mine Information from Text (with Python code)

Overview Knowledge graphs are one of the most fascinating concepts in data science Learn how to build a knowledge graph to mine information from Wikipedia pages You will be working hands-on in Python to build a knowledge graph using the popular spaCy library   Introduction Lionel Messi needs no introduction. Even folks who don’t follow football have heard about the brilliance of one of the greatest players to have graced the sport. Here’s his Wikipedia page: Quite a lot of […]

Read more

Quick Introduction to Bag-of-Words (BoW) and TF-IDF for Creating Features from Text

The Challenge of Making Machines Understand Text “Language is a wonderful medium of communication” You and I would have understood that sentence in a fraction of a second. But machines simply cannot process text data in raw form. They need us to break down the text into a numerical format that’s easily readable by the machine (the idea behind Natural Language Processing!). This is where the concepts of Bag-of-Words (BoW) and TF-IDF come into play. Both BoW and TF-IDF are […]

Read more

Simple NLP in Python With TextBlob: Tokenization

Introduction The amount of textual data on the Internet has significantly increased in the past decades. There’s no doubt that the processing of this amount of information must be automated, and the TextBlob package is one of the fairly simple ways to perform NLP – Natural Language Processing. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, tokenization, sentiment analysis, classification, translation, and more. No special technical prerequisites […]

Read more

The Essential NLP Guide for data scientists (with codes for top 10 common NLP tasks)

Introduction Organizations today deal with huge amount and wide variety of data – calls from customers, their emails, tweets, data from mobile applications and what not. It takes a lot of effort and time to make this data useful. One of the core skills in extracting information from text data is Natural Language Processing (NLP). Natural Language Processing (NLP) is the art and science which helps us extract information from text and use it in our computations and algorithms. Given […]

Read more

Predicting Movie Genres using NLP – An Awesome Introduction to Multi-Label Classification

Introduction I was intrigued going through this amazing article on building a multi-label image classification model last week. The data scientist in me started exploring possibilities of transforming this idea into a Natural Language Processing (NLP) problem. That article showcases computer vision techniques to predict a movie’s genre. So I had to find a way to convert that problem statement into text-based data. Now, most NLP tutorials look at solving single-label classification challenges (when there’s only one label per observation). […]

Read more

7 Amazing NLP Hack Sessions to Watch out for at DataHack Summit 2019

Picture a world where: Machines are able to have human-level conversations with us Computers understand the context of the conversation without having to be told what the subject is These machines can even write full-blown essays after being given the theme of the topic This isn’t a movie script or a futuristic scenario – this is all happening right now thanks to the power of Natural Language Processing (NLP)! Here’s the incredible rise charted by Google Trends in the last […]

Read more

Build Your First Text Classification model using PyTorch

Overview Learn how to perform text classification using PyTorch Grasp the importance of Pack Padding feature Understand the key points involved while solving text classification Introduction I always turn to State of the Art architectures to make my first submission in data science hackathons. Implementing the State of the Art architectures has become quite easy thanks to deep learning frameworks such as PyTorch, Keras, and TensorFlow. These frameworks provide an easy way to implement complex model architectures and algorithms with […]

Read more

Translating Strings in Python with TextBlob

Introduction Text translation is a difficult computer problem that gets better and easier to solve every year. Big companies like Google are actively working on improving their text translation services which enables the rest of us to use them freely. Apart from their great personal use, these services can be used by developers through various APIs. This article is about TextBlob which uses one such API to perform text translation. What is TextBlob? TextBlob is a text-processing library written in […]

Read more
1 10 11 12 13 14 15