An Exhaustive Guide to Detecting and Fighting Neural Fake News using NLP

Overview Neural fake news (fake news generated by AI) can be a huge issue for our society This article discusses different Natural Language Processing methods to develop robust defense against Neural Fake News, including using the GPT-2 detector model and Grover (AllenNLP) Every data science professional should be aware of what neural fake news is and how to combat it   Introduction Fake news is a major concern in our society right now. It has gone hand-in-hand with the rise […]

Read more

What is Tokenization in NLP? Here’s All You Need To Know

Highlights Tokenization is a key (and mandatory) aspect of working with text data We’ll discuss the various nuances of tokenization, including how to handle Out-of-Vocabulary words (OOV)   Introduction Language is a thing of beauty. But mastering a new language from scratch is quite a daunting prospect. If you’ve ever picked up a language that wasn’t your mother tongue, you’ll relate to this! There are so many layers to peel off and syntaxes to consider – it’s quite a challenge. […]

Read more

Who is the world cheering for? 2014 FIFA WC winner predicted using Twitter feed (in R)

Sports are filled with emotions! Cheering of audience, reactions to events on various media channels are some of the factors, which make a huge impact on the mind of the players. If people support you, your chances to win are greatly enhanced. Live example of this fact, are the statistics of Indian cricket team playing in India and abroad. The win rate of Indian cricket team in India is approximately twice the win rate abroad. Football is again a game driven largely by emotions. […]

Read more

Kaggle Solution: What’s Cooking ? (Text Mining Competition)

Introduction Tutorial on Text Mining, XGBoost and Ensemble Modeling in R I came across What’s Cooking competition on Kaggle last week. At first, I was intrigued by its name. I checked it and realized that this competition is about to finish. My bad! It was a text mining competition.  This competition went live for 103 days and ended on 20th December 2015. Still, I decided to test my skills. I downloaded the data set, built a model and managed to get a score of […]

Read more

Introduction to Computational Linguistics and Dependency Trees in data science

Introduction In recent years, the amalgam of deep learning fundamentals with Natural Language Processing techniques has shown a great improvement in the information mining tasks on unstructured text data. The models are now able to recognize natural language and speech comparable to human levels. Despite such improvements, discrepancies in the results still exist as sometimes the information is coded very deep in the syntaxes and syntactic structures of the corpus. Example – Problem with Neural Networks For example, a conversation […]

Read more

An Introduction to Text Summarization using the TextRank Algorithm (with Python implementation)

Introduction Text Summarization is one of those applications of Natural Language Processing (NLP) which is bound to have a huge impact on our lives. With growing digital media and ever growing publishing – who has the time to go through entire articles / documents / books to decide whether they are useful or not? Thankfully – this technology is already here. Have you come across the mobile app inshorts? It’s an innovative news app that converts news articles into a […]

Read more

Comprehensive Guide to Text Summarization using Deep Learning in Python

Introduction “I don’t want a full report, just give me a summary of the results”. I have often found myself in this situation – both in college as well as my professional life. We prepare a comprehensive report and the teacher/supervisor only has time to read the summary. Sounds familiar? Well, I decided to do something about it. Manually converting the report to a summarized version is too time taking, right? Could I lean on Natural Language Processing (NLP) techniques […]

Read more

A Comprehensive Guide to Build your own Language Model in Python!

Overview Language models are a crucial component in the Natural Language Processing (NLP) journey These language models power all the popular NLP applications we are familiar with – Google Assistant, Siri, Amazon’s Alexa, etc. We will go from basic language models to advanced ones in Python here   Introduction We tend to look through language and not realize how much power language has. Language is such a powerful medium of communication. We have the ability to build projects from scratch […]

Read more

Create Natural Language Processing based Apps for iOS in Minutes! (using Apple’s Core ML 3)

Overview Intrigued by Apple’s iOS apps? Learn how to build Natural Language Processing (NLP) iOS apps in this article We’ll be using Apple’s Core ML 3 to build these NLP iOS apps This is a hands-on step by step tutorial with code   Introduction I love working in the Natural Language Processing (NLP) space. The last couple of years have been a goldmine for me – the level and quality of developments have been breathtaking. But this comes with its […]

Read more

Build a Natural Language Generation (NLG) System using PyTorch

Overview Introduction to Natural Language Generation (NLG) and related things- Data Preparation Training Neural Language Models Build a Natural Language Generation System using PyTorch Introduction In the last few years, Natural language processing (NLP) has seen quite a significant growth thanks to advancements in deep learning algorithms and the availability of sufficient computational power. However, feed-forward neural networks are not considered optimal for modeling a language or text. This is because the feed-forward network does not take into consideration the […]

Read more
1 16 17 18 19 20 22