How to Perform Basic Text Analysis without Training Dataset

This article was published as a part of the Data Science Blogathon Overview This article will give you a basic understanding of how text analysis works. Learn the various steps of the NLP pipeline Derivation of the overall sentiment of the text. Dashboard depicting the general statistics and sentiment analysis of the text. Abstract In this modern digital era, a large amount of information is generated per second. Most of the data humans generate through WhatsApp messages, tweets, blogs, news articles, […]

Read more

Text Mining Simplified – IPL 2020 Tweet Analysis with R

This article was published as a part of the Data Science Blogathon. Introduction Text mining utilizes different AI technologies to automatically process data and generate valuable insights, enabling companies to make data-driven decisions. Text mining identifies facts, relationships, and assertions that would otherwise remain buried in the mass of textual big data. Once extracted, this information is converted into a structured form that can be further analyzed, or presented directly using clustered HTML tables, mind maps, charts, etc. Advantages of […]

Read more

6 Practices to enhance the performance of a Text Classification Model

Introduction A few months back, I was working on creating a sentiment classifier for Twitter data. After trying the common approaches, I was still struggling to get good accuracy on the results. Text classification problems and algorithms have been around for a while now. They are widely used for Email Spam Filtering by the likes of Google and Yahoo, for conducting sentiment analysis of twitter data and automatic news categorization in google alerts. However, while dealing with enormous amount of text […]

Read more

spaCy Tutorial to Learn and Master Natural Language Processing (NLP)

Introduction spaCy is my go-to library for Natural Language Processing (NLP) tasks. I’d venture to say that’s the case for the majority of NLP experts out there! Among the plethora of NLP libraries these days, spaCy really does stand out on its own. If you’ve used spaCy for NLP, you’ll know exactly what I’m talking about. And if you’re new to the power of spaCy, you’re about to be enthralled by how multi-functional and flexible this library is. The factors […]

Read more

How to Use Texthero to Prepare a Text-based Dataset for Your NLP Project

Introduction Natural Language Processing (NLP) is one of the most important fields of study and research in today’s world. It has many applications in the business sector such as chatbots, sentiment analysis, and document classification. Preprocessing and representing text is one of the trickiest and most annoying parts of working on an NLP project. Text-based datasets can be incredibly thorny and difficult to preprocess. But fortunately, the latest Python package called Texthero can help you solve these challenges. What is […]

Read more