NLP: Answer Retrieval from Document using Python

This article was published as a part of the Data Science Blogathon Introduction → This article focuses on answer retrieval from a document by using similarity and difference metrics. This task falls under Natural Language Processing which is a subset of Deep Learning. In this article we will be understanding the concept of general similarity algorithms and how can they be applied to complete our task. The article will be based on python for the coding part. How to Approach → To […]

Read more

Topic extraction From Prime Minister Modi’s Speech

This article was published as a part of the Data Science Blogathon INTRODUCTION Artificial Intelligence (AI) has been a trendy term among individuals for many years. Earlier, when we used to hear the term “AI”, we could only think about Robots. However AI is not limited to robots, and nowadays, every electronic device we use has AI associated with it, be it smartphones, smart TVs, refrigerators, or Air conditioners. AI basically means a machine can take its decision without human intervention. […]

Read more

Part 6: Step by Step Guide to Master NLP – Word2Vec

This article was published as a part of the Data Science Blogathon Introduction This article is part of an ongoing blog series on Natural Language Processing (NLP). In the previous article of this series, we completed the statistical or frequency-based word embedding techniques, which are pre-word embedding era techniques. So, in this article, we will discuss the recent word-era embedding techniques. NOTE: In recent word-era embedding, there are many such techniques but in this article, we will discuss only the Word2Vec […]

Read more

Topic modeling With Naive Bayes Classifier

This article was published as a part of the Data Science Blogathon Introduction Naive Bayes is a powerful tool that leverages Bayes’ Theorem to understand and mimic complex data structures. In recent years, it has commonly been used for Natural Language Processing (NLP) tasks, such as text categorization. Today, we will be constructing a Naive Bayes text classifier for topic categorization. Before we move forward with the explanation, I want to emphasize that Naive Bayes is not the traditional method of […]

Read more

Part 5: Step by Step Guide to Master NLP – Word Embedding and Text Vectorization

This article was published as a part of the Data Science Blogathon Introduction This article is part of an ongoing blog series on Natural Language Processing (NLP). Up to the previous part of this article series, we almost completed the necessary steps involved in text cleaning and normalization pre-processing. After that, we will convert the processed text to numeric feature vectors so that we can feed it to computers for Machine Learning applications. NOTE: Some concepts included in the pipeline of […]

Read more

Beginner Projects to Learn Natural Language Processing using Python !

This article was published as a part of the Data Science Blogathon Machines understanding language fascinates me, and that I often ponder which algorithms Aristotle would have accustomed build a rhetorical analysis machine if he had the possibility. If you’re new to Data Science, getting into NLP can seem complicated, especially since there are many recent advancements within the field. it’s hard to grasp where to begin. Table of Contents 1.What can Machines Understand? 2.Project 1:Word Cloud 3.Project 2:Spam Detection 4.Project […]

Read more

Beginner’s Guide To Natural Language Processing Using SpaCy

This article was published as a part of the Data Science Blogathon Pre-requisites Basic Knowledge of Natural Language Processing Hands-on practice of Python Introduction As we know data has some kind of meaning in its position. For every moment, mostly text data is getting generated in different formats like SMS, reviews, Emails, and so on. The main purpose of this article is to understand the basic idea of NLP using the library- SpaCy. So let’s go ahead. In this article, we […]

Read more

Topic Modeling and Latent Dirichlet Allocation(LDA) using Gensim and Sklearn : Part 1

This article was published as a part of the Data Science Blogathon Introduction Let’s say you have a client who has a publishing house. Your client comes to you with two tasks: one he wants to categorize all the books or the research papers he receives weekly on a common theme or a topic and the other task is to encapsulate large documents into smaller bite-sized texts. Is there any technique and tool available that can do both of these two […]

Read more

Part 8: Step by Step Guide to Master NLP – Useful Natural Language Processing Tasks

This article was published as a part of the Data Science Blogathon Introduction This article is part of an ongoing blog series on Natural Language Processing (NLP). Up to part-7 of this series, we completed the most useful concepts in NLP. While going away in this series, let’s first discuss some of the useful tasks of NLP so that you have much clarity about what you can do by learning the NLP. After this part, we will start our discussion on […]

Read more

Part 4: Step by Step Guide to Master NLP – Text Cleaning Techniques

This article was published as a part of the Data Science Blogathon Introduction This article is part of an ongoing blog series on Natural Language Processing (NLP). In the previous part of this blog series, we complete the initial steps involved in text cleaning and preprocessing that are related to NLP. Now, in continuation of that part, in this article, we will cover the next techniques involved in the NLP pipeline of Text preprocessing. In this article, we will first discuss […]

Read more
1 2 3 4 7