Indexing in Natural Language Processing for Information Retrieval

This article was published as a part of the Data Science Blogathon. Overview: This blog covers GREP (Global Regular Expression Print) and its drawbacks, then moves on to the Document Term Matrix and Inverted Matrix, and finally ends with dynamic and distributed indexing. (Image source: https://javarevisited.blogspot.com/2011/06/10-examples-of-grep-command-in-unix-and.html#axzz6zwakOXgt) Global Regular Expression Print: Whenever we are dealing with a small amount of data, we can use the grep command very efficiently. It allows us to search one or more files for lines that contain a pattern. For […]
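To make the contrast in this excerpt concrete, here is a minimal Python sketch, not taken from the article itself. It compares a grep-style scan, which reads every document on every query, with a precomputed term-to-documents mapping (commonly called an inverted index), which answers a term lookup directly. The toy corpus and the function names `grep_like` and `build_index` are assumptions made up for illustration.

```python
import re
from collections import defaultdict

# Hypothetical toy corpus (invented for this sketch): document id -> text.
docs = {
    1: "grep searches files for lines that contain a pattern",
    2: "an index maps each term to the documents that contain it",
    3: "distributed indexing splits the index across machines",
}


def grep_like(pattern, documents):
    """Scan every document for a regex match, the way grep scans lines."""
    regex = re.compile(pattern)
    return [doc_id for doc_id, text in documents.items() if regex.search(text)]


def build_index(documents):
    """Map each lowercase term to the sorted ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for term in re.findall(r"\w+", text.lower()):
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}


index = build_index(docs)
print(grep_like(r"contain", docs))  # full scan on every query
print(index.get("index", []))       # single dictionary lookup
```

The scan costs time proportional to the whole corpus on each query, which is presumably the drawback the article has in mind, while the index pays that cost once at build time.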


NLTK: A Beginners Hands-on Guide to Natural Language Processing

This article was published as a part of the Data Science Blogathon. Introduction: NLTK is a toolkit built for working with NLP in Python. It provides various text processing libraries along with many test datasets. A variety of tasks can be performed using NLTK, such as tokenizing and parse tree visualization. In this article, we will go through how we can set up NLTK on our system and use it for performing various NLP tasks during the text processing […]


Let’s Understand How does a chatbot work ?

Introduction: An Artificial Intelligence chatbot is a technology that makes interaction between humans and machines in natural language possible. Chatbots act like a typical search engine but with enhanced features. Applications of Artificial Intelligence chatbots are spread over various domains, including eCommerce, healthcare, education, travel, automation, finance, hospitality, and insurance. Chatbots are domain-specific and do what they are intended for. The applications in their domain include answering customer queries, booking services like flights, movie tickets, […]


FuzzyWuzzy Python Library: Interesting Tool for NLP and Text Analytics

This article was published as a part of the Data Science Blogathon. Introduction: There are many ways to compare text in Python, but we often look for an easy one. Comparing text is needed for various text analytics and Natural Language Processing purposes. One of the easiest ways of comparing text in Python is the FuzzyWuzzy library, which returns a score out of 100 based on the similarity of the strings. Basically, we are given the similarity […]
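As a rough illustration of a similarity score out of 100, the sketch below uses Python's standard-library `difflib.SequenceMatcher` rather than FuzzyWuzzy itself, so it runs without installing anything. It is a stand-in for the idea, not the library's API; the helper name `similarity_score` and the sample strings are assumptions made up for this example.

```python
from difflib import SequenceMatcher


def similarity_score(a, b):
    """Return an integer from 0 to 100; 100 means the strings are identical."""
    # SequenceMatcher.ratio() returns a float in [0, 1].
    return round(100 * SequenceMatcher(None, a, b).ratio())


print(similarity_score("new york mets", "new york mets"))   # identical strings
print(similarity_score("new york mets", "new york meats"))  # one extra letter
```

Identical strings score 100, and small edits lower the score only slightly, which is what makes this kind of measure useful for matching near-duplicate names.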


Text Analysis with Spacy to Master NLP techniques

This article was published as a part of the Data Science Blogathon. Natural Language Processing (NLP) is a branch of Artificial Intelligence that deals with everyday language. Have you ever wondered how Alexa, Siri, and Google Assistant understand our voice and respond to us? Human language is fuzzy and complex. When these assistants receive text input, the text is first preprocessed, and many embedded techniques let them understand grammar. In this tutorial, we will study some techniques which are helpful […]


Part 7: Step by Step Guide to Master NLP – Word Embedding in Detail

This article was published as a part of the Data Science Blogathon. Introduction: This article is part of an ongoing blog series on Natural Language Processing (NLP). In the previous articles (parts 5 and 6), we covered different text vectorization and word embedding techniques in detail. In this article, we will first discuss the co-occurrence matrix, which is also a word vectorization technique, and then discuss new concepts related to word embeddings, including: Applications of […]
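For readers unfamiliar with the co-occurrence matrix mentioned here, the following is a minimal sketch under stated assumptions: whitespace tokenization, a symmetric context window of one word, and a two-sentence toy corpus invented for illustration. The function name `cooccurrence_counts` is hypothetical, and the article may build its matrix differently.

```python
from collections import Counter


def cooccurrence_counts(sentences, window=1):
    """Count how often two words appear within `window` positions of each other."""
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, word in enumerate(tokens):
            # Visit right-hand neighbours only, recording both orders
            # so the resulting matrix is symmetric.
            for j in range(i + 1, min(i + 1 + window, len(tokens))):
                counts[(word, tokens[j])] += 1
                counts[(tokens[j], word)] += 1
    return counts


# Hypothetical toy corpus, invented for this sketch.
corpus = ["I like NLP", "I like deep learning"]
counts = cooccurrence_counts(corpus, window=1)

# Lay the counts out as a square matrix over the sorted vocabulary.
vocab = sorted({word for pair in counts for word in pair})
matrix = [[counts[(row, col)] for col in vocab] for row in vocab]

print(vocab)
for row_word, row in zip(vocab, matrix):
    print(row_word, row)
```

Each row of the matrix can then serve as a simple vector for its word, which is the sense in which a co-occurrence matrix is a word vectorization technique.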


Word Sense Disambiguation: Importance in Natural Language Processing

This article was published as a part of the Data Science Blogathon. Introduction: In human language, a word is often used in more than one way. Understanding the various usage patterns in a language is important for various Natural Language Processing applications. (Image: https://www.pexels.com/photo/book-eyeglasses-eyewear-page-261857/) In different usage situations, the same word can mean different things. As the vast majority of the information online is in English, for the sake of simplicity, let us deal with examples in the English language only. […]


Practical Guide to Word Embedding System

This article was published as a part of the Data Science Blogathon. Pre-requisites: basic knowledge of Python; understanding of the basics of NLP (Natural Language Processing). Introduction: In natural language processing, word embedding represents words for text analysis as vectors that encode the meaning of each word, such that words which are closer in the vector space are expected to be similar in meaning. Consider, boy-men vs […]
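One common way to measure how close two words are in a vector space is cosine similarity. The sketch below assumes that measure and uses three-dimensional vectors invented purely for illustration; real embeddings are learned from data and have far more dimensions, and the words chosen here are not taken from the article.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Hypothetical vectors, made up for this sketch (not real embeddings).
embeddings = {
    "boy": [0.9, 0.1, 0.3],
    "man": [0.8, 0.2, 0.4],
    "car": [0.1, 0.9, 0.2],
}

print(cosine_similarity(embeddings["boy"], embeddings["man"]))
print(cosine_similarity(embeddings["boy"], embeddings["car"]))
```

With these made-up values the first pair scores higher than the second, which mirrors the expectation that words with related meanings sit closer together.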


Part 3: Topic Modeling and Latent Dirichlet Allocation (LDA) using Gensim and Sklearn

This article was published as a part of the Data Science Blogathon. Overview: In the previous two installments, we covered in detail the common text terms in Natural Language Processing (NLP), what topics are, what topic modeling is, why it is required, its uses, and the types of models, and delved deep into one of the important techniques, Latent Dirichlet Allocation (LDA). In this last leg of the Topic Modeling and LDA series, we shall see how to extract topics through […]


Understanding Natural Language Processing -A Beginner’s Guide

This article was published as a part of the Data Science Blogathon. Introduction: (Source: https://www.asksid.ai/blog/what-is-natural-language-processing/) Language is very important when we want to communicate with each other. Through language, every human can talk, tell others what they need, and listen. These languages can be anything, such as English, Spanish, Hindi, or Malayalam. We can express our ideas to others in this medium. Language is one of the critical components of human intelligence. Every day we interact with humans, but how about […]
