Identifying The Language of A Document Using NLP!

This article was published as a part of the Data Science Blogathon Introduction The goal of this article is to identify the language from the written text. The text in documents is available in many languages and when we don’t know the language it becomes very difficult sometimes to tell this to google translator as well. For most translators, we have to tell both the input language and the desired language. If you had a text written in Spanish and you […]

Read more

Training BERT Text Classifier on Tensor Processing Unit (TPU)

Training hugging face most famous model on TPU for social media Tunisian Arabizi sentiment analysis.   Introduction The Arabic speakers usually express themself in local dialect on social media, so Tunisians use Tunisian Arabizi which consists of Arabic written in form of Latin alphabets. The sentiment analysis relies on cultural knowledge and word sense with contextual information. We will be using both Arabizi dialect and sentimental analysis to solve the problem in this project. The competition is hosted on Zindi which […]

Read more

Spam Detection – An application of Deep Learning

This article was published as a part of the Data Science Blogathon What each big tech company wants is the Security and Safety of its customers. By detecting spam alerts in emails and messages, they want to secure their network and enhance the trust of their customers. The official messaging app of Apple and the official chatting app of Google i.e Gmail is unbeatable examples of such applications where the process of spam detection and filtering works well to protect users […]

Read more

Getting Started with Natural Language Processing using Python

This article was published as a part of the Data Science Blogathon Why NLP? Natural Language Processing has always been a key tenet of Artificial Intelligence (AI). With the increase in the adoption of AI, systems to automate sophisticated tasks are being built. Some of these examples are described below. Diagnosing rare form of cancer –  At the University of Tokyo’s Institute of Medical Science, doctors used artificial intelligence to successfully diagnose a rare type of leukemia. The doctors used an AI […]

Read more

Detecting Fake News with Natural Language Processing

This article was published as a part of the Data Science Blogathon 1. Introduction We consume news through several mediums throughout the day in our daily routine, but sometimes it becomes difficult to decide which one is fake and which one is authentic. Do you trust all the news you consume from online media? Every news that we consume is not real. If you listen to fake news it means you are collecting the wrong information from the world which can […]

Read more

Indexing in Natural Language Processing for Information Retrieval

This article was published as a part of the Data Science Blogathon Overview This blog covers GREP(Global-Regular-Expression-Print) and its drawbacks Then we move on to Document Term Matrix and Inverted Matrix Finally, we end with dynamic and distributed indexing image source-https://javarevisited.blogspot.com/2011/06/10-examples-of-grep-command-in-unix-and.html#axzz6zwakOXgt     Global Regular Expression Print Whenever we are dealing with a small amount of data, we can use the grep command very efficiently. It allows us to search one or more files for lines that contain a pattern. For […]

Read more

FuzzyWuzzy Python Library: Interesting Tool for NLP and Text Analytics

This article was published as a part of the Data Science Blogathon Introduction There are many ways to compare text in python. But, often we search for an easy way to compare text. Comparing text is needed for various text analytics and Natural Language Processing purposes. One of the easiest ways of comparing text in python is using the fuzzy-wuzzy library. Here, we get a score out of 100, based on the similarity of the strings. Basically, we are given the similarity […]

Read more

Part 3: Topic Modeling and Latent Dirichlet Allocation (LDA) using Gensim and Sklearn

This article was published as a part of the Data Science Blogathon Overview In the previous two installments, we had understood in detail the common text terms in Natural Language Processing (NLP), what are topics, what is topic modeling, why it is required, its uses, types of models and dwelled deep into one of the important techniques called Latent Dirichlet Allocation (LDA). In this last leg of the Topic Modeling and LDA series, we shall see how to extract topics through […]

Read more

How to Perform Basic Text Analysis without Training Dataset

This article was published as a part of the Data Science Blogathon Overview This article will give you a basic understanding of how text analysis works. Learn the various steps of the NLP pipeline Derivation of the overall sentiment of the text. Dashboard depicting the general statistics and sentiment analysis of the text. Abstract In this modern digital era, a large amount of information is generated per second. Most of the data humans generate through WhatsApp messages, tweets, blogs, news articles, […]

Read more

Learn to Develop Simple Chatbots using Python and Deep Learning!

This article was published as a part of the Data Science Blogathon Introduction A Chatbot is an application(software) that is used to manage an online chat conversation through text or text to speech format. Most of the chatbots are accessed online through various websites or assistances(virtual) with a popup. Examples:- E-commerce websites, health, news, etc. Image source: https://www.syncfusion.com/blogs/wp-content/uploads/2020/01/tile.jpg   Agenda of this article: 1) Data and Libraries 2) Initialize Training of Chatbot 3) Build the Deep Learning Model 4) Build GUI of […]

Read more
1 2 3 4 10