Must Known Techniques for text preprocessing in NLP

This article was published as a part of the Data Science Blogathon

In any Machine learning task, cleaning or preprocessing the data is as important as model building. Text data is one of the most unstructured forms of available data and when comes to deal with Human language then it’s too complex. Have you ever wondered how Alexa, Siri, Google assistant can understand, process, and respond in Human language. NLP is a technology that works behind it where before any response lots of text preprocessing takes place. This tutorial will study the main text preprocessing techniques that you must know to work with any text data.

text preprocessing in NLP

Table of Contents

  • Overview on NLP
  • Text Preprocessing
  • Libraries used to deal with NLP Problems
  • Text Preprocessing Techniques
    • Expand Contractions
    • Lower Case
    • Remove Punctuations
    • Remove words and digits containing digits
    • Remove Stopwords
    • Rephrase Text
    • Stemming and Lemmatization
    • Remove White spaces
  • EndNote

Introduction to NLP

Natural Language Processing

 

 

 

To finish reading, please visit source site