DataHack Radio #23: Ines Montani and Matthew Honnibal – The Brains behind spaCy

https://soundcloud.com/datahack-radio/ines-montani-matthew-honnibal-the-brains-behind-spacy

Introduction: What would you do if you had the chance to pick the brains behind one of the most popular Natural Language Processing (NLP) libraries of our era? A library that has helped usher in the current boom in NLP applications and nurtured tons of NLP scientists? Well – you invite the creators on our popular DataHack Radio podcast and let them do the talking! We are delighted to welcome Ines Montani and Matt Honnibal, the developers of spaCy […]

Building a Recommendation System using Word2vec: A Unique Tutorial with Case Study in Python

Overview: Recommendation engines are ubiquitous nowadays, and data scientists are expected to know how to build one. Word2vec is an ultra-popular word embedding technique used for a variety of NLP tasks. We will use word2vec to build our own recommendation system. Curious how NLP and recommendation engines combine? Let’s find out!

Introduction: Be honest – how many times have you used the ‘Recommended for you’ section on Amazon? Ever since I found out a few years back that machine […]

A Comprehensive Guide to Attention Mechanism in Deep Learning for Everyone

Overview: The attention mechanism has changed the way we work with deep learning algorithms. Fields like Natural Language Processing (NLP) and even Computer Vision have been revolutionized by the attention mechanism. We will learn how this attention mechanism works in deep learning, and even implement it in Python.

Introduction: “Every once in a while, a revolutionary product comes along that changes everything.” – Steve Jobs. What does one of the most famous quotes of the 21st century have to do with […]

DataHack Radio #12: Exploring the Nuts and Bolts of Natural Language Processing with Sebastian Ruder

https://soundcloud.com/datahack-radio/episode-12-sebastian-ruder

Introduction: There’s text everywhere around us, from digital sources like social media to physical objects like books and print media. The amount of text data being generated every day is mind-boggling, and yet we’re not even close to harnessing the full power of natural language processing. I see a ton of aspiring data scientists interested in this field, but they often turn away, daunted by the challenges NLP presents. It’s such a niche line of work, and we […]

A Must-Read NLP Tutorial on Neural Machine Translation – The Technique Powering Google Translate

Introduction: “If you talk to a man in a language he understands, that goes to his head. If you talk to him in his own language, that goes to his heart.” – Nelson Mandela. The beauty of language transcends boundaries and cultures. Learning a language other than our mother tongue is a huge advantage. But the path to bilingualism, or multilingualism, can often be a long, never-ending one. There are so many little nuances that we get lost in the […]

OpenAI’s GPT-2: A Simple Guide to Build the World’s Most Advanced Text Generator in Python

Overview: Learn how to build your own text generator in Python using OpenAI’s GPT-2 framework. GPT-2 is a state-of-the-art NLP framework – a truly incredible breakthrough. We will learn how it works and then implement our own text generator using GPT-2.

Introduction: “The world’s best economies are directly linked to a culture of encouragement and positive feedback.” Can you guess who said that? It wasn’t a President or Prime Minister. It certainly wasn’t a leading economist like Raghuram Rajan. […]

Sentiment Analysis in Python With TextBlob

Introduction: State-of-the-art technologies in NLP allow us to analyze natural languages on different layers: from simple segmentation of textual information to more sophisticated methods of sentiment categorization. However, this does not necessarily mean that you need advanced programming skills to implement high-level tasks such as sentiment analysis in Python.

Sentiment Analysis: The algorithms of sentiment analysis mostly focus on defining opinions, attitudes, and even emoticons in a corpus of texts. The range of established sentiments varies significantly from […]

How I used NLP (Spacy) to screen Data Science Resumes

Resume making is very tricky. A candidate faces many dilemmas: whether to state a project at length or just mention the bare minimum; whether to mention many skills or just his/her core competency; whether to mention many programming languages or just cite a few; whether to restrict the resume to 2 pages or 1 page. These dilemmas are equally hard for Data Scientists looking for a change and for aspiring Data Scientists. Now before you wonder where […]

FlashText – A library faster than Regular Expressions for NLP tasks

People like me working in the field of Natural Language Processing almost always come across the task of replacing words in a text. The reasons for replacing the words may differ. Some of them are: “would’ve” and “would have” represent the same thing, so changing all the occurrences of “would’ve” to “would have” is one such task. Changing all case variations to a single form, i.e. Python, pytHon, pYthon, pythoN etc. to python. Changing all the synonyms of a word to […]

Text Mining 101: A Stepwise Introduction to Topic Modeling using Latent Semantic Analysis (using Python)

Introduction: Have you ever been inside a well-maintained library? I’m always incredibly impressed with the way the librarians keep everything organized, by name, content, and other topics. But if you gave these librarians thousands of books and asked them to arrange each book on the basis of its genre, they would struggle to accomplish this task in a day, let alone an hour! However, this won’t happen to you if these books come in a digital format, right? All the […]
