Hacks to perform faster Text Mining in R

Introduction Data science demands versatility. Move away from your regular methods, challenge your ways of working, explore new ways of doing things more efficiently. On reminiscing about my old days, my initial years in data science, I had also got trapped by this devil of ‘complacency’. At one point, I was not challenging myself enough. I wasn’t  experimenting with the ways of doing work. I accepted the things as they were, until I realized ‘Complacency is a state of mind […]

Read more

DataHack Radio #12: Exploring the Nuts and Bolts of Natural Language Processing with Sebastian Ruder

https://soundcloud.com/datahack-radio/episode-12-sebastian-ruder Introduction There’s text everywhere around us, from digital sources like social media to physical objects like books and print media. The amount of text data being generated every day is mind boggling and yet we’re not even close to harnessing the full power of natural language processing. I see a ton of aspiring data scientists interested in this field, but they often turn away daunted by the challenges NLP presents. It’s such a niche line of work, and we […]

Read more

8 Awesome Data Science Capstone Projects from Praxis Business School

Introduction It is not the strongest or the most intelligent who will survive but those who can best manage change. Evolution is the only way anything can survive in this universe. And when it comes to industry relevant education in a fast evolving domain like Machine Learning and Artificial Intelligence – it is necessary to evolve or you will simply perish (over time). I have personally experienced this first hand while building Analytics Vidhya. It still amazes me to see […]

Read more

OpenAI’s GPT-2: A Simple Guide to Build the World’s Most Advanced Text Generator in Python

Overview Learn how to build your own text generator in Python using OpenAI’s GPT-2 framework GPT-2 is a state-of-the-art NLP framework – a truly incredible breakthrough We will learn how it works and then implements our own text generator using GPT-2   Introduction “The world’s best economies are directly linked to a culture of encouragement and positive feedback.” Can you guess who said that? It wasn’t a President or Prime Minister. It certainly wasn’t a leading economist like Raghuram Rajan. […]

Read more

6 Exciting Open Source Data Science Projects you Should Start Working on Today

Overview Here are six open-source data science projects to enhance your skillset These projects cover a diverse set of domains, from computer vision to natural language processing (NLP), among others Pick your favorite open-source data science project(s) and get coding!   Introduction I recently helped out in a round of interviews for an open data scientist position. As you can imagine, there were candidates from all kinds of backgrounds – software engineering, learning and development, finance, marketing, etc. What stood […]

Read more

FlashText – A library faster than Regular Expressions for NLP tasks

People like me working in the field of Natural Language Processing almost always come across the task of replacing words in a text. The reasons behind replacing the words may be different. Some of them are. “would’ve” and “would have” represent the same thing. So changing all the occurrences of “would’ve” to “would have” is one such task. Changing all Case Variations to a single form i.e Python, pytHon, pYthon, pythoN etc. to python Changing all the synonyms of a word to […]

Read more

Text Mining 101: A Stepwise Introduction to Topic Modeling using Latent Semantic Analysis (using Python)

Introduction Have you ever been inside a well-maintained library? I’m always incredibly impressed with the way the librarians keep everything organized, by name, content, and other topics. But if you gave these librarians thousands of books and asked them to arrange each book on the basis of their genre, they will struggle to accomplish this task in a day, let alone an hour! However, this won’t happen to you if these books came in a digital format, right? All the […]

Read more

DataHack Radio #24: Exploring and Designing Chatbots with RASA’s Justina Petraitytė

Introduction Chatbots are the most common application of Natural Language Processing (NLP). Organizations are scrambling to integrate chatbots into their daily functions to enhance and personalize our experience. As a data science professional, I’m always curious about how these chatbots are built. Rasa is one such open source framework that we can leverage to build our own chatbots. So we are delighted to have Rasa’s data scientist and Head of Developer Relations, Justina Petraitytė, on our DataHack Radio podcast! Justina […]

Read more

Knowledge Graph – A Powerful Data Science Technique to Mine Information from Text (with Python code)

Overview Knowledge graphs are one of the most fascinating concepts in data science Learn how to build a knowledge graph to mine information from Wikipedia pages You will be working hands-on in Python to build a knowledge graph using the popular spaCy library   Introduction Lionel Messi needs no introduction. Even folks who don’t follow football have heard about the brilliance of one of the greatest players to have graced the sport. Here’s his Wikipedia page: Quite a lot of […]

Read more

Introduction to Structuring Customer complaints explained with examples

Introduction In past, if you were not particularly happy with a service or a product, you would go to the service provider or the shop and lodge a complaint. With services-businesses going online and due to enormous scale, lodging complaints in-person may not be always possible. Electronic ways such as emails, social media and particularly websites like www.consumercomplaints.in focusing on such issues, are widely used platforms to vent out the anger as well as publicizing the issue in expectancy of […]

Read more
1 4 5 6 7