Text Preprocessing

This is the second step of the NLP end-to-end pipeline. In this step, We generally perform basic preprocessing and then advanced preprocessing but it depends on problem to problem. Let’s see the steps of text preprocessing. Lowercasing:- This is the first step of data preprocessing. It’s compulsory for all kinds of problems because whenever we work on an    

Read more

Python News: What’s New From October 2022

As always, October was an eventful month for the Python community this year. It brought the final Python 3.11 release, a few beta versions of the next major releases of popular Python libraries, and some exciting developments in the Python community. Here’s a quick recap of the most interesting happenings from October 2022. The Python 3.11 Release By far, this month’s most exciting event was the release of Python 3.11, overshadowing a few other interesting news items. October was chosen […]

Read more

Highlights from Machine Translation and Multilinguality in October 2022

Here are my monthly highlights from paper machine translation and multilinguality that appeared on arXiv, many of them preprints from the upcoming EMNLP conference. Folks from Amazon published a pre-print that introduces a simple method of how to make pre-trained multilingual representation more robust towards noisy inputs. It is a very straightforward approach: they sample typos based on Wikipedia logs and use those during model training. In addition, they add a contrastive loss that forces the noisy versions of sentences […]

Read more

Linear Regression From Scratch, and Avoiding Built-ins

Over the past few months, I had done quite a number of courses in Machine Learning (the popular ones) — Machine Learning by Andrew Ng, and the first three courses of the Deep Learning Specialisation. Just as I was thinking whether to move on to Convolutional Neural Networks or not, I thought it would be best to give myself a chance to implement what I’ve learnt so far. It would help me to strengthen my fundamentals, and that would definitely […]

Read more
1 135 136 137 138 139 927