EasyOCR Python Tutorial with Examples

Introduction EasyOCR is a Python library for Optical Character Recognition (OCR) that allows you to easily extract text from images and scanned documents. In this tutorial, we will understand the basics of using the Python EasyOCR package with examples to show how to extract text from images along with various parameter settings. EasyOCR Python Package Overview Reader Class EasyOCR Python package consists of the base

Read more

YOLOv6 Explained with Tutorial and Example

Introduction In this article, we will introduce the new object detection model YOLOv6 which has been making buzz in the computer vision community ever since its GitHub was made public a few days back. We will take a brief look at its architecture and the improvement that its author claim. Then we will explain how to use YOLOv6 with step by step tutorial and example. What is YOLOv6? YOLOv6 is the object detection model created by a team at Meituan […]

Read more

Split and Merge Image Color Space Channels in OpenCV and NumPy

Introduction In this tutorial, we will show you how to split the image into it’s multiple channels by using the OpenCV cv2.split() function and also with Numpy. We will also show you how we can merge the single channel images to generate the multi-channel image with the cv2.merge() function. Color Spaces and Channels The colors of images can be organized in many ways and this scheme is known as color space. For example, a popular way of representing the image […]

Read more

HEXA: Self-supervised pretraining with hard examples improves visual representations

Humans perceive the world through observing a large number of visual scenes around us and then effectively generalizing—in other words, interpreting and identifying scenes they haven’t encountered before—without heavily relying on labeled annotations for every single scene. One of the core aspirations in artificial intelligence is to develop algorithms and techniques that endow computers with a strong generalization ability to learn only from raw pixel data to make sense of the visual world, which aligns more closely with how humans […]

Read more

Top 15 Open-Source Datasets of 2020 that every Data Scientist Should add to their Portfolio!

Overview Here is a list of Top 15 Datasets for 2020 that we feel every data scientist should practice on The article contains 5 datasets each for machine learning, computer vision, and NLP By no means is this list exhaustive. Feel free to add other datasets in the comments below   Introduction For the things we have to learn before we can do them, we learn by doing them -Aristotle I am sure everyone can attest to this saying. No […]

Read more

A Comprehensive Guide to Attention Mechanism in Deep Learning for Everyone

 Overview The attention mechanism has changed the way we work with deep learning algorithms Fields like Natural Language Processing (NLP) and even Computer Vision have been revolutionized by the attention mechanism We will learn how this attention mechanism works in deep learning, and even implement it in Python   Introduction “Every once in a while, a revolutionary product comes along that changes everything.” – Steve Jobs What does one of the most famous quotes of the 21st century have to do with […]

Read more

The Winning Approaches from codeFest 2018 – NLP, Computer Vision and Machine Learning!

Introduction Analytics Vidhya’s hackathons are one of the best ways to evaluate how far you’ve traveled in your data science journey. And what better way than to put your skills to the test against the top data scientists from around the globe? Participating in these hackathons also helps you understand where you need to improve and what else you can learn to get a better score in the next competition. And a very popular demand after each hackathon is to […]

Read more

Course Review: Hands On Computer Vision with OpenCV & Python

Introduction In this article I will be providing a review of the Udemy course Hands On Computer Vision with OpenCV & Python by Shrobon Biswas featured on the Udemy online learning site. At the time of this writing I would say that the course is moderately successful with a total of 146 ratings averaging to 4.1/5 stars along with a total enrollment of 851 students. According to the description, this course is introductory in nature and geared towards novice to […]

Read more

PyTesseract: Simple Python Optical Character Recognition

Introduction Humans can understand the contents of an image simply by looking. We perceive the text on the image as text and can read it. Computers don’t work the same way. They need something more concrete, organized in a way they can understand. This is where Optical Character Recognition (OCR) kicks in. Whether it’s recognition of car plates from a camera, or hand-written documents that should be converted into a digital copy, this technique is very useful. While it’s not […]

Read more

Affine Image Transformations in Python with Numpy, Pillow and OpenCV

In this article I will be describing what it means to apply an affine transformation to an image and how to do it in Python. First I will demonstrate the low level operations in Numpy to give a detailed geometric implementation. Then I will segue those into a more practical usage of the Python Pillow and OpenCV libraries. This article was written using a Jupyter notebook and the source can be found at my GitHub repo so, please feel free […]

Read more
1 2