Reading and Writing CSV Files in Python with Pandas

There are many ways of reading and writing CSV files in Python. There are a few different methods, for example, you can use Python’s built in open() function to read the CSV (Comma Separated Values) files or you can use Python’s dedicated csv module to read and write CSV files. Depending on your use-case, you can also use Python’s Pandas library to read and write CSV files. In this article, you will see how to use Python’s Pandas library to […]

Read more

One-Hot Encoding in Python with Pandas and Scikit-Learn

Introduction In computer science, data can be represented in a lot of different ways, and naturally, every single one of them has its advantages as well as disadvantages in certain fields. Since computers are unable to process categorical data as these categories have no meaning for them, this information has to be prepared if we want a computer to be able to process it. This action is called preprocessing. A big part of preprocessing is encoding – representing every single […]

Read more

Reading and Writing Excel (XLSX) Files in Python with the Pandas Library

Introduction Just like with all other types of files, you can use the Pandas library to read and write Excel files using Python as well. In this short tutorial, we are going to discuss how to read and write Excel files via DataFrames. In addition to simple reading and writing, we will also learn how to write multiple DataFrames into an Excel file, how to read specific rows and columns from a spreadsheet, and how to name single and multiple […]

Read more

An indispensable Python : Data sourcing to Data science.

Data analysis echo system has grown all the way from SQL’s to NoSQL and from Excel analysis to Visualization. Today, we are in scarceness of the resources to process ALL (You better understand what i mean by ALL) kind of data that is coming to enterprise. Data goes through profiling, formatting, munging or cleansing, pruning, transformation steps to analytics and predictive modeling. Interestingly, there is no one tool proved to be an effective solution to run all these operations { Don’t forget the […]

Read more

Beginner’s Tutorial on the Pandas Python Library

Pandas is an open source Python package that provides numerous tools for data analysis. The package comes with several data structures that can be used for many different data manipulation tasks. It also has a variety of methods that can be invoked for data analysis, which comes in handy when working on data science and machine learning problems in Python. Advantages of Using Pandas The following are some of the advantages of the Pandas library: It can present data in […]

Read more

Creating a Simple Recommender System in Python using Pandas

Introduction Have you ever wondered how Netflix suggests movies to you based on the movies you have already watched? Or how does an e-commerce websites display options such as “Frequently Bought Together”? They may look relatively simple options but behind the scenes, a complex statistical algorithm executes in order to predict these recommendations. Such systems are called Recommender Systems, Recommendation Systems, or Recommendation Engines. A Recommender System is one of the most famous applications of data science and machine learning. […]

Read more
1 2 3