10 Standard Datasets for Practicing Applied Machine Learning

Last Updated on May 20, 2020 The key to getting good at applied machine learning is practicing on lots of different datasets. This is because each problem is different, requiring subtly different data preparation and modeling methods. In this post, you will discover 10 top standard machine learning datasets that you can use for practice. Let’s dive in. Update Mar/2018: Added alternate link to download the Pima Indians and Boston Housing datasets as the originals appear to have been taken […]

Read more

5 Top Machine Learning Podcasts

Machine learning podcasts are now a thing. There are now enough of us interested in this obscure geeky topic that there are podcasts dedicated to chatting about the ins and outs of predictive modeling. There has never been a better time to get started and working in this amazing field. In this post, I want to share the 5 podcasts on machine learning and data science that I listen to. Let’s dive in. Overview Here’s the short list of machine […]

Read more

7 Time Series Datasets for Machine Learning

Last Updated on August 21, 2019 Machine learning can be applied to time series datasets. These are problems where a numeric or categorical value must be predicted, but the rows of data are ordered by time. A problem when getting started in time series forecasting with machine learning is finding good quality standard datasets on which to practice. In this post, you will discover 8 standard time series datasets that you can use to get started and practice time series […]

Read more

What Is Time Series Forecasting?

Last Updated on August 15, 2020 Time series forecasting is an important area of machine learning that is often neglected. It is important because there are so many prediction problems that involve a time component. These problems are neglected because it is this time component that makes time series problems more difficult to handle. In this post, you will discover time series forecasting. After reading this post, you will know: Standard definitions of time series, time series analysis, and time […]

Read more

Time Series Forecasting as Supervised Learning

Last Updated on August 15, 2020 Time series forecasting can be framed as a supervised learning problem. This re-framing of your time series data allows you access to the suite of standard linear and nonlinear machine learning algorithms on your problem. In this post, you will discover how you can re-frame your time series problem as a supervised learning problem for machine learning. After reading this post, you will know: What supervised learning is and how it is the foundation […]

Read more

How to Go From Working in a Bank To Hired as Senior Data Scientist at Target

Last Updated on December 7, 2016 How Santhosh Sharma Went FromWorking in the Loans Department of a Bank toGetting Hired as a Senior Data Scientist at Target. Santhosh Sharma recently reached out to me to share his inspirational story and I want to share it with you. His story shows how with enthusiasm for machine learning, taking the initiative, sharing your results and a little luck can change your career and throw you deep into applied machine learning. After reading this interview, you will know: How Santhosh […]

Read more

How to Load and Explore Time Series Data in Python

Last Updated on April 30, 2020 The Pandas library in Python provides excellent, built-in support for time series data. Once loaded, Pandas also provides tools to explore and better understand your dataset. In this post, you will discover how to load and explore your time series dataset. After completing this tutorial, you will know: How to load your time series dataset from a CSV file using Pandas. How to peek at the loaded data and calculate summary statistics. How to […]

Read more

How to Normalize and Standardize Time Series Data in Python

Last Updated on August 28, 2019 Some machine learning algorithms will achieve better performance if your time series data has a consistent scale or distribution. Two techniques that you can use to consistently rescale your time series data are normalization and standardization. In this tutorial, you will discover how you can apply normalization and standardization rescaling to your time series data in Python. After completing this tutorial, you will know: The limitations of normalization and expectations of your data for […]

Read more

Basic Feature Engineering With Time Series Data in Python

Last Updated on September 15, 2019 Time Series data must be re-framed as a supervised learning dataset before we can start using machine learning algorithms. There is no concept of input and output features in time series. Instead, we must choose the variable to be predicted and use feature engineering to construct all of the inputs that will be used to make predictions for future time steps. In this tutorial, you will discover how to perform feature engineering on time […]

Read more

How To Resample and Interpolate Your Time Series Data With Python

Last Updated on February 11, 2020 You may have observations at the wrong frequency. Maybe they are too granular or not granular enough. The Pandas library in Python provides the capability to change the frequency of your time series data. In this tutorial, you will discover how to use Pandas in Python to both increase and decrease the sampling frequency of time series data. After completing this tutorial, you will know: About time series resampling, the two types of resampling, […]

Read more
1 808 809 810 811 812 922