Case Study: Predicting the Onset of Diabetes Within Five Years (part 1 of 3)

Last Updated on August 22, 2019

This is a guest post by Igor Shvartser, a clever young student I have been coaching. This post is part 1 of a 3-part series on modeling the famous Pima Indians Diabetes dataset; it introduces the problem and the data. Part 2 will investigate feature selection and spot checking algorithms, and Part 3 will investigate improvements to the classification accuracy and the final presentation of results. Kick-start your project with my […]

Case Study: Predicting the Onset of Diabetes Within Five Years (part 2 of 3)

Last Updated on August 22, 2019

This is a guest post by Igor Shvartser, a clever young student I have been coaching. This post is part 2 of a 3-part series on modeling the famous Pima Indians Diabetes dataset (update: download from here). In Part 1 we defined the problem and looked at the dataset, describing observations from the patterns we noticed in the data. In this post we will introduce the methodology, spot-check algorithms, and review initial results. Kick-start your […]

Case Study: Predicting the Onset of Diabetes Within Five Years (part 3 of 3)

Last Updated on August 22, 2019

This is a guest post by Igor Shvartser, a clever young student I have been coaching. This post is part 3 of a 3-part series on modeling the famous Pima Indians Diabetes dataset (update: download from here); it investigates improvements to the classification accuracy and presents the final results. In Part 1 we defined the problem and looked at the dataset, describing observations from the patterns we noticed in the data. In Part 2 we […]

Rapidly Accelerate Your Progress in Applied Machine Learning With Weka

Last Updated on August 22, 2019

Why start with Weka over another tool like the R environment or Python for applied machine learning? In this post you will discover why Weka is the perfect platform for beginners interested in rapidly getting good at applied machine learning. After reading this post you will know: Why getting started in applied machine learning is hard. The single most important thing to focus on when getting started in applied machine learning. How to make best […]

How to Download and Install the Weka Machine Learning Workbench

Last Updated on August 22, 2019

The Weka machine learning workbench is a powerful yet easy-to-use platform for predictive modeling. In this post you will discover how you can quickly install Weka on your workstation and get started with machine learning. After reading this post you will know: How to install the all-in-one version of Weka for Windows or Mac. How to install Java and Weka separately on Windows or Mac. How to install Weka on Linux […]

A Tour of the Weka Machine Learning Workbench

Last Updated on August 22, 2019

Weka is an easy-to-use and powerful machine learning platform. It provides a large number of machine learning algorithms, feature selection methods, and data preparation filters. In this post you will discover the Weka machine learning workbench and take a tour of the key interfaces that you can use on your machine learning projects. After reading this post you will know about: The interfaces supported by the Weka machine learning workbench. Those interfaces that […]

How To Load CSV Machine Learning Data in Weka

Last Updated on August 22, 2019

You must be able to load your data before you can start modeling it. In this post you will discover how you can load your CSV dataset in Weka. After reading this post, you will know: About the ARFF file format and how it is the default way to represent data in Weka. How to load a CSV file in the Weka Explorer and save it in ARFF format. How to load a CSV […]

Standard Machine Learning Datasets To Practice in Weka

Last Updated on December 11, 2019

It is a good idea to have small, well-understood datasets when getting started in machine learning and learning a new tool. The Weka machine learning workbench provides a directory of small, well-understood datasets in its installation directory. In this post you will discover some of these small, well-understood datasets distributed with Weka, their details, and where to learn more about them. We will focus on a handful of datasets of differing […]

How to Better Understand Your Machine Learning Data in Weka

Last Updated on August 22, 2019

It is important to take your time to learn about your data when starting on a new machine learning problem. There are key things that you can look at to very quickly learn more about your dataset, such as descriptive statistics and data visualizations. In this post you will discover how you can learn more about your data in the Weka machine learning workbench by reviewing descriptive statistics and visualizations of your data. After […]

How to Normalize and Standardize Your Machine Learning Data in Weka

Last Updated on December 11, 2019

Machine learning algorithms make assumptions about the dataset you are modeling. Often, raw data is composed of attributes with varying scales. For example, one attribute may be in kilograms and another may be a count. Although not required, you can often get a boost in performance by carefully choosing methods to rescale your data. In this post you will discover how you can rescale your data so that all of the data has the […]
