Data, Learning and Modeling

Last Updated on January 6, 2017 There are key concepts in machine learning that lay the foundation for understanding the field. In this post, you will learn the nomenclature (standard terms) that is used when describing data and datasets. You will also learn the concepts and terms used to describe learning and modeling from data that will provide a valuable intuition for your journey through the field of machine learning. Data Machine learning methods learn from examples. It is important […]

Read more

How to Define Your Machine Learning Problem

Last Updated on June 7, 2016 The first step in any project is defining your problem. You can use the most powerful and shiniest algorithms available, but the results will be meaningless if you are solving the wrong problem. In this post you will learn the process for thinking deeply about your problem before you get started. This is unarguably the most important aspect of applying machine learning. What is the problem?Photo attributed to Eleaf, some rights reserved Problem Definition […]

Read more

How to Prepare Data For Machine Learning

Last Updated on August 16, 2020 Machine learning algorithms learn from data. It is critical that you feed them the right data for the problem you want to solve. Even if you have good data, you need to make sure that it is in a useful scale, format and even that meaningful features are included. In this post you will learn how to prepare data for a machine learning algorithm. This is a big topic and you will cover the […]

Read more

How to Evaluate Machine Learning Algorithms

Last Updated on August 16, 2020 Once you have defined your problem and prepared your data you need to apply machine learning algorithms to the data in order to solve your problem. You can spend a lot of time choosing, running and tuning algorithms. You want to make sure you are using your time effectively to get closer to your goal. In this post you will step through a process to rapidly test algorithms and discover whether or not there […]

Read more

How to Improve Machine Learning Results

Last Updated on August 16, 2020 Having one or two algorithms that perform reasonably well on a problem is a good start, but sometimes you may be incentivised to get the best result you can given the time and resources you have available. In this post, you will review methods you can use to squeeze out extra performance and improve the results you are getting from machine learning algorithms. When tuning algorithms you must have a high confidence in the […]

Read more

How to Identify Outliers in your Data

Last Updated on August 16, 2020 Bojan Miletic asked a question about outlier detection in datasets when working with machine learning algorithms. This post is in answer to his question. If you have a question about machine learning, sign-up to the newsletter and reply to an email or use the contact form and ask, I will answer your question and may even turn it into a blog post. Kick-start your project with my new book Data Preparation for Machine Learning, […]

Read more

How to Use Machine Learning Results

Last Updated on June 7, 2016 Once you have found and tuned a viable model of your problem it is time to make use of that model. You may need to revisit your why and remind yourself what form you need a solution for the problem you are solving. The problem is not addressed until you do something with the results. In this post you will learn tactics for presenting your results in answer to a question and considerations when […]

Read more

4 Self-Study Machine Learning Projects

Last Updated on September 5, 2019 There are many paths into the field of machine learning and most start with theory. If you are a programmer then you already have the skills to decompose problems into their constituent parts and to prototype small projects in order to learn new technologies, libraries and methods. These are important skills for any professional programmer and these skills can be used to get started in machine learning, today. These are important skills for any […]

Read more

What is Data Mining and KDD

Last Updated on August 16, 2020 I am very interested in processes. I want to know good ways to do things, even the best way to do things if possible. Even if you don’t have skill or deep understanding, process can get you a long way. It can lead the way and skill and deep understanding can follow. At least, I have used this to drive much of my work. I think it’s useful to study data mining as it […]

Read more

Reproducible Machine Learning Results By Default

Last Updated on August 16, 2020 It is good practice to have reproducible outcomes in software projects. It might even be standard practice by now, I hope it is. You can take any developer off the street and they should be able to follow your process to check out the code base from revision control and make a build of the software ready to use. Even better if you have a procedure for setting up an environment and for releasing […]

Read more
1 787 788 789 790 791 935