Naive Bayes Classification explained with Python code

Machine Learning is a vast area of Computer Science that is concerned with designing algorithms which form good models of the world around us (the data coming from the world around us). Within Machine Learning many tasks are – or can be reformulated as – classification tasks. In classification tasks we are trying to produce a model which can give the correlation between the input data  and the class  each input belongs to. This model is formed with the feature-values of the input-data. […]

Read more

What are the Big Guys Using?

Summary:  The largest companies utilizing the most data science resources are moving rapidly toward more integrated advanced analytic platforms.  The features they are demanding are evolving to promote speed, simplicity, quality, and manageability.  This has some interesting implications for open source R and Python widely taught in schools but significantly less necessary with these more sophisticated platforms.   We continue to be dazzled, and perhaps rightly so, by the advances in deep learning and question answering machines like Watson.  And […]

Read more

Open Source Deep Learning Frameworks and Visual Analytics

Deep Learning gets more and more traction. It basically focuses on one section of Machine Learning: Artificial Neural Networks. This article explains why Deep Learning is a game changer in analytics, when to use it, and how Visual Analytics allows business analysts to leverage the analytic models built by a (citizen) data scientist. What is Deep Learning and Artificial Neural Networks? Deep Learning is the modern buzzword for artificial neural networks, one of many concepts and algorithms in machine learning […]

Read more

Learn Python for Data Science from Scratch

Python is a multipurpose programming language and widely used for Data Science, which is termed as the sexiest job of this century. Data Scientist mine thru the large dataset to gain insight and make meaningful data driven decisions. Python is used as general purposed programming language and used for Web Development, Networking, Scientific computing etc. We will be discussing further about the series of awesome libraries in python such as numpy, scipy & pandas for data manipulation & wrangling and […]

Read more

Why R is Bad for You

Summary:  Someone had to say it.  In my opinion R is not the best way to learn data science and not the best way to practice it either.  More and more large employers agree.   Someone had to say it.  I know this will be controversial and I welcome your comments but in my opinion R is not the best way to learn data science and not the best way to practice it either.   Why Should We Care What […]

Read more

Will Python Replace Java?

  According to the IT programming trend, Java is currently more popular than other programming languages in terms of number of jobs, number of existing Java developers and overall usage statics in IT compared to Python. According to the latest usage statistics posted on a popular Technology Survey site, Java is being used by 3.0% websites as a server-side programming language, whereas only 0.2% of websites use Python. However, all the recent reports have highlighted that the usage and popularity of Python […]

Read more

How to automatically create Base Line Estimators using scikit learn.

For any machine learning problem, say a classifier in this case, it’s always handy to create quickly a base line classifier against which we can compare our new models. You don’t want to spend a lot of time creating these base line classifiers; you would rather spend that time in building and validating new features for your final model. In this post we will see how we can rapidly create base line classifier using scikit learn package for any dataset. […]

Read more

Deep Learning with TensorFlow in Python

Classifying the letters with notMNIST dataset Let’s first learn about simple data curation practices, and familiarize ourselves with some of the data that are going to be used for deep learning using tensorflow. The notMNIST dataset to be used with python experiments. This dataset is designed to look like the classic MNIST dataset, while looking a little more like real data: it’s a harder task, and the data is a lot less ‘clean’ than MNIST. Preprocessing First the dataset needs to be downloaded and […]

Read more

Four great machine learning eBooks

Want to learn machine learning? Looking for data science tutorials and guides to help you master your data and produce actionable, game-changing insights? Look no further than this list of machine learning eBooks from the Packt team…. 1. Python Machine Learning Python Machine Learning is today one of the most popular machine learning titles on the market. And it’s not hard to see why – by bridging the gap between theory and practice, the author Sebastian Raschka provides you with an […]

Read more

Some Analysis with Astronomy data (in Python)

The following figure shows the median FITS file computed from the above FITS files using the binapprox algorithm. 2. Cross-matching When investigating astronomical objects, like active galactic nuclei (AGN), astronomers compare data about those objects from different telescopes at different wavelengths. This requires positional cross-matching to find the closest counterpart within a given radius on the sky. In this activity you’ll cross-match two catalogues: one from a radio survey, the AT20G Bright Source Sample (BSS) catalogue and one from an optical survey, the SuperCOSMOS all-sky galaxy catalogue. The BSS catalogue […]

Read more
1 893 894 895 896 897 922