Step by step guide to extract insights from free text (unstructured data)

Text mining is one of the most complex analyses in the analytics industry. The reason is that text mining deals with unstructured data: we do not have clearly defined observations and variables (rows and columns). Hence, for any kind of analytics, you first need to convert this unstructured data into a structured dataset and then proceed with the normal modelling framework. The additional step of converting unstructured data into a structured format is […]

Training with historical data! Surely, you’re joking says the IoT asset that just got connected

By Priya Sharma – Sr. Data Scientist, IoT Analytics, SAS Institute Inc.; Saurabh Mishra – Product Management, IoT, SAS Institute Inc. June 12, 2020. Description: The majority of AI approaches are built on training against historical data and then running inference on new data. While this is a sound and proven approach, many IoT assets coming online don’t have historical data, and we don’t necessarily have the time to wait. Modern machine learning methods can be employed to […]

Picking an Analytic Platform

Summary: Picking an analytic platform when first starting out in data science almost always means working with what we’re most comfortable with. But as organizations grow larger, there is a need for standardization and for selecting one, or a few, analytic tools. Picking an analytic platform when first starting out in data science almost always means working with what we’re most comfortable with. That in turn almost always means whatever we used in college (or in a certificate course), be it R, […]

Characteristics of Good Visual Analytics and Data Discovery Tools

Visual Analytics and Data Discovery allow analysis of big data sets to find insights and valuable information. This is much more than just classical Business Intelligence (BI). See this article for more details and motivation: “Using Visual Analytics to Make Better Decisions: the Death Pill Exa…”. Let’s take a look at the important characteristics for choosing the right tool for your use cases.

Visual Analytics Tool Comparison and Evaluation

Several tools are available on the market for Visual Analytics and Data […]

R, Python or SAS: Which one should you learn first?

Python, R and SAS are the three most popular languages in data science. If you are new to the world of data science and aren’t experienced in any of these languages, it makes sense to be unsure of whether to learn R, SAS or Python. Don’t fret: by the time you’re done reading this article, you will know without a doubt which language is the right one for you.

Overview

R – R is the lingua franca of statistics. It is a […]

Why R is Bad for You

Summary: Someone had to say it. In my opinion, R is not the best way to learn data science and not the best way to practice it either. More and more large employers agree. Someone had to say it. I know this will be controversial and I welcome your comments, but in my opinion R is not the best way to learn data science and not the best way to practice it either. Why Should We Care What […]
