Data Science Trends of the Future 2022

Photo credit: Unsplash. Data Science is an exciting field for knowledge workers because it increasingly intersects with the future of how industries, society, governance and policy will function. While it’s one of those vague terms thrown around a lot for students, it’s actually fairly simple to define. Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from structured and unstructured data, and apply knowledge and actionable insights from data across […]

Read more

Beginners Guide to Topic Modeling in Python

Introduction Analytics Industry is all about obtaining the “Information” from the data. With the growing amount of data in recent years, that too mostly unstructured, it’s difficult to obtain the relevant and desired information. But, technology has developed some powerful methods which can be used to mine through the data and fetch the information that we are looking for. One such technique in the field of text mining is Topic Modelling. As the name suggests, it is a process to […]

Read more

Training with historical data! Surely, you’re joking says the IoT asset that just got connected

By Priya Sharma – Sr. Data Scientist -IoT Analytics, SAS Institute Inc. Saurabh Mishra – Product Management, IoT, SAS Institute Inc. June 12, 2020 Description: Majority of AI approaches are based on the construct of training against historical data and then inferencing new data. While this is a sound and proven approach, a lot of IoT assets coming online don’t have historical data and we don’t necessarily have the time to wait. Modern Machine Learning methods can be employed to […]

Read more

Python Scikit-learn to simplify Machine learning : { Bag of words } To [ TF-IDF ]

Text (word) analysis and tokenized text modeling always give a chill air around ears, specially when you are new to machine learning. Thanks to Python and its extended libraries for its warm support around text analytics and machine learning. Scikit-learn is a savior and excellent support in text processing when you also understand some of the concept like “Bag of word”, “Clustering” and “vectorization”. Vectorization is  must-to-know technique for all machine leaning learners, text miner and algorithm implementor. I personally consider […]

Read more

Machine Learning – Anomaly Detection: “Finding a Needle in a Haystack”

After exploring formulation, classification, benchmarking, we explore another facet of Machine Learning: anomaly detection. This part is key in the IoT transformation, as it enables internet-connected AI devices to alert, adapt and respond accordingly. Once properly trained, an IoT could not only warn and prevent imminent failure, but also execute a response, adaptive to the anomaly detected. In this process, we’ll explore intrinsic hurdles that makes the anomaly detection process a non-trivial task of “finding a needle in haystack”. Opportunities abound to explore, and any univariate sequential […]

Read more

Characteristics of Good Visual Analytics and Data Discovery Tools

Visual Analytics and Data Discovery allow analysis of big data sets to find insights and valuable information. This is much more than just classical Business Intelligence (BI). See this article for more details and motivation: “Using Visual Analytics to Make Better Decisions: the Death Pill Exa…“. Let’s take a look at important characteristics to choose the right tool for your use cases. Visual Analytics Tool Comparison and Evaluation Several tools are available on the market for Visual Analytics and Data […]

Read more

Career Transition Towards Data Science: Planning a Learning Sabbatical

At the time of writing this post, I am nine months into my learning sabbatical. You can read about my journey here: “Career Transition Towards Data Analytics & Science”. Today I will share with you how you can plan your own, unique learning sabbatical, regardless of its scope and duration – anywhere between 1 and 12 months. Let’s get started. Begin with the end in mind If you have ever read Stephen Covey’s “7 Habits of Highly Effective People” you […]

Read more

Why Excel Users Should Learn Python

Latest update: November 16, 2018 Microsoft Excel has been around for over 30 years now, and chances are it’s not going to change in the foreseeable future. In fact, Excel is facing immense competition from challengers such as Google Spreadsheets and well-funded start-ups like Airtable, which are both going after Excel’s massive user base of approximately 500 million worldwide. Tech-savvy small and mid-sized businesses embrace innovative alternatives to Excel. However, making a dent in the large enterprise space is a […]

Read more

Should Python Become Your Official Corporate Language, Along With English?

English is becoming the official language in the global business world, being currently spoken by approximately 1.75 billion people worldwide according to Harvard Business Review. While English is the fastest spreading language in human history, a significant proportion of businesses are still resistant to giving up on their native language. Just try having a casual conversation in English with German employees at their corporate headquarters canteen (I am German, just for the record). However, pressures are piling up, not only […]

Read more

Mental Framework For A Data Driven Digital Transformation

Over the last years, my small business has undergone a digital transformation from a marketing service company to a data literacy consultancy. What does a data literacy consultancy do? We teach business users within large enterprises to work with data, and we help them acquire the necessary skills from state of the art Excel to Python, querying structured, semi-structured and unstructured databases, as well as math, statistics, and probability. Throughout our transition, we applied a set of techniques, principles, and […]

Read more
1 2