Machine Learning in Cyber Security — Malicious Software Installation

Introduction Monitoring of user activities performed by local administrators is always a challenge for SOC analysts and security professionals. Most of the security framework will recommend the implementation of a whitelist mechanism. However, the real world is often not ideal. You will always have different developers or users having local administrator rights to bypass controls specified. Is there a way to monitor the local administrator activities?

Read more

Beginners Tutorial for Regular Expressions in Python

Importance of Regular Expressions In last few years, there has been a dramatic shift in usage of general purpose programming languages for data science and machine learning. This was not always the case – a decade back this thought would have met a lot of skeptic eyes! This means that more people / organizations are using tools like Python / JavaScript for solving their data needs. This is where Regular Expressions become super useful. Regular expressions are normally the default way […]

Read more

How to Use Texthero to Prepare a Text-based Dataset for Your NLP Project

Introduction Natural Language Processing (NLP) is one of the most important fields of study and research in today’s world. It has many applications in the business sector such as chatbots, sentiment analysis, and document classification. Preprocessing and representing text is one of the trickiest and most annoying parts of working on an NLP project. Text-based datasets can be incredibly thorny and difficult to preprocess. But fortunately, the latest Python package called Texthero can help you solve these challenges. What is […]

Read more

10 Statistical Functions in Excel every Analytics Professional Should Know

Overview Microsoft Excel is an excellent tool for learning and executing statistical functions Here are 12 statistical functions in Excel that you should master for a successful analytics career   Let’s Excel in Statistics! “Statistics is the grammar of Science.” – Karl  Pearson Let’s make that a bit more relevant for us – Statistics is the grammar of “Data” Science. You’ll notice that almost every successful data science professional or analytics professional has a solid understanding of statistics – but […]

Read more

What is AWS? Why Every Data Science Professional Should Learn Amazon Web Services

Overview Amazon Web Services (AWS) is the leading cloud platform for deploying machine learning solutions Every data science professional should learn how AWS works   Introduction “Your machine ran out of memory.” Sounds familiar? It certainly is for me – especially anytime I try to run a complex machine learning algorithm on my personal machine. It’s quite a frustrating experience that a lot of data science professionals feel. We don’t have the unlimited computing power of the tech behemoths – […]

Read more

How I Became a Data Science Competition Master from Scratch

Overview Winning data science competitions can be a complex process – but you can crack the top 3 if you have a framework to follow Hear from a top data science hackathon expert and how he went from scratch to winning data science competitions   Introduction There is no alternative to learning through experience. Especially in the data science industry! I recently won the top prize in Zindi’s Zimnat Insurance Recommendation challenge – an achievement that ranks top among my […]

Read more

6 Key Points you Should Focus on for your Next Data Science Interview

Overview Preparing for your next data science interview? You need to ensure you’re covering your basics Here are 6 key points we’ve taken from our data science interview experience that you should focus on   Introduction You’ve finally done it! You have landed an interview for a data science role. Now, a day before your interview, you’re not sure what to study. The day is almost here but there is so much to cover! Sound familiar? Interviews can be daunting […]

Read more

5 Popular NoSQL Databases Every Data Science Professional Should Know About

Overview NoSQL databases are ubiquitous in the industry – a data scientist is expected to be familiar with these databases Here, we will see what is a NoSQL database and why you should learn about it We will also look at the features of 5 different NoSQL databases   Introduction Here’s a piece of advice I wish someone had given me when I was starting out in data science – learn as much as you can about working with databases. […]

Read more
1 5 6 7