Data Science Trends of the Future 2022

Photo credit: Unsplash. Data Science is an exciting field for knowledge workers because it increasingly intersects with the future of how industries, society, governance and policy will function. While it’s one of those vague terms thrown around a lot for students, it’s actually fairly simple to define. Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from structured and unstructured data, and apply knowledge and actionable insights from data across […]

Read more

Open Source Deep Learning Frameworks and Visual Analytics

Deep Learning gets more and more traction. It basically focuses on one section of Machine Learning: Artificial Neural Networks. This article explains why Deep Learning is a game changer in analytics, when to use it, and how Visual Analytics allows business analysts to leverage the analytic models built by a (citizen) data scientist. What is Deep Learning and Artificial Neural Networks? Deep Learning is the modern buzzword for artificial neural networks, one of many concepts and algorithms in machine learning […]

Read more

Starting to develop in PySpark with Jupyter installed in a Big Data Cluster

Is not a secret that Data Science tools like Jupyter, Apache Zeppelin or the more recently launched Cloud Data Lab and Jupyter Lab are a must be known for the day by day work so How could be combined the power of easily developing models and the capacity of computation of a Big Data Cluster? Well in this article I will share very simple step to start using Jupyter notebooks for PySpark in a Data Proc Cluster in GCP. Final goal Prerequisites 1. Have a Google Cloud […]

Read more