INTRODUCTION

o Kafka is just like a messaging System.

o It is distributed Platform or Application.

o Cluster is made up of more than one Kafka Server.

o Production Environment Kafka is referred as Kafka Cluster.

o Each Kafka Server is referred as Broker.

Architecture

Kafka is a Fault Tolerant

Ability of a System to…

What is Airflow?

Airflow is a platform to Programmatically Author, Schedule and Monitor Workflows or Data Pipeline.

What is Workflow?

§ A Sequence of tasks.

§ Schedule or Triggered an email

§ Frequently used to handle big data processing pipeline.

E.g. of Workflow

Traditional ETL Approach:

(Data Science)

•Python and SQL

After almost a year at my first job I must say this is the most critical skill a Data Science Aspirant should have.

Without Python I can’t imagine surviving in my current job.

Python is one of the most powerful languages I ever come across. It is everywhere. …

After failure in interview, I thought let’s try to learn this system design in depth. Then I understand to crack interview we don’t need to learn everything. Fundamentals of System Design is enough to crack this interviews. I gone through various You Tube Channel which taught System Design. …

“ARISE BE FEARLESS BE STRONG. TAKE THE RESPONSIBILITY ON YOURSELF AND KNOW THAT YOU ARE THE CREATORS OF YOUR FUTURE. ALL THE HELP AND POWER YOU SEEK LIES WITHIN YOU.SO RISE AND CREATE YOUR OWN FUTURE.”
-BY SWAMI VIVEKANAND

What you should know to take part in any blockchain-related technology…

Vikas Maurya

Lead Data Scientist at Tata Power Ltd.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store