Premium Only Content

Realtime Data Streaming | End To End Data Engineering Project
In this video, you will be building a real-time data streaming pipeline, covering each phase from data ingestion to processing and finally storage. We'll utilize a powerful stack of tools and technologies, including Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra—all neatly containerized using Docker.
📚 What You'll Learn:
👉 Setting up a data pipeline with Apache Airflow
👉 Streaming data with Kafka and Kafka Connect
👉 Using Zookeeper for distributed synchronization
👉 Data processing with Apache Spark
👉 Data storage solutions with Cassandra and PostgreSQL
👉 Containerizing your data engineering environment with Docker
✨ Timestamps: ✨
0:00 Introduction
0:53 System architecture
3:47 Getting data from API with Airflow
17:10 Docker Compose for the architecture
26:09 Streaming data into Kafka
44:29 Apache Spark and Cassandra setup
49:33 Streaming data into cassandra
1:27:05 Outro
👦🏻 My Linkedin: https://www.linkedin.com/in/yusuf-ganiyu-b90140107/
🚀 Twitter: https://twitter.com/YusufOGaniyu
📝 Medium: https://medium.com/@yusuf.ganiyu
🌟 Please LIKE ❤️ and SUBSCRIBE for more AMAZING content! 🌟
🔗 Useful Links and Resources:
✅ Code: https://github.com/airscholar/e2e-data-engineering.git
✅ Medium Article: https://medium.com/@yusuf.ganiyu/realtime-data-engineering-project-with-airflow-kafka-spark-cassandra-and-postgres-804bcd963974
✅ Docker Compose Documentation: https://docs.docker.com/compose/
✅ Apache Kafka Official Site: https://kafka.apache.org/
✅ Apache Spark Official Site: https://spark.apache.org/
✅ Apache Airflow Official Site: https://airflow.apache.org/
✅ Cassandra: https://cassandra.apache.org/
✅ Confluent Docs: https://docs.confluent.io/home/overview.html
✨ Tags ✨
Data Engineering, Apache Airflow, Kafka, Apache Spark, Cassandra, PostgreSQL, Zookeeper, Docker, Docker Compose, ETL Pipeline, Data Pipeline, Big Data, Streaming Data, Real-time Analytics, Kafka Connect, Spark Master, Spark Worker, Schema Registry, Control Center, Data Streaming
✨ Hashtags ✨
#confluent #DataEngineering #ApacheAirflow #Kafka #ApacheSpark #Cassandra #PostgreSQL #Docker #ETLPipeline #DataPipeline #StreamingData #RealTimeAnalytics
-
4:24:53
Right Side Broadcasting Network
8 hours agoLIVE REPLAY: President Trump and UK Prime Minister Starmer Meet and Hold a Press Conference 2/27/25
108K41 -
2:12:33
Adam Carolla
2 days agoKamala Harris vs. Trump Wildfire Response + Jillian Michaels on Motherhood/Ayahuasca & Sperm Donors!
17.4K11 -
LIVE
Barry Cunningham
7 hours agoTRUMP DAILY BRIEFING: PRESIDENT TRUMP AND UK PRIME MINISTER STARMER HOLD PRESS CONFERENCE!
1,155 watching -
1:44:47
The Quartering
4 hours agoEpstein Files RELEASED Today, Gene Hackman Mystery & Trump Vs Woke Reporters!
67.1K27 -
1:19:38
Ben Shapiro
5 hours agoEp. 2147 - HUGE: The Washington Post SURRENDERS…To Reality!
76.6K16 -
1:01:36
Russell Brand
18 hours agoThe Battle for Power: Trump’s Plan, Elon’s Purge & the DEI Illusion – SF545
181K58 -
1:58:33
The Charlie Kirk Show
5 hours agoAmerica's Spiritual Revival + Runaway Judges + Vivek For Ohio | Habba, Waggoner, Vivek | 2.27.2025
128K21 -
58:33
The Dan Bongino Show
7 hours agoAmerica's A-Team Hits The Ground Running (Ep. 2432) - 02/27/2025
839K1.34K -
1:19:38
The Rubin Report
6 hours agoCNN Host Brutally Mocked for Trying to Profit Off of What He Called a ‘Conspiracy’
95.6K60 -
2:00:42
Benny Johnson
22 hours ago🚨BREAKING: Epstein List, Flight Logs RELEASED Any Minute By Trump | ELITES IN PANIC
183K230