Smart City End to End Realtime Data Engineering Project | Get Hired as an AWS Data Engineer
In this video, you will build a Smart City end-to-end real-time data streaming pipeline, covering each phase from data ingestion through processing to storage. We'll use IoT devices, Apache ZooKeeper, Apache Kafka, Apache Spark, Docker, Python, AWS Cloud, AWS Glue, AWS Athena, AWS IAM, AWS Redshift, and finally Power BI to visualize the data in Redshift.
Like this video?
- Buy me a coffee: https://www.buymeacoffee.com/yusuf.ganiyu
- Become a member: https://www.youtube.com/@codewithyu/join
Timestamps:
0:00 Introduction
1:29 System Architecture
7:22 Project Setup
9:00 Docker containers setup and coding
26:17 IoT Services Producer
38:19 Vehicle Information Generator
48:10 GPS Information Generator
50:13 Traffic Information Generator
53:13 Weather Information Generator
58:35 Emergency Incident Generator
1:03:39 Producing IoT Data to Kafka
1:14:43 AWS S3 setup with policies
1:16:38 AWS IAM Roles and Credentials Management
1:19:14 Apache Spark Realtime Streaming from Kafka
2:01:14 Fixing Schema Issues in Apache Spark Structured Streaming
2:07:31 AWS Glue Crawlers
2:10:23 Working with AWS Athena
2:13:22 Loading Data into Redshift from AWS Glue Data Catalog
2:17:58 Connecting and Querying Redshift DW with DBeaver
2:20:51 Connecting Redshift to AWS Glue Catalog
2:23:34 Fixing IAM Permission issues with Redshift
2:26:05 Outro
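For readers who want a feel for the generator steps listed in the timestamps (vehicle, GPS, traffic, weather, emergency), here is a minimal sketch of a vehicle data generator. It is not the code from the video: the function names, field names, value ranges, and the "vehicle-001" device ID are all hypothetical, and the Kafka step is reduced to JSON serialization so the example runs with the standard library only.

```python
import json
import random
import uuid
from datetime import datetime, timedelta, timezone


def next_timestamp(current, rng):
    """Advance the simulated clock by a random 30-60 second step."""
    return current + timedelta(seconds=rng.randint(30, 60))


def generate_vehicle_record(device_id, timestamp, rng):
    """Build one simulated vehicle reading (all field names are hypothetical)."""
    return {
        "id": str(uuid.UUID(int=rng.getrandbits(128))),
        "deviceId": device_id,
        "timestamp": timestamp.isoformat(),
        "speed": round(rng.uniform(10, 40), 2),  # assumed range, arbitrary units
        "direction": "North-East",
    }


def serialize(record):
    """Encode a record as UTF-8 JSON, the form a Kafka producer would send."""
    return json.dumps(record).encode("utf-8")


def simulate(device_id, steps, seed=0):
    """Return a list of serialized readings with increasing timestamps."""
    rng = random.Random(seed)  # seeded so runs are repeatable
    clock = datetime(2024, 1, 1, tzinfo=timezone.utc)
    messages = []
    for _ in range(steps):
        clock = next_timestamp(clock, rng)
        messages.append(serialize(generate_vehicle_record(device_id, clock, rng)))
    return messages


if __name__ == "__main__":
    for message in simulate("vehicle-001", 3):
        print(message.decode("utf-8"))
```

In a real pipeline, each serialized message would be handed to a Kafka producer client instead of being printed; the other generators would follow the same pattern with their own fields.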
👦🏻 My LinkedIn: https://www.linkedin.com/in/yusuf-ganiyu-b90140107/
🚀 X(Twitter): https://x.com/YusufOGaniyu
📝 Medium: https://medium.com/@yusuf.ganiyu
🌟 Please LIKE ❤️ and SUBSCRIBE for more AMAZING content! 🌟
🔗 Useful Links and Resources:
✅ Docker Compose Documentation: https://docs.docker.com/compose/
✅ Apache Kafka Official Site: https://kafka.apache.org/
✅ Apache Spark Official Site: https://spark.apache.org/
✅ Confluent Docs: https://docs.confluent.io/home/overview.html
✅ S3 Documentation: https://docs.aws.amazon.com/s3/
✅ AWS IAM Documentation: https://docs.aws.amazon.com/IAM/latest/UserGuide/introduction.html
✨ Tags ✨
Data Engineering, Apache Airflow, Kafka, Apache Spark, Cassandra, PostgreSQL, Zookeeper, Docker, Docker Compose, ETL Pipeline, Data Pipeline, Big Data, Streaming Data, Real-time Analytics, Kafka Connect, Spark Master, Spark Worker, Schema Registry, Control Center, Data Streaming
✨ Hashtags ✨
#confluent #DataEngineering #ApacheAirflow #Kafka #ApacheSpark #Cassandra #PostgreSQL #Docker #ETLPipeline #DataPipeline #StreamingData #RealTimeAnalytics