IBM Data Engineering
I am using this course as a guide and a way to get a hand-on understanding of thr data engineering landscape, however, IBMs cloud data sources are not widely used in the industry.
November
Postgresql
Pandas
December:
AWS:
Kineses
S3
Lambda
DynamoDB
Redshift
Janurary
PySpark
Airflow
Snowflake
February
Calculus 1 Refresher
Mathematics for Machine Learning
https://www.coursera.org/specializations/mathematics-machine-learning
March
Advanced Statistics
Utilities
Learning these tools slowly with every project
Shell Scripting
Docker & Kubernetes
Better Roadmap
https://github.com/datastacktv/data-engineer-roadmap