Pranav Sadagopan

Data Engineer

Skills

AWS AWS
Python Python
SQL SQL
PySpark PySpark
Apache Airflow Apache Airflow
Kafka Kafka
Apache Superset Apache Superset
Flink SQL Flink SQL
dbt dbt
Apache Iceberg Apache Iceberg
Delta Lake Delta Lake
Linux Linux
Windows Windows
Docker Docker
Jenkins Jenkins

Personal Projects

Apache Flink-Driven Real-time CDC Pipeline

Real-time CDC data streaming pipeline from MySQL to Apache Iceberg using Apache Flink.

Flink SQL Apache Iceberg Hive Metastore S3 MySQL

Speeding Insights: F1 Telemetry Data Pipeline

Scalable telemetry data pipeline for analyzing Formula One racing data using Apache Spark.

PySpark Airflow dbt Apache Iceberg S3 Trino