Data engineer based in New York City, with experience building end-to-end data pipelines on modern cloud infrastructure. Currently focused on GCP-based architectures and exploring LLM integration into data workflows.
Open to data engineering roles — feel free to reach out.
Languages · Python · SQL
Transform · dbt · Apache Spark
Orchestration · Apache Airflow
Cloud & storage · GCP · BigQuery · Google Cloud Storage
Infrastructure · Terraform · Docker
ML / AI · scikit-learn · LLMs · RAG pipelines
End-to-end data pipeline processing 2017–2021 NYC Citibike trip data. Raw zipped CSVs are ingested into Google Cloud Storage via Airflow, transformed with dbt into BigQuery fact and dimension tables, and visualized in a Looker Studio dashboard.
dbt Airflow BigQuery Terraform Docker GCP Spark
- Data Engineering Zoomcamp — DataTalks.Club certificate: C0F6-D1
- Google Cloud Certified Professional Database Engineer, certificate:00835ba11b854522a1298930cc5905ad
- Google Cloud Certified Professional Cloud Architect Engineer, certificate: 6af5b2513914415b81e16af3d92bd92a
- Google Cloud Certified Professional Data Engineer, certificate: 7da97b3ca55040b6b4fbc2cfacbdc624
- Google Cloud Certified Professional Machine Learning Engineer, certificate: 29b0afc8f51946b1b172c433e99ac65b
- Google Cloud Certified Associate Cloud Engineer, certificate: eb79a95a28354195a6ccc7d35ec22153

