👋 Welcome! I’m documenting my progress through #DEZoomcamp DataTalksClub’s Data Engineering Zoomcamp.
Containerize a Postgres-based data pipeline and automate GCP infrastructure with Terraform.
Docker | Terraform | Python | Google Cloud Platform (GCP) | Postgres
# Sample Ingestion Script (taxi_ingest.py)
import pandas as pd
from sqlalchemy import create_engine
def load_data(df: pd.DataFrame, user: str, password: str, host: str, port: str, db: str):
engine = create_engine(f'postgresql://{user}:{password}@{host}:{port}/{db}')
df.to_sql('nyc_taxi', engine, if_exists='replace')
docker-compose.yaml
.đź”— Module 1 Code