Retail Sales Analytics Pipeline

Source: CSV
ETL: Python
Warehouse: BigQuery
BI Layer: Looker Studio

Pipeline Explorer

Extract: read CSV from data/raw

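# ingestion/load_to_bigquery.py (excerpt)
import pandas as pd

# csv_path: a CSV file under data/raw (set elsewhere in the script)
df = pd.read_csv(csv_path)
# Normalize headers to snake_case: trim, lowercase, collapse each run of
# non-alphanumeric characters into "_", then trim stray underscores
df.columns = (
    df.columns.astype(str).str.strip().str.lower()
    .str.replace(r"[^a-z0-9]+", "_", regex=True).str.strip("_")
)
# Drop rows in which every column is empty
df = df.dropna(how="all")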
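
To make the effect of the cleaning step concrete, here is a minimal sketch that applies the same header normalization and empty-row removal to a small in-memory CSV. The helper name `clean_frame` and the sample column names and values are hypothetical, chosen only for illustration; they are not taken from the project's data.

```python
import io

import pandas as pd


def clean_frame(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper wrapping the cleaning logic from the excerpt."""
    out = df.copy()
    # Same normalization as the excerpt: trim, lowercase, replace each run
    # of non-alphanumeric characters with "_", trim leading/trailing "_".
    out.columns = (
        out.columns.astype(str).str.strip().str.lower()
        .str.replace(r"[^a-z0-9]+", "_", regex=True).str.strip("_")
    )
    # Drop rows where every column is missing.
    return out.dropna(how="all")


# Made-up sample data: messy headers and one fully empty row.
sample_csv = io.StringIO(
    " Order ID ,Unit Price ($),Store-Name\n"
    "1001,9.99,Downtown\n"
    ",,\n"
    "1002,4.50,Airport\n"
)

cleaned = clean_frame(pd.read_csv(sample_csv))
print(list(cleaned.columns))  # ['order_id', 'unit_price', 'store_name']
print(len(cleaned))           # 2
```

Normalizing headers before loading keeps column names predictable for downstream SQL, and dropping fully empty rows avoids loading blank records into the warehouse.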

Source Data CSV

Source retail CSV screenshot

BigQuery Load

BigQuery raw to staging screenshot

Transformed

Transformed staging sales screenshot

Live Dashboard (Looker Studio)

Embedded Looker Studio dashboard