Implemented pandas-based cleaning rules in data_preprocessing.py, transformations for salesorder.csv → clean_salesorder.csv, pipeline testing via multiple DAG runs.
This project is a reference implementation of a batch-oriented analytics pipeline designed to reflect how data transformations evolve from simple scripts to structured data platform workflows. In many ...