Building scalable data pipelines and automation solutions
4+ years of experience designing ETL pipelines, optimizing cloud data warehouses, and delivering big data solutions that reduce processing time and improve scalability.

About
Results-driven Data Engineer with expertise in designing scalable ETL pipelines, optimizing cloud data warehouses, and delivering big data solutions. Hands-on with Matillion, Databricks, SSIS, Snowflake, PySpark, Azure, and Power BI to automate data workflows and improve reporting accuracy.
Proven track record of reducing ETL runtime by 30% and improving scalability for large-scale enterprise datasets across global operations.
Experience
Data Engineer
Leading Global Automotive & Manufacturing Enterprise (Supply Chain Division)
- Designed and deployed 15+ scalable ETL/ELT pipelines using Snowflake, Matillion, and SSIS — reducing processing time by 30% and improving scalability for enterprise datasets
- Developed the PFEP KPI dashboard in Databricks, Power BI, and PySpark — reducing manual tracking by 25+ hours/month and improving cross-team visibility
- Built and delivered 10+ executive dashboards in Power BI — improving leadership decision-making and defect tracking
- Optimized a Python-based classification algorithm, reducing execution time from 2 hours to under 5 minutes
- Integrated heterogeneous data sources (Oracle, SQL Server, Snowflake, Azure Blob) into centralized reporting models, improving reporting accuracy by 15%
Key Projects
Global Manufacturing KPI Dashboard
Built an executive-level Power BI dashboard to track global operational metrics including quality, productivity, and delivery. Achieved SLA-driven refresh times of under 30 minutes with 95%+ data accuracy across multiple global plants.
PFEP Dashboard
Built a centralized reporting platform providing complete visibility into parts information across supplier, packaging, and warehouse operations. Eliminated manual reporting, saving 25+ analyst hours/month.
VPI Tariff Estimation & Simulation
Automated ingestion of program data from SharePoint into Databricks workflows. Designed and optimized Python algorithms to handle hierarchical relationships, reducing runtime from 2 hours to under 5 minutes.
Core Skills
Programming & Processing
Python (Pandas, NumPy), SQL, PySpark, DAX
Data Engineering & ETL/ELT
Matillion, Databricks, SSIS, Snowflake
Cloud & Infrastructure
Azure (Blob, Synapse, Data Lake), AWS (S3), Hive
Big Data & Streaming
Apache Spark, Apache Kafka, Hadoop ecosystem
Data Modeling & Warehousing
Star/Snowflake schema, Normalization, Performance Tuning
Visualization & Analytics
Power BI, Excel
Certifications
- Snowflake Core Pro Certified
- Microsoft Azure Data Fundamentals (DP-900)