Senior Data Engineer
I enjoy improving the performance, reliability, and clarity of data systems, whether through pipeline redesigns, cloud migrations, or hands-on engineering in Spark, SQL, and Python.
I gravitate toward collaborative, remote-first teams, and I enjoy contributing to architecture discussions while staying close to the code.
I'm also passionate about mentoring newer engineers, creating clear documentation, and helping teams adopt better practices around data quality, lineage, and maintainability.
2015-2019
Virginia Tech
Magna Cum Laude
2019-2020
Bloom Institute of Technology
Issued Dec 2024 - Expires Jan 2027
April 2020 - present | Remote
Senior Data Engineer
Led a multi-stage migration of 20+ pipelines from Hadoop 2 to Hadoop 3 and then to GCP, modernizing core reporting infrastructure
Owned reporting pipelines for the Quotes System (~30 GB / 250M rows) and Pro Extended Aisle (~2 TB / 6M rows), enabling visibility into key B2B sales and margin metrics
Cut Pro Extended Aisle runtime from 4 hrs to 45 mins, delivering start-of-day reporting instead of mid-afternoon and improving operational responsiveness
Upgraded Quotes System refresh from daily to every 30 mins, accelerating insights for sales, pricing, and finance teams
Identified negative-margin quotes during a pricing issue, preventing $3.7M in losses
Built and maintained Spark (Python/Scala) jobs; troubleshot multi-stage pipeline dependencies across Hadoop and GCP
Created Confluence documentation (logic, consumers, lineage, ADRs) used by 5+ downstream teams
Mentored new engineers and led internal sessions on GCP/BigQuery best practices
Supported additional domains (CRM, Tool Rental, Blueprint) to ensure reliability and modernization success
Partnered with data architecture teams to align pipelines with long-term platform strategy
Roanoke, VA
(703)-896-8911
ericwuerfel@protonmail.com