Data Engineer
Preston Thomas
I build dependable data pipelines and analytics systems that turn messy, distributed inputs
into decision-ready insight.
About
As a Data Engineer Intern at Mutually Human, I develop scalable ETL pipelines using Azure Data Factory
and Databricks while supporting production reporting workflows with SQL, Python/PySpark, and Java.
I am pursuing a Bachelor of Science in Mathematics and Computer Science at Grand Valley State University,
focused on engineering reliable, high-impact, data-driven systems.
Certifications
- Microsoft Certified: Fabric Data Engineer Associate (DP-700)
- Databricks Academy Accreditation - AI Agents Fundamentals
- Databricks Certified Data Engineer Associate (in progress)
Skills
Data Platforms
Databricks, PySpark, Azure Data Factory, Microsoft Fabric
Databases
PostgreSQL, Microsoft SQL Server, SQLite
Tools
Pandas, Django, GitHub, Azure DevOps, Docker, Terraform, Jira
Experience
- Built a multithreaded Java service in SSIS to ingest 8 tables in parallel and cut runtime from 3+ hours to 15 minutes.
- Owned an end-to-end ETL pipeline for a $20B national retailer using Azure Data Factory and Databricks with PySpark transformations.
- Help improve reliability across reporting workflows for 20+ client environments through optimized data engineering patterns.
- Developed a SQL-based reconciliation tool that identified 97 retired devices and removed $45,000+ in annual overhead.
Outside of Work
I am a Detroit sports fan who enjoys fantasy football and NFL-focused side projects. I spend my free time gaming,
lifting, and being outdoors with friends and family when Michigan weather cooperates.