Data Engineer

Preston Thomas

I build dependable data pipelines and analytics systems that turn messy, distributed inputs into decision-ready insight.

Preston Thomas

About

As a Data Engineer Intern at Mutually Human, I develop scalable ETL pipelines using Azure Data Factory and Databricks while supporting production reporting workflows with SQL, Python/PySpark, and Java.

I am pursuing a Bachelor of Science in Mathematics and Computer Science at Grand Valley State University, focused on engineering reliable, high-impact, data-driven systems.

Certifications

  • Microsoft Certified: Fabric Data Engineer Associate (DP-700)
  • Databricks Academy Accreditation - AI Agents Fundamentals
  • Databricks Certified Data Engineer Associate (in progress)

Skills

Languages

PythonSQLJava

Data Platforms

DatabricksPySparkAzure Data FactoryMicrosoft Fabric

Databases

PostgreSQLMicrosoft SQL ServerSQLite

Tools

PandasDjangoGitHubAzure DevOpsDockerTerraformJira

Experience

Data Engineer Intern

Mutually Human · Grand Rapids, MI · July 2024 - Present

  • Built a multithreaded Java service in SSIS to ingest 8 tables in parallel and cut runtime from 3+ hours to 15 minutes.
  • Owned an end-to-end ETL pipeline for a $20B national retailer using Azure Data Factory and Databricks with PySpark transformations.
  • Help improve reliability across reporting workflows for 20+ client environments through optimized data engineering patterns.

Information Technology Intern

Morrison Industries · Grand Rapids, MI · January 2024 - July 2024

  • Developed a SQL-based reconciliation tool that identified 97 retired devices and removed $45,000+ in annual overhead.

Outside of Work

I am a Detroit sports fan, enjoy fantasy football and NFL-focused side projects, and spend free time gaming, lifting, and being outdoors with friends and family when Michigan weather cooperates.