Hayden T. Brown

Data Engineer

Building production ETL pipelines and data infrastructure. 3+ years in the field.
I make reliable systems so we can get to the fun stuff. ☺︎

Currently working at NY's largest labor union and pursuing OMSCS at Georgia Tech.

Experience

Senior Data Analyst

CSEA Local 1000 · Sep 2024 - Present

Albany, NY

  • Own Python ETL pipeline standardizing 1,400+ agency files, identifying 30K+ unreported union-eligible workers
  • Migrated data stack to AWS cloud: 6 databases, 1M+ historical records, zero data loss, 99.9% uptime
  • Architect production PostgreSQL schema replacing legacy system, defining core entities and hardware requirements
  • Serve as technical liaison between data team and union executives, translating business requirements into data architecture

Data Analyst

CSEA Local 1000 · Apr 2022 - Sep 2024

Albany, NY

  • Built 30+ live dashboards in SplashBI for organizing campaigns serving over 250K workers in NY
  • Security overhaul: refactored 300+ production PostgreSQL queries, eliminated SQL injection vulnerabilities
  • Designed Neo4j graph database modeling 1,800+ employers, enabling complex relationship queries

Enterprise Skills

Languages

Python SQL JavaScript Shell

Data Engineering

ETL Pipelines Data Modeling Batch Processing

Databases

PostgreSQL Redis MongoDB Neo4j InfluxDB

Cloud & Infra

AWS (EC2, S3) Docker CI/CD

Tools

Git Pandas NumPy scikit-learn SplashBI

Education

Georgia Institute of Technology

M.S. Computer Science (Computing Systems)

Expected May 2028

Binghamton University

B.S. Computer Science · B.A. Mathematical Sciences

May 2020

Projects

theshelf.blog

in-progress

Art criticism platform supplemented by integrated feeds of consumption data, such as playtime or reading pace.

AstroSQLiteETL Pipelines

LudoGraph

production

Graph database schema that models the influence of individual video game design ideas across history.

Schema DesignGraph DatabaseCypher

PokeMath

production

Python battle simulator for Pokemon Red, Blue, and Yellow. Optimizes movepools and ranks individual performance.

SimulationData Visualization