David J. Vigh


Amgen - Chicago, IL

Senior Data Engineer
October 2023 - Present
  • Developing and maintaining a multi-cloud data platform
  • Leading the development of new internal tools to enhance operational efficiency
  • Building a custom orchestrator to decrease data turnaround times from days to hours
  • Guiding direct reports in crafting impactful software solutions

Horizon Therapeutics - Chicago, IL

Senior Data Engineer
July 2022 - October 2023
  • Facilitated a smooth transition of an upgraded data format
  • Expanded data coverage by integrating additional sources, ensuring comprehensive insights
  • Oversaw and mentored a summer intern in successfully developing an internal doctor lookup tool
Data Engineer II
July 2021 - July 2022
  • Increasing scalability using a multi-cloud platform
  • Continue MDM project development with stakeholders across analytics departments
  • Managing and improving the scalability and capabilities of the data science platform
  • Improving ETL pipelines with new features and data sources
Data Engineer
January 2020 - June 2021
  • Decreased processing time by 5x of monthly deliveries of 10B rows of data
  • Designed, built, and maintained multiple MDM projects improving organizational clarity of data sets
  • Developed automated data pulls using Selinium
August 2019 - December 2019
  • Migrated on premises distributed computing environment to GCP to increase platform scalability

Software Developer and Data Engineer - Chicago, IL

June 2018 - August 2019
  • Developed data warehouses for several projects
  • Designed and implemented web scraper in Python

Clarkson University - Potsdam, NY

Statistics Teaching Assistant
Sep. 2017 - May 2018
  • Explained technical statistical concepts to students in clear comprehensible terms
  • Developed course curriculum


Clarkson University - Potsdam, NY
MS, Applied Data Science
May 2017
BS, Computer Science
May 2018
Computer Science
Git, Python, Go, SQL, noSQL, C++, APIs, RESTful Interfaces, Django, Flask, Linux, HTML, CSS, and Machine Learning
Data Management
GCP (compute, storage, dataproc), AWS (S3, EC2, Lambda), MongoDB, Pyspark, Spark, SQL Server, Selinium, Tableau, Docker