Globe icon Login icon Recap icon Search icon Tickets icon


Data Engineering Fellowship, Baseball Informatics - Pittsburgh Pirates (Pittsburgh · PA)

Pittsburgh Pirates jobs
Sports Jobs in Pittsburgh · PA
Technical Services: Statistics
*In order to be considered for this role, after clicking "apply now" above and being redirected, you must fully complete the application process on the follow-up screen. *

We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, sex, sexual orientation, age, disability, gender identity, marital or veteran status, or any other protected class.

The Pirates Why
The Pittsburgh Pirates are a storied franchise in Major League Baseball who are reinventing themselves on every level. Boldly and relentlessly pursuing excellence by:
  • purposefully developing a player and people-centered culture;
  • deeply connecting with our fans, partners, and colleagues;
  • passionately creating lifetime memories for generations of families and friends; and
  • meaningfully impacting our communities and the game of baseball.
At the Pirates, we believe in the power of a diverse workforce and strive to create an inclusive culture centered in Passion, Innovation, Respect, Accountability, Teamwork, Empathy, and Service.

Job Summary
We are seeking individuals who are excited about the opportunity to work with baseball data sets that span a variety of domains, from player development to scouting to ball tracking and player tracking (including StatCast and Hawk-Eye).

In this role, you will utilize your problem-solving and coding skills to execute and enhance the processes that extract, transform, clean, and load data to and from external sources. You’ll also collaborate with analysts and developers in working to integrate data, playing a role in how it is used to impact players and decisions.

  • The main responsibility for this role is in assisting Baseball Informatics staff in the daily operation, reporting, maintenance, and performance of the data assets used within Baseball Operations.
  • Specifically, the person in this position will be responsible for executing daily reporting and ETL (extraction, transformation, and loading) across departments within Baseball Operations. 
  • The person will also ensure that the automated and manual processes used to ETL data to and from external sources are operational and timely. 
    • This will involve debugging, improving, designing, and implementing these processes primarily built in Python as they interact with APIs and SQL Server, AWS, Snowflake, DataRobot, and SQL Server Reporting Services.
Through this fellowship, you will get both a high-level understanding of the structure and applicability of baseball data as well as specific knowledge of those data sets. You’ll also work alongside analysts and developers to help create insights from the data.

  • Experience handling various types of data including JSON, XML, CSV
  • Coding skills in Python
  • A good understanding of or experience in building, maintaining, and querying databases
  • Better than beginner-level proficiency in SQL to perform data manipulation
  • Basic understanding of statistical concepts (probability, distributions and their measurements, variability, selection bias, linear regression)
  • Ability to work independently and be collaborative
  • A desire to learn
  • Ability to monitor and manage data operations that occur each day
  • Ability to debug, troubleshoot, and document code to solve problems
  • Ability to proactively communicate to resolve issues in a timely and clear fashion
  • Ability to manage and prioritize a task list