Data Engineer - Pittsburgh Penguins (Pittsburgh · PA)

Pittsburgh Penguins jobs
Sports Jobs in Pittsburgh · PA
Technical Services: Technical/Engineering
We are seeking a Data Engineer with a passion for hockey to develop cloud-based data pipelines and automated data processing for our NHL Hockey Club.

Through your work, you will support out team’s efforts to compete and win championships. Our team embraces core values of integrity, innovation, and inclusion. We pride ourselves on providing meaningful guidance and opportunities to develop and expand their skill sets while also engaging with the broader analytics community. In doing so, we hope to create a path for a more diverse group of highly talented people to push the cutting edge of hockey analytics.

We believe that a diverse team is vital to building the world’s best sports intelligence platform. Thus, we strongly encourage you to apply if you identify with any marginalized community across race, ethnicity, gender, sexual orientation, veteran status, or disability. We are committed to creating an inclusive environment where all of our employees are enabled and empowered to succeed and thrive.

As a Data Engineer, you will be expected to:
  • Design, develop, document, and maintain the schemas and ETL pipelines for our internal sports databases and data warehouses
  • Implement and test collection, mapping, and storage procedures for secure access to team, league, and third-party data sources
  • Develop algorithms for quality assurance and imputation to prepare data for exploratory analysis and quantitative modeling
  • Profile and optimize automated data processing tasks
  • Coordinate with data providers around planned changes to raw data feeds
  • Deploy and maintain system and database monitoring tools
  • Collaborate and communicate effectively in a distributed work environment
  • Fulfill other related duties and responsibilities, including rotating platform support

A qualified entry-level candidate will be able to demonstrate several of the following and will be excited to learn the rest working with us:

  • Experience with ETL architecture and development in a cloud-based environment
  • Fluency in SQL development and an understanding of database and data warehousing technologies
  • Proficiency with R (strongly preferred), Python, Scala, and/or other data-oriented programming languages
  • Experience with automated data quality validation across large data sets
  • Strong software-engineering and problem-solving skills

A qualified senior candidate will be able to demonstrate all of the above at a higher level of competency plus the following:

  • Expertise developing complex databases and data warehouses for large-scale, cloud-based analytics systems
  • Experience with task orchestration and workflow automation tools (Airflow preferred)
  • Experience building and overseeing team-wide data quality initiatives
  • Experience adapting, retraining, and retooling in a rapidly changing technology environment

We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, sex, sexual orientation, age, disability, gender identity, marital or veteran status, or any other protected class.