Full Time

Staff Site Reliability Engineer

2K
Remote!
$90,000 - $130,000* / year

Job Description

Job Overview

As a Staff Site Reliability Engineer at 2K, you will play a pivotal role in ensuring the reliability, availability, and performance of our online services and platforms. You will collaborate with cross-functional teams to design and implement robust infrastructure solutions, enhance operational processes, and proactively address system issues. Your expertise will help shape the direction of our reliability initiatives and guide the team in best practices, ultimately delivering a seamless experience for our customers and stakeholders.



Job Responsibilities

  • Develop, implement, and maintain scalable infrastructure solutions to support business operations.
  • Monitor system performance and troubleshoot issues to ensure optimal operation and reliability.
  • Collaborate with engineering teams to facilitate a DevOps culture through the integration of automated processes.
  • Design and conduct post-incident reviews, implementing changes based on learnings.
  • Manage deployment pipelines, ensuring efficient and smooth transitions from development to production environments.
  • Develop and maintain documentation related to systems architecture, processes, and troubleshooting guides.
  • Participate in on-call rotations to provide support and maintain service health across systems.


Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field; advanced degree preferred.
  • Minimum of 5 years of experience in site reliability engineering, systems engineering, or a related role.
  • In-depth knowledge of cloud computing platforms (e.g., AWS, Azure, Google Cloud).
  • Proficiency in scripting and programming languages such as Python, Go, or Ruby.
  • Experience with containerization technologies such as Docker and orchestration frameworks like Kubernetes.
  • Strong understanding of networking, security, and monitoring concepts.
  • Excellent problem-solving skills and ability to work in a fast-paced environment.


Benefits

  • Competitive salary and performance-based bonuses.
  • Comprehensive health, dental, and vision insurance plans.
  • Retirement savings plans with company matching contributions.
  • Generous paid time off and holiday leave.
  • Flexible work schedules and remote work options.
  • Access to professional development programs and training opportunities.
  • Employee discounts on games and merchandise.


Technologies & Tools

In this role, the Staff Site Reliability Engineer will utilize a variety of technologies including cloud services (AWS, Azure), monitoring tools (Prometheus, Grafana), containerization tools (Docker, Kubernetes), and CI/CD pipelines (Jenkins, GitLab CI). Familiarity with infrastructure as code (IAC) tools like Terraform or AWS CloudFormation will also be essential for successfully managing deployments and maintaining service reliability.



Ideal Candidates

The ideal candidate for the Staff Site Reliability Engineer position at 2K will exhibit a strong technical acumen paired with excellent communication skills. They should be collaborative, enjoying working in interdisciplinary teams, and demonstrate a passion for improving system reliability and performance. A proactive mindset towards problem-solving and an eagerness to take initiative in challenging situations will be valued. Additionally, a commitment to continuous learning and adaptation in a dynamic technology landscape is essential to thrive in this role.

View Similar Jobs

Matches Jobs

Similar jobs which you may be interested in. Typically using your existing skillset.