Full Time

Sr. Site Reliability Engineer

Veza
Remote!
$120,000 - $160,000* / year

Job Description

Overview

Veza is seeking a highly skilled and experienced Senior Site Reliability Engineer to join our dynamic engineering team. The ideal candidate will play a critical role in enhancing and maintaining system reliability and performance. This position requires a deep understanding of cloud infrastructure, tooling, and practices that support scalability, security, and uptime. As a Senior Site Reliability Engineer, you will collaborate closely with development and operational teams to implement best practices and innovative solutions that drive a reliable environment.



Job Responsibilities

  • Design and implement automated systems for monitoring, logging, and performance analysis.
  • Ensure high availability and reliability of production systems through proactive issue detection and resolution.
  • Develop and maintain scalable infrastructure using modern cloud technologies.
  • Collaborate with development teams to build release and deployment practices that enhance system reliability.
  • Participate in on-call support rotation to address system incidents and ensure swift recovery.
  • Continuously evaluate and optimize existing systems and workflows for performance and cost-effectiveness.
  • Provide mentorship and guidance to junior engineers and team members, fostering a culture of learning and improvement.


Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field; Master's degree is a plus.
  • 5+ years of experience as a Site Reliability Engineer or similar role within a production environment.
  • Proficiency in programming languages such as Python, Go, or Java.
  • Strong experience with configuration management tools (e.g., Terraform, Ansible).
  • In-depth knowledge of cloud platforms (AWS, Azure, or Google Cloud) and container orchestration technologies (e.g., Kubernetes).
  • Excellent analytical skills with a focus on problem-solving and debugging complex systems.
  • Strong understanding of networking, security, and performance best practices.


Benefits

  • Competitive salary and performance-based bonuses.
  • Comprehensive health, dental, and vision insurance plans.
  • 401(k) retirement plan with company matching.
  • Flexible work hours and the option for remote work.
  • Generous paid time off and holiday leave.
  • Opportunities for professional development and continuing education.
  • Company-sponsored events and team-building activities.


Technologies & Tools

The Senior Site Reliability Engineer will work with various technologies and tools essential for maintaining system reliability. Key technologies include cloud computing platforms (AWS, Azure, Google Cloud), containerization and orchestration tools (Docker, Kubernetes), configuration management and automation tools (Terraform, Ansible), as well as monitoring and logging tools (Prometheus, Grafana, ELK Stack). Familiarity with CI/CD pipelines and version control systems (Git) will also be critical.



Ideal Candidates

The ideal candidate for the Senior Site Reliability Engineer position at Veza will possess strong technical acumen, exceptional problem-solving abilities, and a collaborative spirit. They should be proactive in driving results and possess a strong desire to improve processes and systems continually. A successful candidate will communicate effectively, adapting their messaging for various stakeholders within the organization, and be willing to take the initiative on complex projects. Ultimately, they should embody a commitment to delivering high-quality, reliable systems and services.

View Similar Jobs

Matches Jobs

Similar jobs which you may be interested in. Typically using your existing skillset.