Full Time

Site Reliability Engineer

Interact Software
Remote!
$90,000 - $150,000* / year

Job Description

Job Overview

As a Site Reliability Engineer (SRE) at Interact Software, you will maintain and improve the stability, reliability, and performance of our software systems in a cloud environment. You will work with development teams to design and implement scalable infrastructure and automation while keeping our services running efficiently. The role bridges software development and IT operations, helping us deliver high-quality experiences to our customers.



Job Responsibilities

  • Develop and maintain reliable systems and services in a cloud environment.
  • Implement monitoring and alerting systems to ensure uptime and performance.
  • Collaborate with developers to improve application performance and reliability.
  • Automate repetitive tasks and streamline development processes.
  • Troubleshoot production issues and conduct root cause analysis.
  • Participate in on-call rotations to respond to incidents promptly.
  • Design and manage system backups and disaster recovery plans.
  • Continuously improve system architecture and deployment practices.


Qualifications

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • 3+ years of experience in Site Reliability Engineering, DevOps, or a similar role.
  • Proficiency in programming languages such as Python, Go, or Java.
  • Experience with cloud platforms such as AWS, Azure, or Google Cloud.
  • Strong understanding of container technologies such as Docker (containerization) and Kubernetes (orchestration).
  • Familiarity with monitoring and logging tools such as Prometheus, Grafana, and the ELK Stack.
  • Solid understanding of networking, load balancing, and database management.
  • Excellent problem-solving skills and a proactive approach to system design.


Benefits

  • Competitive salary with performance-based bonuses.
  • Comprehensive health, dental, and vision insurance plans.
  • Generous annual leave and paid time off policies.
  • Flexible working hours and remote work opportunities.
  • Professional development and continuous learning opportunities.
  • Work-life balance initiatives including wellness programs.
  • Retirement plan with company matching contributions.


Technologies & Tools

In this role, you will use a range of technologies and tools to improve the reliability and performance of our systems. Key technologies include major cloud providers (AWS, Azure, Google Cloud), container tooling (Docker for containers, Kubernetes for orchestration), and CI/CD pipelines. You will also use monitoring and observability tools such as Prometheus, Grafana, and the ELK Stack to track system health and confirm that performance targets are met.



Ideal Candidates

The ideal candidate for the Site Reliability Engineer position at Interact Software combines a strong technical background with a collaborative spirit. You should be detail-oriented and proactive, able to anticipate issues before they arise and design robust solutions accordingly. Effective communication skills are essential, as the role involves constant collaboration with cross-functional teams. A passion for learning and staying current with emerging technologies will also make you a valuable asset to our team.
