Full Time

Site Reliability Engineer

Dexian Inc
Remote!
$100,000 - $150,000* / year

Job Description

Job Overview

Dexian Inc. is seeking a proactive and skilled Site Reliability Engineer (SRE) to enhance the reliability, availability, and performance of our distributed systems. In this pivotal role, you will collaborate with cross-functional teams to automate and improve operational processes, contributing to the design and tooling that promote stability and efficiency across our platforms. The ideal candidate will leverage a strong understanding of both software engineering and systems administration to solve complex problems and enhance user experiences through reliable and scalable systems.



Job Responsibilities

  • Design, implement, and maintain system reliability and availability monitoring solutions.
  • Develop and deploy automation tools to streamline operations and reduce manual intervention.
  • Collaborate with development teams to ensure that applications are designed with reliability and scalability in mind.
  • Manage service uptime and incident response, overseeing troubleshooting and root cause analysis.
  • Conduct capacity planning and performance tuning of the infrastructure and applications.
  • Participate in on-call rotations and provide support during outages and high-impact incidents.
  • Document system architecture, incidents, and standard operating procedures, ensuring that knowledge is shared across the team.


Qualifications

  • Bachelor’s degree in Computer Science, Engineering, or related field.
  • 3+ years of experience in a Site Reliability Engineering, DevOps, or related role.
  • High proficiency in at least one programming language (e.g., Python, Go, Java).
  • Strong experience with cloud platforms (e.g., AWS, Azure, Google Cloud).
  • Proven knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes).
  • Familiarity with configuration management tools (e.g., Ansible, Chef, Puppet).
  • Excellent problem-solving and analytical skills with the ability to troubleshoot complex systems.
  • Strong communication skills and the ability to work effectively with teams and stakeholders.


Benefits

  • Competitive salary and performance-based bonuses.
  • Comprehensive health, dental, and vision insurance plans.
  • 401(k) retirement plan with company match.
  • Flexible work hours and remote work options.
  • Professional development opportunities including workshops and certifications.
  • Generous paid time off and holiday policy.
  • Employee wellness programs and support resources.


Technologies & Tools

The Site Reliability Engineer at Dexian Inc. will work with a variety of modern technologies and tools designed to enhance operational efficiency. Expect to engage with cloud services like Amazon Web Services (AWS) or Google Cloud Platform, utilize container orchestration platforms such as Kubernetes, and apply monitoring solutions like Prometheus or Grafana. Additionally, proficiency in scripting and programming languages such as Python or Go, along with configuration management and infrastructure-as-code tools like Ansible or Terraform, will be integral to the role.



Ideal Candidates

The ideal candidate for the Site Reliability Engineer role at Dexian Inc. is a detail-oriented individual with a passion for technology and a commitment to excellence. They thrive in collaborative environments, possess a deep understanding of both software and systems, and are driven by the challenge of improving system reliability. Strong communication skills, coupled with an analytical mindset and the ability to adapt to the dynamic nature of tech operations, will further enhance their suitability for this position. An eagerness to learn and share knowledge within the team is also essential.
