Senior Site Reliability Engineer

Remote: Full Remote

Offer summary

Qualifications:

  • Proven experience in designing and maintaining scalable cloud infrastructure.
  • Strong knowledge of Kubernetes management and Infrastructure as Code (IaC) using Terraform.
  • Proficiency in automation scripting with Ansible, Bash, and Python.
  • Experience with CI/CD pipelines and monitoring tools like Prometheus and Grafana.

Key responsibilities:

  • Design and maintain highly available cloud infrastructure for the project.
  • Manage Kubernetes clusters and develop automation scripts to enhance operational efficiency.
  • Lead incident response efforts and conduct root cause analysis for system issues.
  • Mentor junior engineers and guide technical decisions to foster team collaboration.

Intetics Information Technology & Services SME https://intetics.com/
501 - 1000 Employees

Job description

Intetics Inc., a global technology company providing custom software application development, distributed professional teams, software product quality assessment, and “all-things-digital” solutions, is seeking a highly skilled and experienced Senior Site Reliability Engineer to join our dynamic team.

About the project:
This project offers a unified platform that consolidates vulnerability, threat, and asset data, enabling organizations to effectively prioritize and remediate critical exposures. By integrating information from over 150 security tools, it provides a centralized hub for risk-based vulnerability management, automating workflows and enhancing security outcomes. The platform is designed to scale and streamline vulnerability and exposure management programs, ensuring efficient mitigation of potential threats.

The following skills and experience are key to succeeding in this role:

• Design, build, and maintain highly available and scalable cloud infrastructure

• Manage and scale Kubernetes clusters to handle dynamic workloads in a secure environment

• Develop and deploy Infrastructure as Code (IaC) using Terraform

• Develop automation scripts using Ansible, Bash, and Python to reduce operational toil

• Build and maintain CI/CD pipelines using Bitbucket, GitHub, or GitLab

• Build and monitor system health using tools like Prometheus, Grafana, Loki, or CloudWatch

• Lead and mentor junior engineers, guide technical decisions, and foster a culture of collaboration and continual improvement

• Facilitate incident response, conduct root cause analysis, and run blameless retrospectives

• Experience working with relational databases

Preferred Qualifications:

• DevSecOps or related field experience with hands-on expertise in AWS and GCP

• Familiarity with SingleStore or distributed database systems

• Experience with building or containerizing PHP applications

• Proven leadership experience with a track record of helping drive team success

• Experience working with vulnerability scanning technologies on any part of the tech stack (e.g., SCA, SAST, DAST, IAST, VM scanning, container scanning)

Required profile

Experience

Industry:
Information Technology & Services
Spoken language(s):
English

Other Skills

  • Mentorship
  • Collaboration
  • Leadership
