Site Reliability Engineer

extra holidays - extra parental leave - fully flexible
Remote: Hybrid
Work from: Tel Aviv (IL)

Offer summary

Qualifications:

  • At least 2 years of hands-on experience as an SRE.
  • Strong Linux system knowledge, including OS, file systems, and networking protocols.
  • Experience working with AWS cloud infrastructure.
  • Proficiency in Python and shell scripting for automation.

Key responsibilities:

  • Automate infrastructure and reduce toil through scripting and tools.
  • Participate in on-call support to resolve production issues promptly.
  • Develop and enhance observability tools like metrics and alerts for environment stability.
  • Collaborate with application teams and stakeholders to improve system reliability and performance.

SafeBreach Computer Hardware & Networking SME https://www.safebreach.com/
51 - 200 Employees

Job description

📍 Locations (Hybrid): Tel Aviv, Israel

🌟 Opportunity Highlights

SafeBreach is seeking an exceptional Site Reliability Engineer (SRE) to play a pivotal role in how we run our production services and support our customers.

👋 Who We Are

Combining the Mindset of a CISO and the Toolset of a Hacker, SafeBreach pioneered breach and attack simulation (BAS) and is the most widely used continuous security validation platform. Our platform continuously executes attacks, correlates results to help visualize security gaps, and leverages contextual insights to highlight remediation efforts.

The best thing about SafeBreach? Definitely the people. SafeBreachers are friendly and collaborative; they work hard and dream big. Together we've built an amazing culture, and we are looking to add more awesome people to our growing team!

🚀 In Technical Terms, You Will

  • Work as a hands-on SRE focused on automation and toil reduction, and participate in Ops cycles to support our product.
  • Provide on-call support on a rotating basis, resolving issues promptly.
  • Easy-to-Use Automation: Continue to grow our infrastructure automation (Azure, AWS, Jenkins) with a focus on ease of configuration.
  • Environment Stability using Observability: Build new observability (metrics and alerts), evolve what already exists, and take part in regular monitoring of infrastructure for stability.
  • Collaborative Engagement: Work closely with application owners, Technical Account Managers, and Sales Engineers on roadmap execution and continuous improvement of existing systems.


⏳ Interview Process

Average Duration: 21 days

Key steps:

  • Send your application
  • Receive a response from us within 5-7 days


(If selected)

  • Meet the Recruiter (15 minutes)
  • Meet the Hiring Manager (30 minutes)
  • Technical Interview (1 hour)
  • Home Assignment
  • Assignment review call (30 minutes)
  • Meet our team in the office
  • Final steps

Requirements

🫵 Who YOU Are

  • 2+ years of hands-on experience as an SRE.
  • Good foundations in Linux-based systems and system-level topics: OS, file system management, and networking protocols and technologies such as SSL/TLS and CDNs.
  • Previous experience working with AWS systems is a must.
  • Hands-on programming experience in Python and shell scripting.
  • Cloud Infrastructure: Prior experience deploying and operating workloads on either cloud provider (AWS or Azure).
  • Experience with monitoring and alerting tools such as Prometheus, Grafana, and CloudWatch.
  • Familiarity with managing Jenkins: maintenance, upgrades, deprecations, and agents.
  • Ability to work in a context-switching environment with teams distributed across multiple time zones.
  • Self-driven and eager to learn with a can-do attitude.
  • Well organized and detail oriented, with analytical and problem-solving abilities.
  • Good written and verbal communication in English, and good collaboration skills.


💥 Even BETTER if you have

  • Good networking fundamentals.
  • Experience with Disaster Recovery and Migration.
  • Experience with cost calculators & explorers.
  • Knowledge of and experience working with Docker and Docker Compose.

Required profile

Experience

Industry: Computer Hardware & Networking
Spoken language(s): English

Other Skills

  • Collaboration
  • Communication
  • Analytical Thinking
  • Detail Oriented
  • Problem Solving
