Match score not available

Principal Platform Engineer

extra holidays - extra parental leave
Remote: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

7+ years of experience in Linux systems engineering roles supporting bare metal servers and virtualization/container platforms., 3+ years’ Kubernetes administration experience on Red Hat OpenShift., Proven ability to automate processes using scripts, configuration management tools, and CI pipelines., Competency in at least one high-level programming language such as Python or Go..

Key responsabilities:

  • Translate high-level platform design into low-level technical design and implement the Red Hat OpenShift Container Platform.
  • Collaborate with software delivery teams to build and support CI/CD pipelines and self-service mechanisms.
  • Ensure the ongoing stability, availability, performance, and security compliance of the platform.
  • Mentor and develop engineers while participating in an on-call rotation with team members.

DomainTools logo
DomainTools Cybersecurity SME http://www.domaintools.com/
51 - 200 Employees
See all jobs

Job description

Principal Platform Engineers translate high level platform design into low level technical design and are responsible for implementing, administering, supporting, and patching their corresponding platforms. Platform Engineers work closely with Engineering Architects to enable the capabilities defined on roadmaps and blueprints supporting platform programs and initiatives. Platform Engineers are well versed in modern data, infrastructure and integration platforms, industry/technology best practices, and actively work on improving the reliability and scalability of infrastructure.

This position is responsible for the designing, building, supporting, installing, and configuring of the Red Hat OpenShift Container Platform in both Cloud and On-Prem Environments.

  • Installs, configures, and monitors applications and services in the OpenShift cluster.
  • Continually assesses technical components to recommend platform improvements, translating high-level design and RHOS best practices into low-level technical configuration.
  • Ensures the ongoing stability, availability, performance, and security compliance of the platform to meet customer SLAs; authors and executes test cases to validate
  • Collaborates with software delivery teams and architects to build and support self-service mechanisms, CI/CD pipelines, and k8s operators that simplify and accelerate service delivery, in accordance with DevOps and Agile frameworks
  • Maintains the catalog of services for the platform in collaboration with Engineering.
  • Instruments and optimizes application, system, and cluster performance.
  • Forecasts and plans capacity increases to ensure resource availability for engineering teams while meeting budget targets.
  • Helps build and implement Disaster Recovery / Business Continuity plan; conducts related testing of recovery procedures.
  • Helps determine Platform roadmap, manage projects and ticket-based work; ensures these are clearly communicated with stakeholders at all levels.
  • Provides thought leadership on DevOps and Platform Engineering-centric system and process design, giving constructive input to engineers and leaders on proposals and best practices.
  • Builds internal documentation and artifacts describing the mechanisms used for deployment, monitoring, and operators.
  • Leads by showing: mentors and helps develop engineers in a highly demonstrative and collaborative way
  • Participates in an on-call rotation with fellow team members

Location: Remote - US

Compensation: $145,000 - $185,000 + 15% Annual Bonus

Requirements

Requirements:

  • 7+ years of experience in Linux systems engineering roles supporting bare metal servers and virtualization/container platforms
  • 3+ years’ Kubernetes administration experience on Red Hat OpenShift.
  • Experience building and managing infrastructure in both public cloud and physical data center environments using IaC tools
  • 5+ years’ experience with enterprise monitoring and logging solutions like Prometheus, ELK, or similar
  • Proven ability to automate the right things in the simplest way possible (scripts, config management tools, CI pipelines, RHOS Operators, etc.)
  • Solid understanding of networking fundamentals and storage technologies
  • Competency in at least one high level programming language (i.e., Golang, Python, etc.)
  • Ability to communicate well orally and in written form, and publish docs that are easily understood by stakeholders
  • Experience supporting customer-facing SaaS products
  • Willing to dive in and own problems that are new, ambiguous, and/or complex
  • Proven ability to prioritize where improvements are most needed; sees the forest for the trees
  • High team standards around communication, giving and receiving feedback, continuous improvement, and operational excellence

Relevant Technologies:

  • Compute: Rocky Linux, Kubernetes, Red Hat OpenShift Container Platform
  • Storage: ODF, Ceph, Linux LVM and filesystems
  • Languages: Python, Go, or another high-level language
  • Automation Tools: Ansible, Terraform, GitLab, Hashicorp Vault/Consul, ArgoCD
  • Cloud: AWS (EC2, S3, EBS, Glacier)
  • Monitoring/metrics: Prometheus, Icinga/Nagios, Graphite, Grafana, CollectD
  • Logging: Loki, ELK/OpenSearch, Syslog, Journald, Kafka
  • Certifications: RHOS or other Red Hat, Linux/Unix, CCNA

Benefits

DomainTools is the global leader for Internet intelligence and the first place security practitioners go when they need to know. The world's most advanced security teams use our solutions to identify external risks, investigate threats, and proactively protect their organizations in a constantly evolving threat landscape. DomainTools constantly monitors the Internet and brings together the most comprehensive and trusted domain, website and DNS data to provide immediate context and machine-learning driven risk analytics delivered in near real-time.

DomainTools offers a comprehensive benefits package to our employees that includes fully paid medical, dental and vision insurance premiums, a 401k retirement plan with company matching, basic life insurance, flexible PTO and additional well-being benefits.

DomainTools embraces diversity, equity, and inclusion to its fullest as an equal opportunity employer. We build our teams so creativity and innovation can flourish. We believe inclusivity and equity fosters innovation and growth; and we harness this mindset to drive a culture that serves our employees and our customers. We encourage people of all backgrounds, ages, perspectives, and skill sets to apply; and do not discriminate based on age, religion, color, national origin, gender, sexual orientation, gender identity, marital status, veteran status, disability, or any other characteristic protected by law.

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Industry :
Cybersecurity
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Teamwork
  • Communication
  • Problem Solving

Platform Engineer Related jobs