Site Reliability Engineer

We are looking for an experienced Site Reliability Engineer to join our expanding IT Operations team.

The successful candidate will be part of a large, global multi-national organization working with an array of platforms, software and systems. It is a challenging role within a rapidly expanding company with a great work ethos that values its employees and invests in their development.

We are proud to be an extraordinary equal opportunity employer. Not only do we welcome all candidates, but we also promise the opportunity to learn, grow, develop skills, gain knowledge, practice your craft, expand your expertise, and be part of a leading multinational company.

Responsibilities

  • Work with the Architecture Team and the Development Teams to translate company needs into infrastructure solutions that will suit those needs and requirements in terms of performance, resource usage, scalability, resilience and observability.
  • The proposed solutions may include on-premises, virtualised/bare-metal, cloud or hybrid architectures and must ensure the use of Continuous Integration and Continuous Delivery, Infrastructure as Code and GitOps approaches.
  • Invest time in developing and maintaining pipelines, scripts and playbooks to continuously reduce the human tasks required to operate the production services (toil).
  • Collaborate with the Architecture Team and the Development Teams in projects for moving production services to cloud environments.
  • Provisioning, operational tasks (performance, scaling, organization, routine patching, security…) and decommissioning of Linux and Windows servers.
  • Provisioning, operational tasks (performance, scaling, organization, routine patching, security) and decommissioning of OpenStack and Kubernetes clusters, deployed resources and running VMs.
  • Provide comprehensive handover, top tier technical assistance and documentation to the operating and monitoring teams.
  • Administer the Active Directory including Users & Accounts, Group Policies, Directory Schema, Sites & Services, Domains & Trusts and Scripts.
  • Management of infrastructure services such as email/SMTP, web, DNS, SNMP, DHCP, DNS, DFS, WDS and others.
  • Use Agile practices and DevOps principles to ensure continuous value delivery and alignment with business and team objectives.
  • Participate in shared on-call rotation.

Requirements

  • Experience in using Terraform to apply Infrastructure as Code.
  • Experience in automating configuration management tasks using Ansible playbooks.
  • Experience in centralized management systems (Puppet, Chef, SSCM / System Center, DSC).
  • Experience in writing scripts for automating infrastructure tasks (Python, shell script…).
  • Experience in writing automation pipelines (Jenkins, Bamboo…) is a plus.
  • Experience working with OpenStack platform (COA certification is a plus).
  • Experience with centralized logging management tools (Splunk, ELK, Fluentd).
  • Experience in Docker usage and writing custom Dockerfiles.
  • Experience in Kubernetes administration.
  • Experience in Kubernetes deployment is a plus CKA certification is a plus.
  • Understanding of Continuous Integration and Continuous Deployment tools (Jenkins, Bamboo, ArgoCD, Spinnaker, …) and practices (deployment strategies, micro-service pattern, …).
  • Clued-up on enterprise level virtualisation (VMware, KVM).
  • Experience with third party public and hybrid cloud environments.
  • Experience participating in projects for migrating on premises infrastructures solutions to cloud or hybrid platforms is a plus.
  • Advanced knowledge of internet services and networking (DNS, email – postfix, HAProxy, …).
  • Wide experience with Unix/Linux systems (Redhat/CentOS Linux) in a large-scale operations, distributed Linux production set-up.
  • Wide experience with Windows systems administration (Active Directory, Powershell) and troubleshooting, especially of event log and services.
  • Knowledge of Windows systems administration and investigation, especially of event log and services.
  • Demonstrated ability to troubleshoot systems and network problems.
  • Experience working under Agile frameworks and DevOps principles.
  • Experience working with SAFE or LESS is a plus.
  • Extremely organized with a strong attention to detail.
  • Ability to work well under pressure.
  • Demonstrated ability to manage multiple tasks and competing priorities.
  • Great communication, interpersonal and teamwork skills.
  • Fluent in English.

What we offer

  • Relocation package.
  • Medical insurance for you and your family.
  • Competitive salary and growth opportunities.
  • Free English and Spanish language classes.
  • Life insurance.
  • Fun social perks – from weekly drinks to big seasonal events.
  • Soft drinks and fresh fruit in the office

Join us! We’d love to hear from you.

Job Category: Infrastructure, IT Operations
Job Type: Full Time
Job Location: Remote, Spain

Menu