The role

As a Site Reliability Engineer at DEXMA you will work along with your peers, but also with the Software Developers (Java/JavaScript) & Data Scientists (Python/R), applying Continous Delivery practices, monitoring the metrics & logs to anticipate possible failures, and help the team adopting Kubernetes

You will be part of a small team that has plenty of autonomy deciding the roadmap and the technologies that will make the difference, who loves automating every repetitive process and helps the whole organization to be efficient, using Lean principles and Continuous Delivery practices.

As part of the team your job will consist of:

  • Manage our Google Cloud infrastructure using Infrastructure as Code
  • Increase the adoption of Kubernetes abroad our infrastructure
  • Coordinate with Software Developers to build and scale micro-services
  • Manage and monitor the service availability using DataDog and PagerDuty
  • Continue improving our CI/CD pipelines to make developers’ work more and more efficient
  • Identify, diagnose and propose improvements or solutions to current (or future) problems
  • Perform rotative 24x7 (remunerated) duties to guarantee service availability (only system-triggered alerts, no on-call support)

About you

  • You are a Site Reliability / DevOps / Systems Engineer with at least 3 years of experience managing cloud environments (Google Cloud or AWS)
  • You like to have autonomy and give a try to new technologies.
  • You believe in Infrastructure as Code
  • You ensure the quality of your work by creating automated tests and by submitting it to Code Reviews
  • You have experience in Continuous Delivery Pipelines (ideally with Spinnaker)
  • You have strong skills in at least one scripting language (bash, perl, python, go, etc.)

Technology stack:

The stack is mainly based on open source technologies with the support of commercial tools and services that make a difference:

  • Cloud Provider: Google Cloud Platform
  • Cloud-native tools: Cloud Functions, Pub/Sub, Storage, DataFlow, ...
  • Cloud Infrastructure Tools: Docker, Kubernetes, Helm, Puppet, Terraform, Packer, Consul, Vault
  • DBs/Middleware: PostgreSQL, MongoDB, Redis, RabbitMQ
  • CI/CD: Jenkins, Artifactory, Spinnaker, SonarQube
  • Monitoring: DataDog, PagerDuty
  • Team Management: JIRA, Confluence, BitBucket, Slack, Google Suite

The offering:

Join a friendly, humble and talented group with 10+ different nationalities, with offices in the center of Barcelona. An ideal place to grow and evolve your career to the next level, with a salary and career path according to your skills and experience. Also we offer:

  • MacBook Pro + large monitor(s)
  • Flexible working hours
  • Remote-friendly [that will continue after the pandemic]
  • Personal budget for training (courses, conference passes, technical books, etc.)
  • Ticket Restaurant / Transport / Kindergarten
  • Andjoy (aka gym4less)
  • Health Insurance (discounted price)
  • Company hackathons

and when it's possible again...:

  • Monthly all-hands lunch
  • Team events (BBQ, football, outdoor activities...)
  • Free office goodies: coffee, fresh fruit, snacks, …

Given the current pandemic situation, all the company has been working 100% remote for almost a year. We are accepting permanent remote applications for those living no farther than ~1.000 km in well-communicated areas. When COVID ends, you will be asked to come once a month to the office to meet your colleagues.