- Posted 01 April 2025
- LocationMadrid
- Job type Permanent
- Reference44045
Devops Engineer
Job description
Devops / Site Reliability Engineer
Madrid
Permanent
The Background
We are partnered with a world class IT services provider based in Spain but with a global presence. They are looking for an experienced Devops / Site Reliability Engineer to join their team.
The Background
In this role, you will play a key part in ensuring the seamless delivery, operation, and continuous improvement of our software platforms and infrastructure. Reporting to the Operations Team Lead, you will collaborate with cross-functional teams to drive automation, observability, and infrastructure optimization initiatives. Your mindset should be hands-on, curious, and proactive always looking to reduce toil, increase system resilience, and mentor others as you grow.
Key Responsibilities
- Collaborate with the Operations Team Lead and engineering squads to ensure seamless delivery and reliable support of our software solutions.
- Design, implement, and maintain CI/CD pipelines that are secure, efficient, and traceable.
- Build and manage infrastructure as code using tools such as Terraform and Atlantis.
- Develop and maintain configuration management and automation solutions using Ansible.
- Manage and evolve Kubernetes clusters and containerized environments using Docker.
- Implement and maintain robust monitoring, observability, and alerting systems (e.g., Prometheus, Grafana, Icinga, ELK).
- Ensure high availability, scalability, and security across infrastructure and applications.
- Migrate manual and repetitive tasks into automated, version-controlled, and documented workflows using GitOps practices.
- Write and maintain Standard Operating Procedures (SOPs) and support knowledge sharing across the team.
- Provide on-call support and participate in incident resolution and post-mortem processes.
- Actively mentor junior team members and contribute to onboarding and team development.
- Stay current with emerging technologies, best practices, and continuously bring improvement ideas to the table.
Requirements
- 5+ years of experience in DevOps, SRE, or Infrastructure roles, preferably in environments focused on automation and scalability.
- Strong organizational skills and ability to work independently or collaboratively across teams.
- In-depth experience with:
- Version control systems (e.g., Git)
- Infrastructure as code (e.g., Terraform, Atlantis)
- Configuration management tools (e.g., Ansible)
- Containerization and orchestration (e.g., Docker, Kubernetes)
- Monitoring and observability stacks (e.g., Prometheus, Grafana, Icinga, ELK)
- CI/CD systems and deployment automation
- Strong understanding of Linux environments, network fundamentals, and cloud- native architectures.
- Intermediate to advanced proficiency in English (written and spoken).
- Ability to learn new technologies quickly, solve problems autonomously, and proactively share knowledge.
- A mindset of mentorship and continuous improvement, with a focus on operational excellence.
Nice to Have:
- Experience with OpenStack, Cloudflare, or AWX.
- Experience scripting or coding in Python, Bash, or similar.
- Contributions to internal tools, open-source projects, or tech communities.
- Experience working closely with architecture and security teams to align infrastructure with business goals.