Site Reliability Engineer
20 hours ago
Claire Joster is currently recruiting on behalf of a leading client in the car rental sector that aims to strengthen its internal team by adding a Site Reliability Engineer (m/f).
Responsibilities:
- Define Reliability: design, implement, and monitor Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for our production services;
- Automation: write code and scripts (e.g., Python, Go, Bash) to automate operational tasks, system provisioning, and incident remediation;
- Incident Response: act as a key responder for production incidents, participate in a 24/7 on-call rotation, lead troubleshooting efforts, and drive incidents to resolution;
- Blameless Post-mortems: lead and participate in blameless post-incident reviews to identify root causes and implement lasting corrective actions;
- System Architecture: partner with development teams to design, build, and deploy scalable, highly available, and fault-tolerant systems;
- Monitoring & Observability: build and maintain comprehensive monitoring and logging solutions (e.g., Prometheus, Grafana, ELK Stack, Datadog) to proactively detect and diagnose issues;
- Capacity Planning: monitor system performance and usage, forecast demand, and plan for future capacity needs;
- Reduce Toil: identify and eliminate manual, repetitive operational work by building durable, automated solutions.
Requirements:
- Minimum 5 years of experience in Site Reliability Engineering, software engineering, or large-scale systems administration;
- Strong experience with cloud platforms (AWS, Azure);
- Proficiency with Infrastructure as Code (IaC) tools (e.g., Terraform, Ansible, CloudFormation);
- Hands-on experience with CI/CD tools (e.g., Jenkins, GitLab CI, GitHub Actions);
- Solid understanding of containerization technologies (Docker) and orchestration systems (Kubernetes);
- Experience with version control systems, particularly Git;
- Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack);
- Strong analytical and troubleshooting skills;
- Experience with on-call rotations and incident management;
- Good communication skills and a proactive attitude.
-
Site Reliability Engineer
2 weeks ago
Lisboa, Lisboa, Portugal ISPROX Full-time
ISPROX is a talent recruiting organization. Our goal is to find and select the best human capital and talent for our clients to help them grow or sustain their business. ISPROX has a presence in several locations in Europe so as to be as close as possible to our clients. ISPROX is looking for: We are selecting for our client, a multinational...
-
Site Reliability Engineer
21 hours ago
Lisboa, Lisboa, Portugal IDW Full-time
IDW is a Portuguese company, recognized for the quality of its services and its people, focused on presenting its clients with the best business solutions based on information technology. At IDW we design and implement solutions and services for some of the largest companies operating in Portugal and internationally. We are...
-
Senior Site Reliability Engineer
20 hours ago
Lisboa, Lisboa, Portugal INSCALE Full-time
Why Join Us? JYSK is a global retail chain that brings Scandinavian design and quality to the world through an extensive range of quality products for sleeping and living. JYSK is known for its commitment to simplicity, functionality, and affordability. With over 3,200 stores in 48 countries, JYSK is a trusted brand for customers seeking to create comfortable and...
-
Senior Site Reliability Engineer
1 week ago
Lisboa, Lisboa, Portugal Arcesium Full-time
Arcesium is a global financial technology firm that solves complex data-driven challenges faced by some of the world's most sophisticated financial institutions. We constantly innovate our platform and capabilities to meet tomorrow's challenges, anticipate the risks our clients encounter, and design advanced solutions to help our clients achieve...
-
DevOps / Site Reliability Engineer
21 hours ago
Lisboa, Lisboa, Portugal PrimeIT Full-time
PrimeIT is a leading company with more than 18 years of experience providing technology services in IT, Telecommunications, and Engineering. Specializing in Team Extension, Managed Services, Custom Software, and Nearshore, we currently have a team of more than 2,350 professionals working on national and international projects,...
-
Lead Site Reliability Engineer
2 weeks ago
Lisboa, Lisboa, Portugal EPAM Systems Full-time
We are looking for a Lead Site Reliability Engineer to enhance a global execution platform, delivering robust solutions to trading desks and clients. You will collaborate with expert teams, advancing your expertise in system administration, monitoring, and low-latency technologies. Join us to contribute to cutting-edge financial technology innovations. Note that...
-
Azure Site Reliability Engineer
5 days ago
Lisboa, Lisboa, Portugal Findmore Consulting, S.A. Full-time
Iterable is the leading AI-powered customer engagement platform that helps leading brands like Redfin, SeatGeek, Priceline, Calm, and Box create dynamic, individualized experiences at scale. Our platform empowers organizations to activate customer data, design seamless cross-channel interactions, and optimize...
-
Lead Site Reliability Engineer
6 days ago
Lisboa, Lisboa, Portugal Arcesium Full-time
Company Overview
Arcesium is a global financial technology firm that solves complex data-driven challenges faced by some of the world's most sophisticated financial institutions. We constantly innovate our platform and capabilities to meet tomorrow's challenges, anticipate the risks our clients encounter, and design advanced solutions to help our clients...
-
Lead Site Reliability Engineer
6 days ago
Lisboa, Lisboa, Portugal Arcesium LLC Full-time
Company Overview
Arcesium is a global financial technology firm that solves complex data-driven challenges faced by some of the world's most sophisticated financial institutions. We constantly innovate our platform and capabilities to meet tomorrow's challenges, anticipate the risks our clients encounter, and design advanced solutions to help our clients...
-
Senior Site Reliability Engineer
2 weeks ago
Lisboa, Lisboa, Portugal EPAM Systems Full-time
We are seeking a Senior Site Reliability Engineer to support a global execution platform and deliver high-quality solutions to trading desks and clients. You will work closely with top specialists, developing your skills in system management, monitoring, and low-latency technology. Apply now to be part of a team driving innovation in financial technology. Please...