Site Reliability Engineer

2 weeks ago


Lisbon, Portugal Decskill Full-time

Radisson Hotel Group is a leading hospitality company serving as a true host and best partner to guests, owners, business partners and talent. Our unique hotel brands offer award-winning and exceptional hotel experiences, originating from our strong Scandinavian heritage of design and innovation. Our brands embody our modern vision of hospitality, including authentic local tastes, stylish living design, unique locations and vibrant social scenes. Radisson Hotel Group brings a refreshed commitment to hospitality leadership to meet the changing travel industry and the bespoke needs of our guests. We provide exceptional service in all of our hotels across the globe and strive to deliver a hospitality experience that is beyond guest expectations.

Role purpose: The SRE Manager ensures the reliability, scalability, and performance of Radisson Hotel Group’s digital web and app platforms.

To achieve this, the role will:
1. Lead and mentor the SRE team to design, implement, and operate resilient systems.
2. Establish and enforce best practices for monitoring, incident response, automation, and capacity planning.
3. Partner with product, engineering, and infrastructure teams to embed reliability into the software development lifecycle.

Resulting in:
1. Highly available and performant digital platforms that enhance guest experience.
2. Reduced downtime and faster incident resolution across services.
3. A culture of reliability, automation, and continuous improvement within the Digital services.

Roles/Responsibilities
- Lead, coach, and grow a team of SREs, fostering a culture of ownership, collaboration, and innovation.
- Drive automation of operational tasks, deployments, and monitoring to reduce manual effort and human error.
- Oversee incident management processes, ensuring timely communication, root cause analysis, and postmortems.
- Collaborate with software engineering, product, and infrastructure teams to design scalable, secure, and reliable systems.
- Report on system health, reliability metrics, and operational risks to senior leadership.

Location: Madrid, Spain

Language skills: Fluency in English is a must

Must have experience
- 7+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure roles.
- 2+ years in a leadership/managerial role, leading distributed teams.
- Proven track record of managing mission-critical, customer-facing digital platforms.
- Experience with hybrid cloud environments (Azure, AWS, GCP).
- Strong knowledge of observability tools (Dynatrace, Prometheus, Grafana, Splunk, etc.).
- Expertise in automation and Infrastructure-as-Code (Terraform, Ansible, Pulumi).
- Familiarity with CI/CD pipelines, Kubernetes, and microservices architectures.

Desirable experience
- Hospitality, travel, or e-commerce industry background
- Solid understanding of networking, security, and distributed systems.
- Expertise in scripting languages (Python, Go, Bash)

Travel needs: Approximately 10% to Madrid and/or Brussels HQ

Soft skills:
- Strong leadership and people management skills
- Excellent communication and stakeholder management
- Strategic thinker with hands-on problem-solving ability
- Ability to thrive in a fast-paced, global, customer-centric environment

Education:
- University degree in Computer Science, Engineering, or related field
- Cloud, agile and/or DevOps certifications preferable.


  • Site Reliability Engineer

    3 weeks ago


    Lisbon, Portugal Paymentology Full-time

    Paymentology is the first truly global issuer‑processor, giving banks and fintechs the technology, team and experience to rapidly issue and process Mastercard, Visa and UnionPay cards across more than 60 countries at scale. Our advanced multi‑cloud...

  • Site Reliability Engineer

    3 weeks ago


    Lisbon, Portugal Ubique Systems Full-time

    3 days ago. This will be a B2B or freelance contract role. Location: Lisbon, Portugal. Responsibilities: - Strong working knowledge of DevOps tools (CI/CD pipelines in Jenkins), Git, Bitbucket, XLR - Good Linux skills (hands-on experience with commands and scripting) as...


  • Lisbon, Portugal GrabJobs Full-time

    A Site Reliability Engineer at Kevel will define reliability targets, solve security issues, automate tasks, and operate production infrastructure.


  • Lisbon, Portugal GoCardless Full-time

    The Foundations group seeks an engineer interested in infrastructure management and site reliability to build and scale GoCardless's global platform.

  • Site Reliability Engineer

    2 weeks ago


    Lisbon, Portugal KCS iT Full-time

    5 days ago. We’re looking for the special, unique and amazing YOU! At KCS IT, we look for the ones who stand out, for those who always want to be better and fight for it, and for those who have the same values that we do: dedication, energy, integrity, transparency, flexibility, trust, honesty, hard work, proactivity, team...


  • Lisbon, Portugal beBeeReliability Full-time

    Site Reliability Expertise for Global Success We are seeking a highly skilled Senior Site Reliability Engineer to join our dynamic Platform Site Reliability Engineering team. This expert will play a crucial role in ensuring the stability, reliability, and availability of mission-critical production applications on our platform. This includes observing system...


  • Lisbon, Portugal Claire Joster Full-time

    A company in car rental services is seeking a Site Reliability Engineer in Lisbon, Portugal. The role requires at least 5 years of experience, expertise in cloud platforms like AWS and Azure, and proficiency in automation using Python and Go. Responsibilities include defining reliability metrics, incident response, and developing scalable systems. This...


  • Lisbon, Lisbon, Portugal beBeeReliability Full-time US$100,000 - US$120,000 per year

    Site Reliability Expertise for Global Success We are seeking a highly skilled Senior Site Reliability Engineer to join our dynamic Platform Site Reliability Engineering team. This expert will play a crucial role in ensuring the stability, reliability, and availability of mission-critical production applications on our platform. This includes observing...


  • Lisbon, Portugal GrabJobs Full-time

    Site Reliability Engineer needed to maintain and improve reliability of services, solve security issues, and automate tasks for a remote engineering team.


  • Lisbon, Portugal INSCALE Full-time

    Why Join Us? JYSK is a global retail chain that brings Scandinavian design and quality to the world through an extensive range of quality products for sleeping and living. JYSK is known for its commitment to simplicity, functionality, and affordability. With over 3,200 stores in 48 countries, JYSK is a trusted brand for customers seeking to create...