Reliability Systems Engineer

7 days ago


Lisboa, Lisboa, Portugal Decskill Full-time

At Decskill, we empower our team to drive innovation and ensure project success. Our main mission is to deliver value through knowledge and talent, fostering a culture of excellence and investing in the development and well-being of our people. With over 600 dedicated professionals operating across three core areas (DECSKILL TALENT, DECSKILL BOOST, and DECSKILL CONNECT), we collaborate with clients to implement and manage IT infrastructures that generate long-term value.

Job Description

We are looking for a Site Reliability Engineer to join us in Lisbon. As a key member of our team, you will design, implement, and manage observability solutions to monitor the health and performance of our systems and applications. Your responsibilities will include creating and managing dashboards, alerts, and reports to provide actionable insights into system behavior and performance.

Key Responsibilities:
  • Design, implement, and manage observability solutions to monitor the health and performance of our systems and applications;
  • Create and manage dashboards, alerts, and reports to provide actionable insights into system behavior and performance;
  • Utilize preferred tools like Datadog and OpenTelemetry to build comprehensive observability platforms;
  • Troubleshoot and resolve production issues related to observability and monitoring;
  • Develop and maintain infrastructure as code using tools such as Terraform to manage observability across multi-cloud environments;
  • Develop and maintain documentation for observability solutions, including best practices and standards;
  • Collaborate with development and operations teams to ensure seamless integration of observability practices into the CI/CD pipeline;
  • Engage with internal teams to promote observability best practices and ensure consistent adoption across the organization;
  • Continuously evaluate and implement new observability tools and technologies to improve monitoring and alerting capabilities;
  • Provide mentorship and guidance to junior engineers on observability best practices and tools.

Requirements

To be successful in this role, you will need:

  • Bachelor's or Master's degree in Computer Science, Engineering, or related field (or equivalent experience);
  • Minimum of 4 years in modern observability, with a focus on cloud technologies and automation tools;
  • Proven experience in designing and implementing observability solutions using open standards;
  • Proficiency with observability tools such as Datadog and OpenTelemetry;
  • Experience with container orchestration using Kubernetes;
  • Expertise in infrastructure as code, particularly with Terraform;
  • Solid understanding of cloud platforms, specifically Azure and GCP;
  • Previous experience in software development is highly valued;
  • Excellent problem-solving skills and ability to troubleshoot complex issues;
  • Strong communication and collaboration skills, with a proven ability to work effectively in a team environment;
  • Ability to learn and adapt quickly to new technologies and methodologies;
  • Experience with other monitoring and observability tools;
  • Knowledge of additional cloud platforms and services;
  • Familiarity with DevOps practices and tools;
  • Understanding of security best practices and how they apply to observability.


  • Lisboa, Lisboa, Portugal Hexa Consulting Full-time

    We are seeking a System Reliability Engineer to join our team at Hexa Consulting. This individual will be responsible for designing, deploying, and maintaining reliable systems and applications. The ideal candidate will have a strong background in system reliability engineering, with experience in cloud-based infrastructure and containerization. This is a...


  • Lisboa, Lisboa, Portugal GrabJobs Full-time

    Job Description: Fyld is a Portuguese consulting company that specializes in IT services, bringing high-performance professionals into the field across various technological areas. We are inspired by sports management philosophy and strive to achieve peak performance with each of our consultants. Our focus is on training and excellence. We are looking for our...


  • Lisboa, Lisboa, Portugal Cisco Systems, Inc. Full-time

    Senior Site Reliability Engineer - Cisco ThousandEyes. Location: Oeiras, Portugal. Please note that we have a hybrid approach to work and would like to find someone who can come into our offices in Lisbon (Lagoas Park) once a week. Who We Are: Cisco ThousandEyes is a leading Digital Experience Assurance platform that empowers organizations to deliver seamless...


  • Lisboa, Lisboa, Portugal Bizay Full-time

    Overview: Bizay is a rapidly growing marketplace present in 24 countries, focusing on innovative marketing solutions for small and medium-sized enterprises. We aim to revolutionize the way businesses develop their marketing strategies, making it easier for them to communicate effectively. Job Summary: We are seeking an experienced Site Reliability Engineer to...


  • Lisboa, Lisboa, Portugal TN Portugal Full-time

    Systems Reliability Engineer (Full-remote), Lisbon. Client: Noesis. Location: Lisbon, Portugal. Job Category: Other. EU work permit required: Yes. Job Reference: 910708a83947. Posted: 22.03.2025. Expiry Date: 06.05.2025. Job Description: Noesis is looking for candidates with the following profile: Main Tasks and...


  • Lisboa, Lisboa, Portugal Pertemps ERP (part of Network EMEA) Full-time

    Are you excited by the idea of working within a global organization, helping to power a suite of innovative, cloud-native applications? Do you thrive in environments centered on cloud technology, microservices, and machine learning? If so, we may have the perfect opportunity for you. About the Role: As a Senior Site Reliability Engineer, you'll be a key part of...


  • Site Reliability Engineer

    3 weeks ago


    Lisboa, Lisboa, Portugal Pertemps ERP (part of Network EMEA) Full-time

    Are you excited by the idea of working within a global organization, helping to power a suite of innovative, cloud-native applications? Do you thrive in environments centered on cloud technology, microservices, and machine learning? If so, we may have the perfect opportunity for you. About the Role: As a Senior Site Reliability Engineer, you'll be a key part of a...

  • Site Reliability Engineer

    4 weeks ago


    Lisboa, Lisboa, Portugal GrabJobs Full-time

    Site Reliability Engineer - Get Hired Fast, Lisboa. Location: Lisboa, Portugal. Job Category: Other. EU work permit required: Yes. Posted: 24.01.2025. Expiry Date: 10.03.2025. Job Description: We are hiring a competitive Site Reliability Engineer to join our vibrant team at Dellent in Porto. Growing your career as a Full Time...

  • Site Reliability Engineer

    2 weeks ago


    Lisboa, Lisboa, Portugal GrabJobs Full-time

    Site Reliability Engineer - Get Hired Fast, Lisboa. Location: Lisboa, Portugal. Job Category: Other. EU work permit required: Yes. Posted: 16.03.2025. Expiry Date: 30.04.2025. Job Description: We are hiring a competitive Site Reliability Engineer to join our vibrant team at Dellent in Porto. Growing...


  • Lisboa, Lisboa, Portugal TN Portugal Full-time

    About the Role: We are seeking a highly skilled System Reliability Engineer to join our team at TN Portugal. This is an exciting opportunity for a motivated individual to take on a challenging role and contribute to the success of our organization. Key Responsibilities: Lead and onboard services and teams to ensure reliability tenets are implemented and...


  • Lisboa, Lisboa, Portugal Cisco Systems, Inc. Full-time

    About the Role: Cisco Systems, Inc. is seeking a highly skilled Distinguished Platform Reliability Expert to join our Production Engineering team. The ideal candidate will possess a strong background in SaaS and operations, with expertise in designing and managing large-scale, highly available distributed systems in the cloud. They will collaborate closely...


  • Lisboa, Lisboa, Portugal Noesis Full-time

    Noesis is looking for candidates with the following profile: Main Tasks and Responsibilities: Lead and onboard services and teams to the reliability tenets; Establish and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs); Design and implement scalable, reliable, and secure infrastructure, while ensuring cloud-native best practices;...


  • Lisboa, Lisboa, Portugal Datadog Full-time

    Role Overview: We are seeking a skilled Site Reliability Engineer to join our team at Datadog. The successful candidate will be responsible for ensuring the reliability, availability, and performance of our high-volume environments. Collaborating closely with product engineers, you will contribute to the development of infrastructure frameworks that support our...


  • Lisboa, Lisboa, Portugal Synctiv Recruitment Full-time

    Job Title: Principal Site Reliability Engineer (SRE) | Cloud Reliability Engineer. Location: Brussels, Belgium (hybrid). Contract: Permanent. Sector: Insurance. Languages: English proficiency. Context: As a Head Site Reliability Engineer in the Reliability Team, you will play a crucial role in ensuring the reliability, availability, and performance of advanced...


  • Lisboa, Lisboa, Portugal Match Profiler Full-time

    Senior Site Reliability Coordinator Role. We are seeking an experienced Site Reliability Engineer to join our internal team/client. As a key member of our organization, you will play a vital role in ensuring the reliability and performance of our systems. About Us: Match Profiler is an Information Systems consultant with a strong presence in the national and...


  • Lisboa, Lisboa, Portugal TN Portugal Full-time

    Senior Site Reliability Engineer (SRE), Lisbon. Client: Tillster. Location: Lisbon, Portugal. Job Category: Other. EU work permit required: Yes. Job Reference: f9290ec8a979. Posted: 15.03.2025. Expiry Date: 29.04.2025. Job Description: Sr. Site Reliability Engineer (SRE). Remote but must be located/reside in Portugal. The...