SRE Resilience
5 days ago
Decskill was founded in 2014 as an IT consulting company, and our main mission is to deliver value through knowledge. We enable companies to meet the challenges of the digital world by providing our clients with business models that ensure technological capacity, flexibility and agility. We are more than 500 consultants with offices in Lisbon, Porto and Madrid.
DECSKILL operates in 3 main areas:
DECSKILL TALENT, through which we provide our clients with an extension to their IT teams;
DECSKILL BOOST, through which we provide our clients with software development models to increase capacity and optimize time-to-market, creating and managing teams that deliver according to their needs, at the desired speed;
DECSKILL CONNECT, through which we provide our clients with consulting services, as well as the implementation and management of information technology infrastructures.
Our practice results in the creation of value for our customers, either by delivering qualified and value-added services, or through highly qualified and motivated professionals, as well as technology solutions that allow us to operate and transform the business of our customers.
We are looking for an SRE for a new project.
Mission: An SRE focused on resilience. Someone who can look at a complex system of services, products, applications, and content that work together to deliver a full E2E customer experience in a telco company, and identify areas for improvement to make it more solid, stable, and reliable.
This is closely related to Google's definition of an SRE, but here it is almost exclusively focused on resilience itself. The work can happen before, during or after code has been written for a product.
Responsibilities:
- Define/create/implement standards and drive implementation of resilient design
- Understand what happens if a downstream service fails. How is our upstream response handled? What is the customer experience (impact)?
- Define/create/implement fallback mechanisms/circuit breakers, and understand whether it's appropriate to create one at all. Define/create the logic for those circuit breakers (experience shows today's implementations may have a negative impact)
- How do we tackle E2E resilience on a customer journey?
- Define/create/implement timeout settings E2E (these have caused negative outcomes in the past)
- Participate in complex operational issues E2E, identifying root causes and architectural solutions (or other improvements) to avoid recurrence
- Work closely with architecture team and Tech Leads in early stages of SDLC.
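To make the circuit breaker and fallback items above concrete, here is a minimal sketch in Python. It is an illustration only, not the project's implementation: the class name `CircuitBreaker`, the parameters `failure_threshold` and `reset_after_s`, and the injectable `clock` are all assumptions introduced for this example.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


class CircuitBreaker:
    """Illustrative three-state breaker: closed, open, half-open."""

    def __init__(
        self,
        failure_threshold: int = 3,      # hypothetical default
        reset_after_s: float = 30.0,     # hypothetical default
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.failure_threshold = failure_threshold
        self.reset_after_s = reset_after_s
        self._clock = clock
        self._failures = 0
        self._opened_at: Optional[float] = None

    @property
    def state(self) -> str:
        if self._opened_at is None:
            return "closed"
        if self._clock() - self._opened_at >= self.reset_after_s:
            # Cool-down elapsed: the next call is allowed through as a trial.
            return "half-open"
        return "open"

    def call(self, primary: Callable[[], T], fallback: Callable[[], T]) -> T:
        if self.state == "open":
            # Fail fast: do not touch the downstream service at all.
            return fallback()
        try:
            result = primary()
        except Exception:
            self._record_failure()
            return fallback()
        # Success closes the breaker and clears the failure count.
        self._failures = 0
        self._opened_at = None
        return result

    def _record_failure(self) -> None:
        self._failures += 1
        half_open_trial_failed = self._opened_at is not None
        if half_open_trial_failed or self._failures >= self.failure_threshold:
            # Open (or re-open) and restart the cool-down timer.
            self._opened_at = self._clock()
```

A caller would wrap a downstream request as `breaker.call(fetch_live, serve_cached)`, where both function names are hypothetical. Whether a cached or degraded response is acceptable at all is exactly the customer-impact question the role is asked to answer, so the fallback is passed in explicitly rather than assumed.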
Requirements:
- Experience in an environment where services can be built in mobile, web, integration or backend technologies, based on Google Cloud and with Apigee exposure. Some of the technologies involved are: Angular | Strapi CMS | Squid Proxy | PingFederate | Kotlin and Swift | Apigee | GCP.
- Availability to travel is important: the project requires trips to the UK (once every two months).
- Ability to adapt to different contexts, teams and Clients.
- Teamwork skills, but also a sense of autonomy.
- Motivation for international projects, including travel.
- Willingness to collaborate with other players.
- Strong communication skills.
If you're interested in this job please send your CV to with reference "CA/SRE".
Thank you :)
Decskill is committed to equality and non-discrimination with all our talents. We recruit and promote talent, based on diversity and inclusion, regardless of age, gender, ethnicity, race, nationality or any other form of discrimination incompatible with the dignity of the human being.