Lead Site Reliability Engineer
2 semanas atrás
We are looking for aLead Site Reliability Engineerto enhance a global execution platform, delivering robust solutions to trading desks and clients. You will collaborate with expert teams, advancing your expertise in system administration, monitoring, and low-latency technologies. Join us to contribute to cutting-edge financial technology innovations. Note that working on-site at the client's Lisbon office for 2-3 days per week is required. Responsibilities Design and enforce monitoring, alerting, and incident management strategies Automate repetitive tasks and workflows to increase operational efficiency Work alongside software engineering teams to build and launch scalable, dependable systems Execute production deployments carefully to preserve platform stability Handle incident management with thorough analysis and reporting to maintain service quality Engage in on-call duties to support essential systems and services Communicate clearly with colleagues to swiftly resolve technical problems Maintain up-to-date documentation for operational workflows and system settings Drive continuous improvements in system reliability and efficiency through proactive initiatives Requirements Deep understanding of Unix/Linux operating systems and networking with over 5 years experience Proficiency in Unix/Linux shell scripting and programming languages including Python, Perl, C, C++, or Java Experience with monitoring and observability solutions such as ITRS Geneos, Dynatrace, Prometheus, and Grafana Strong troubleshooting skills for complex system issues Experience in environments with high availability and heavy traffic Bachelor's or Master's degree in IT engineering or a related discipline Ability to collaborate effectively within a team and adapt to evolving environments Self-driven with excellent problem-solving capabilities and thorough issue tracking Excellent written and verbal communication abilities with English proficiency at B2+ level Nice to have Familiarity with log analysis tools like Splunk, ELK, Graylog, or Loki Knowledge of network monitoring solutions such as Corvil Experience with relational databases including Oracle, PostgreSQL, MySQL/MariaDB, or KDB/q Understanding of messaging platforms like IBM MQ, Tibco, Solace, LBM, or Kafka Experience with Infrastructure as Code tools such as Ansible or Terraform We offer International projects with top brands Work with global teams of highly skilled, diverse peers Healthcare benefits Employee financial programs Paid time off and sick leave Upskilling, reskilling and certification courses Unlimited access to the LinkedIn Learning library and 22,000+ courses Global career opportunities Volunteer and community involvement opportunities EPAM Employee Groups Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn
-
Lead Site Reliability Engineer
2 minutos atrás
Lisboa, Portugal Hiire Tempo inteiroHiire is helping a global financial technology firm, which recently opened its office in Lisbon, hire a Lead Site Reliability Engineer. The company: We are a global financial technology firm that solves complex data-driven challenges faced by some of the world's most sophisticated financial institutions. We constantly innovate our platform and capabilities...
-
Lead Site Reliability Engineer
Há 5 dias
Lisboa, Lisboa, Portugal Hiire Tempo inteiroHiire is helping a global financial technology firm, which recently opened its office in Lisbon, hire a Lead Site Reliability Engineer. The company: We are a global financial technology firm that solves complex data-driven challenges faced by some of the world's most sophisticated financial institutions. We constantly innovate our platform and...
-
site reliability team lead
Há 3 dias
Lisboa, Portugal beBeeReliability Tempo inteiroSite Reliability Expertise for Global Success We are seeking a highly skilled Senior Site Reliability Engineer to join our dynamic Platform Site Reliability Engineering team. This expert will play a crucial role in ensuring the stability, reliability, and availability of mission-critical production applications on our platform. This includes observing system...
-
site reliability team lead
Há 5 dias
Lisboa, Lisboa, Portugal beBeeReliability Tempo inteiro 100 000 US$ - 120 000 US$ por anoSite Reliability Expertise for Global Success We are seeking a highly skilled Senior Site Reliability Engineer to join our dynamic Platform Site Reliability Engineering team. This expert will play a crucial role in ensuring the stability, reliability, and availability of mission-critical production applications on our platform. This includes observing...
-
Azure Site Reliability Engineer
6 minutos atrás
Lisboa, Portugal act digital Tempo inteiroWe are looking for an Azure Site Reliability Engineer to join a Cloud Operations team focused on digital transformation and cloud optimization. The team works closely with development and infrastructure teams to deliver secure, scalable and highly available cloud platforms. Role Overview As an Azure SRE, you will be responsible for ensuring the operational...
-
Site Reliability Engineer
3 minutos atrás
Lisboa, Portugal Sperton Global AS Tempo inteiroJob Title: Site Reliability Engineer (SRE) Location: Lisbon, Portugal (Hybrid)Job Type: Contract (6 months) Role Overview: We are looking for an experienced Site Reliability Engineer (SRE) to support business-critical systems in the banking and financial services domain. The role has a strong focus on production support, monitoring, automation, CI/CD...
-
Azure Site Reliability Engineer
6 minutos atrás
Lisboa, Portugal act digital Tempo inteiroWe are looking for an Azure Site Reliability Engineer to join a Cloud Operations team focused on digital transformation and cloud optimization. The team works closely with development and infrastructure teams to deliver secure, scalable and highly available cloud platforms. Role Overview As an Azure SRE, you will be responsible for ensuring the operational...
-
Site Reliability Engineer
2 semanas atrás
Lisboa, Portugal Claire Joster Tempo inteiroOverview Claire Joster is recruiting for a reference client in car rental services to strengthen its internal structure with the integration of a Site Reliability Engineer (m/f). Functions - Define Reliability: design, implement, and monitor Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for production services - Automation: write...
-
Lead Site Reliability Engineer
3 semanas atrás
Lisboa, Portugal Arcesium LLC Tempo inteiroArcesium is a global financial technology firm that solves complex data-driven challenges faced by some of the world’s most sophisticated financial institutions. We constantly innovate our platform and capabilities to meet tomorrow’s challenges, anticipate the risks our clients encounter, and design advanced solutions to help our clients achieve...
-
Site Reliability Engineer
3 semanas atrás
Lisboa, Portugal Paymentology Tempo inteiroJoin to apply for the Site Reliability Engineer role at Paymentology. Be among the first 25 applicants. Paymentology is the first truly global issuer‑processor, giving banks and fintechs the technology, team and experience to rapidly issue and process Mastercard, Visa and UnionPay cards across more than 60 countries at scale. Our advanced multi‑cloud...