Core & ML Ops Team Lead - Remote
2 semanas atrás
About Us
At Zyte, we eat data for breakfast and you can eat your breakfast anywhere and work for Zyte. Founded in 2010, we are a globally distributed team of over 250 Zytans working from over 28 countries who are on a mission to enable our customers to extract the data they need to continue to innovate and grow their businesses. We believe that all businesses deserve a smooth pathway to data.
For more than a decade, Zyte has led the way in building powerful, easy-to-use tools to collect, format, and deliver web data, quickly, dependably, and at scale. And today, the data we extract helps thousands of organizations make smarter business decisions, secure competitive advantage, and drive sustainable growth. Today, over 3,000 companies and 1 million developers rely on our tools and services to get the data they need from the web.
Zyte is seeking an experienced Team Lead to manage our Core & MLOps Squad, responsible for "Building the bedrock infrastructure that powers Zyte at scale." This hands-on technical leadership role requires expertise across MLOps, systems programming, and orchestration to lead a cross-functional team in designing and maintaining the scalable foundation that enables all Zyte teams to build and run their services with confidence.
Requirements
What you'll doTechnical Leadership- Design and evolve the core platform (Kubernetes, Mesos, GPU scheduling/autoscaling, distributed compute).
- Own the model platform: registry, experiment tracking, training orchestration, evaluation, serving, and monitoring.
- Build the Golden Path: reference repos, a scaffold CLI, opinionated CI/CD pipelines, runtime contracts (health/metrics/tracing/SLOs), high-performance clients, circuit breakers and other production‑ready defaults.
- Operate a secure, multi‑tenant model registry and training platform with standardized experiment/evaluation harnesses.
- Provide turnkey serving patterns (online + batch), drift/quality monitoring, and rollback playbooks.
- Integrate public/open‑source AI capabilities as managed platform services with cost and data‑governance guardrails.
- Run the squad: roadmap/prioritization, delivery, mentoring, and high engineering standards.
- Partner with product engineering (Zyte API, Scrapy Cloud), Prod Ops, and Security on adoption and rollout plans.
- Mentor the team and foster a platform-thinking mindset.
- Container orchestration (Kubernetes/Knative), GPU provisioning & autoscaling, environment & secret management.
- Operators, sidecars, and internal SDKs/libraries (Go/Rust/Python/Java) that enforce the golden path contract.
- Model platform: registry, experiment tracking, training orchestration, evaluation framework, serving infra, model monitoring.
- Observability: logging/metrics/tracing pipelines;
- Billing pipeline: metering/events/cost tracking abstractions.
- Golden Path: Java, Python, ML templates + CI/CD blueprints + docs + scaffold CLI.
- Reliability enablement (SRE practices), cost governance, supply‑chain security (SBOM, image signing).
- 5+ years experience building distributed systems; 3+ years in MLOps/ML platform engineering (or equivalent impact).
- Knowledge of Linux/OS internals (process model, cgroups/namespaces), networking (TCP/IP, HTTP/2), concurrency, and performance profiling.
- Deep understanding of Kubernetes (bonus: Mesos)
- Proficiency developing high-performance services in Java, Rust, Go or C++ (bonus: familiarity with vert.x and Netty frameworks); strong Python skills.
- Experience with GPU infrastructure (scheduling, containerization, optimization).
- Track record of designing and operating model platforms (registry, training, serving, monitoring) in production.
- Demonstrated success leading technical teams and implementing organization-wide platform solutions.
- Streaming & workflows: Kafka plus Argo/Temporal/Airflow or equivalents.
- eBPF‑based observability, perf tooling, or io_uring experience
- Cost optimization for ML/AI; multi‑tenant quotas and fairness.
- Hands‑on experience authoring Golden Paths (service chassis/templates, CI/CD blueprints, CLI scaffolds).
- SRE practices (SLIs/SLOs, incident management)
Benefits
Benefits:
- We love fostering and nourishing new ideas and bringing them to market
- Become part of a self-motivated, progressive, multi-cultural team.
- Have the freedom and flexibility to work from where you do your best work, as we are a completely remote company.
- Get the chance to work with cutting-edge open-source technologies and tools.
-
DevOps & ML Ops Engineer | Lisbon or Porto
1 semana atrás
Lisboa, Lisboa, Portugal TransPerfect Tempo inteiroJob description DevOps & ML Ops Engineer would be responsible for developing and maintaining scalable, stable services that deliver machine learning models to end users with guaranteed uptime. The primary focus will be on the infrastructure, deployment, and continuous integration/continuous delivery (CI/CD) processes for our ML...
-
Platform Engineering Team Lead
2 semanas atrás
Lisboa, Lisboa, Portugal Zyte Tempo inteiroAbout UsAt Zyte, we eat data for breakfast and you can eat your breakfast anywhere and work for Zyte. Founded in 2010, we are a globally distributed team of over 250 Zytans working from over 28 countries who are on a mission to enable our customers to extract the data they need to continue to innovate and grow their businesses. We believe that all businesses...
-
Data Science Team Lead
Há 7 dias
Lisboa, Lisboa, Portugal Kaizen Gaming Tempo inteiroLet's start with the roleAs the team lead of an AI product team you will lead and guide the team's efforts in delivering high impact AI products. You will be at the core of AI developments, analyzing data, building machine learning models, collaborating with the tech team to create successful AI products. The ideal candidate will combine a deep...
-
Data Science Team Lead
Há 7 dias
Lisboa, Lisboa, Portugal Kaizen Gaming Tempo inteiroWe are Kaizen GamingKaizen Gaming, the team powering Betano, is one of the biggest GameTech companies in the world, operating in 19 markets. We always aim to leverage cutting-edge technology, providing the best experience to our millions of customers who trust us for their entertainment.We are a diverse team of more than 2.700 Kaizeners, from 40+...
-
Lisboa, Lisboa, Portugal Zendesk Tempo inteiroJob DescriptionZendesk's people have one goal in mind: to make Customer Experience better. Our products help more than 125,000 global brands make their billions of customers happy, every day.The AI/ML Platform team is at the forefront of this mission. We build the foundation that powers every AI-driven experience at Zendesk, enabling product teams to build,...
-
NET Team Lead/Architect
1 semana atrás
Lisboa, Lisboa, Portugal Nimber Tempo inteiroHey there, think you stumbled upon this job posting by chance? We dont believe in chance at Nimber.Caught your eye? Great. Keep going...We are Nimber, and we are not just filling positionswere building a team thats ready to shake things up. If youre ready to rewrite the rules and make a real impact, this is your moment. Join us and lets put the future where...
-
Engineering Manager, AI/ML Infrastructure
1 semana atrás
Lisboa, Lisboa, Portugal Zendesk Tempo inteiroJob DescriptionZendesk's people have one goal in mind: to make Customer Experience better. Our products help more than 125,000 global brands make their billions of customers happy, every day.The AI/ML Platform team is at the forefront of this mission. We build the foundation that powers every AI-driven experience at Zendesk, enabling product teams to build,...
-
Technical Lead
Há 2 dias
Lisboa, Lisboa, Portugal Avanade Tempo inteiroSummaryStep into the future of enterprise integration We`re looking for a Technical Lead passionate about microservices, event-driven architecture, and building robust, secure, and scalable integration layers on Azure. This is your chance to be a hands-on leader, shaping critical application architecture and mentoring developers, all while delivering...
-
Senior ML Engineer
Há 2 dias
Lisboa, Lisboa, Portugal Zendesk Tempo inteiroJob DescriptionJob SummaryAt Zendesk, our focus is helping our customers build great relationships with their customers. Founded by three Danish entrepreneurs, Zendesk has experienced remarkable success and growth while maintaining a fun, positive, and down-to-earth culture.We are looking for a Staff Machine Learning Engineer to join our Machine Learning...
-
Core Tech Lead
Há 6 horas
Lisboa, Lisboa, Portugal Wire IT Tempo inteiroBased in Portugal, Wire IT is your specialized IT consulting partner with 18 years of experience driven by an experienced and senior team that helps clients make the right decisions in a fast-moving market.Wire IT's ambition is to grow while keeping true to its nature: agile, people-centered, and fun.As we like to say: Its not only what we do, its how we...