Observability Platform Engineer
WHO WE ARE
Optiver is a tech-driven trading firm and leading global market maker. For over 35 years, Optiver has been improving financial markets...
WHO WE ARE
Optiver is a tech-driven trading firm and leading global market maker. For over 35 years, Optiver has been improving financial markets worldwide, making them more transparent and efficient for all participants. With more than 1,400 employees in offices around the world, we're united in our commitment to improving the market through competitive pricing, execution and thorough risk management. By providing liquidity on multiple exchanges across the world, we actively trade on 70+ exchanges, where we're trusted to always provide accurate buy and sell pricing - no matter the market conditions.
WHAT YOU'LL DO
We are looking for a Senior Observability Platform Engineer to help evolve observability as a business-critical platform capability at Optiver. You will work on the shared platform behind metrics, logs, traces, events, alerts, dashboards, diagnostics, instrumentation and service health.
This is a platform engineering role for someone who enjoys building reliable systems used by other engineers. You will help turn a capable but heterogeneous observability foundation into a globally consistent, regionally federated platform that is reliable at scale, easy to adopt, and deeply embedded in how Optiver builds and operates production systems.
As a Senior Observability Platform Engineer, you will design, build, and operate components that help engineers, operators, trading teams, automated systems, and future agent-based workflows collect, query, understand, and act on production signals. You will work across platform and production domains: building high-scale telemetry pipelines, improving instrumentation quality, creating golden paths for adoption, and making observability more useful during real production investigations.
In this role, you will:
Design, build, and operate components of Optiver's shared observability platform across telemetry collection, ingestion, storage, query, visualisation, alerting, diagnostics, and service health.
Build software, services, APIs, integrations, libraries, dashboards, automation, and reusable patterns that make observability easier to adopt and more reliable to operate.
Improve the scalability, reliability, performance, cost-effectiveness, and operational quality of high-volume telemetry systems.
Improve developer and operator experience through self-service workflows, golden paths, documentation, investigation tooling, and practical platform abstractions.
Work with engineering, infrastructure, trading systems, research, and regional operations teams to understand production debugging needs and improve observability adoption.
Own the reliability and operational quality of the components you build, including service health, failure modes, monitoring, incident learnings, and continuous improvement.
Raise the standard for telemetry quality, instrumentation, alerting, dashboards, diagnostic workflows, and service health across Optiver.
WHAT YOU'LL BRING
You are a strong engineer with experience in production systems, platform engineering, SRE, infrastructure, observability, or distributed systems. You are comfortable working on systems that need to be reliable, scalable, understandable, and useful to other engineers.
You understand that observability is not just a tooling problem. It is about signal quality, platform reliability, developer experience, production workflows, and adoption. You care about building systems that engineers trust, operators can rely on, and production teams can depend on during high-pressure situations.
You will bring:
Strong engineering experience in SRE, software engineering, platform engineering, infrastructure, observability, developer tooling, or distributed systems.
A production mindset, with the ability to reason about failure modes, debugging workflows, service reliability, operational impact, and how systems behave under pressure.
Technical understanding of modern observability practices across logs, metrics, traces, events, alerting, dashboards, telemetry pipelines, diagnostics, instrumentation quality, and service health.
Experience designing, building, or operating reliable services, platforms, pipelines, tools, or automation used by other engineering teams.
Good judgement in technical trade-offs across performance, scalability, reliability, complexity, cost, and maintainability.
A delivery mindset, with the ability to take ambiguous platform problems and turn them into practical, reliable solutions.
Strong preference will be given to candidates with experience on observability, SRE, infrastructure, platform, production engineering, or developer tooling teams in large-scale distributed systems environments, including telemetry pipelines, streaming systems, time-series data, log platforms, query systems, alerting systems, or production diagnostics tooling.
Experience with technologies such as Kafka, Grafana, ELK/OpenSearch, ClickHouse, VictoriaMetrics, InfluxDB, Telegraf, Vector, OpenTelemetry, Prometheus-style systems, or custom telemetry collectors is valued.
WHAT YOU'LL GET
A performance-based bonus structure unmatched anywhere in the industry. We combine our profits across desks, teams and offices into a global profit pool, fostering a truly collaborative environment.
The chance to work alongside diverse and intelligent peers in a rewarding environment.
Training, mentorship and personal development opportunities.
Daily breakfast, lunch and an in-house barista.
Gym membership plus weekly in-house chair massages.
Regular social events, including a company trip every two years.
Guided relocation, a competitive relocation package and visa sponsorship where necessary.
DIVERSITY STATEMENT
Optiver is committed to diversity and inclusion. We encourage applications from candidates of all backgrounds, and welcome requests for reasonable adjustments during the process.
Questions? Get in touch with the recruitment team at [email protected].
Below are some other jobs we think you might be interested in.
-
Senior Platform Engineer - MLOps, Kubernetes & Observability
- Firmus Technologies
- Sydney, NSW, AU
Job DescriptionA tech solutions provider based in Australia is seeking a Senior Platform Engineer to drive the design and implementation of MLOps...16 Jun -
Senior Observability Engineer - Hybrid Data-Driven Platform
- Furō
- Melbourne, VIC, AU
Job DescriptionA dynamic tech startup based in Melbourne is seeking a Senior Observability Engineer for a 6-month contract. This role involves designing...16 Jun -
Hybrid Platform Reliability Engineer - AWS & Observability
- nib Group
- Melbourne, VIC, AU
Job Description nib Group is seeking a Platform Support Engineer in Newcastle to enhance and support cloud-based applications. You will diagnose issues,...23 Jun -
Hybrid Platform Reliability Engineer - AWS & Observability
- nib Group
- City of Sydney, NSW, AU
Job Description nib Group is seeking a Platform Support Engineer in Newcastle to enhance and support cloud-based applications. You will diagnose issues,...28 Jun -
Hybrid Platform Reliability Engineer - AWS & Observability
- nib Group
- Newcastle, NSW, AU
Job Description nib Group is seeking a Platform Support Engineer in Newcastle to enhance and support cloud-based applications. You will diagnose issues,...24 Jun -
Hybrid Platform Reliability Engineer - AWS & Observability
- nib Group
- Sydney, NSW, AU
Job Description nib Group is seeking a Platform Support Engineer in Newcastle to enhance and support cloud-based applications. You will diagnose issues,...23 Jun -
Platform Engineer: Cloud-Native, SRE & Observability
- DBA-Verwaltungs-Gmbh
- City of Sydney, NSW, AU
Job Description ResMed is seeking a Software Engineer to innovate and enhance healthcare technology solutions. This role involves addressing technical...13 Jun -
Platform Engineer: Cloud-Native, SRE & Observability
- DBA-Verwaltungs-Gmbh
- Sydney, NSW, AU
Job Description ResMed is seeking a Software Engineer to innovate and enhance healthcare technology solutions. This role involves addressing technical...19 Jun -
Senior Data Engineer - Cloud Data Platform (Network & Observability)
- TPG Telecom
- Barangaroo NSW 2000,Australia,Australia
Apply Job no: KGC98 Category: Technology, TEC - Service Operations Join a powerhouse of brands that connect customers, businesses and communities. Bring...20 Jun -
Global Observability Leader - Platform & Resilience
- NewsNowGh
- City of Sydney, NSW, AU
Job Description NewsNowGh is seeking a Global Observability Team Lead based in Sydney to lead and enhance enterprise observability platforms. This role...13 Jun -
Global Observability Leader - Platform & Resilience
- NewsNowGh
- Sydney, NSW, AU
Job Description NewsNowGh is seeking a Global Observability Team Lead based in Sydney to lead and enhance enterprise observability platforms. This role...23 Jun -
Senior Observability & AIOps Platform Lead
- Hub24 Management Services Pty Ltd
- Sydney, NSW, AU
Job Description Hub24 Management Services Pty Ltd is seeking a Principal Observability Engineer to ensure the reliability and performance of core...16 Jun -
AWS Cloud Platform Engineer & SRE -- Kubernetes, IaC, Observability
- Leidos Australia
- City of Knox, VIC, AU
Job Description Leidos Australia is looking for a skilled individual to support a Defence program enhancing the Health Knowledge Management System. Key...13 Jun -
AWS Cloud Platform Engineer & SRE -- Kubernetes, IaC, Observability
- Leidos Australia
- Williamstown, SA, AU
Job Description Leidos Australia is looking for a skilled individual to support a Defence program enhancing the Health Knowledge Management System. Key...16 Jun -
Senior Observability Engineer
- Furō
- Melbourne, VIC, AU
Job Description Join Furō as a Senior Observability Engineer! 6 Month initial fixed term contract High-impact transformation role Hands-on delivery...16 Jun -
SRE Observability Engineer
- N2S.Global
- City of Sydney, NSW, AU
Job Description We are seeking a highly skilled SRE Observability Engineer to design, implement, and optimize observability solutions across...22 Jun -
Principal Observability Engineer
- Hub24 Management Services Pty Ltd
- Sydney, NSW, AU
Job Description Job Summary The Principal Observability Engineer is responsible for ensuring the reliability, scalability, and performance of HUB24's...16 Jun -
Principal Observability Engineer
- HUB24 Limited
- Sydney, NSW, AU
Job DescriptionHUB24 leads the wealth industry as the best provider of integrated platform, technology and data solutions. At HUB24, we know the...16 Jun -
SRE Observability Engineer
- N2S.Global
- Sydney, NSW, AU
Job Description We are seeking a highly skilled SRE Observability Engineer to design, implement, and optimize observability solutions across distributed...23 Jun -
SRE Tools Lead: Observability & ITSM Platform
- ING Bank (Australia) Limited
- City of Sydney, NSW, AU
Job Description ING Bank (Australia) Limited is looking for a SRE Tools – Area Lead to oversee their SRE Centre of Expertise. This critical role...17 Jun