“Patterns Led Observability” offers out-of-the-box patterns that can be adopted across all layers of the Observability Value Pyramid, which allows our internal customer development teams ease of function to consume and get moving quickly. Standardizing development practices with leveraging AI and automation technologies, whilst maintaining careful control over costs. This session will delve into the unique observability challenges and opportunities faced at JPMorgan Chase - with upwards of 40k internal developers ready and eager to consume our tooling. Leveraging standardized practices to streamline the adoption of strategic tooling, ensuring consistent and
efficient observability across complex systems, leveraging proven cost effective methods for integration that scales to volumes in the multi-TB-per-day range.
Patterns Led Observability in Financial Services: Understanding the specific needs and constraints of the financial sector and which observability patterns can address them.
Strategic Tooling and Standardization: How standardized practices facilitate the seamless integration of observability tools, driving efficiency and reducing operational complexity.
Leveraging AI and Automation: Exploring the role of AI and automation in enhancing observability, from predictive analytics to automated incident response.
Cost Control Strategies: Techniques for balancing advanced observability capabilities with budgetary constraints, ensuring cost-effective implementation and operation.
Session Available For Sponsorship
As observability data volumes explode, so do the costs - leaving many organizations struggling to balance visibility with financial sustainability. This panel explores how engineering, platform, and finance leaders are working together to control observability spend without sacrificing critical insights. We’ll discuss strategies for smarter data collection, tooling consolidation, and aligning observability investments with business value.
Session Available For Sponsorship
In this session, Kristina will show how Electrolux built an internal developer platform (IDP) that integrates observability and runbook knowledge into every workflow. By shifting incident resolution upstream, developers own more of the troubleshooting, and SREs get time back for high-impact projects. Specifically, the session will cover:
Session Available For Sponsorship
GroupOn operates as a much leaner business today as a public company than in its days as a unicorn startup. As the company has scaled down, the engineering team faces the challenge of maintaining and reengineering a system built without cost constraints to be more cost conscious and sustainable. This session explores how GroupOn is tackling legacy complexity, consolidating tooling, and finding the right balance between buying and building to right-size for the future.
With the scale and complexity of modern systems, human-driven observability alone is no longer enough. This session explores how AI and machine learning are transforming observability - enabling proactive anomaly detection, intelligent alerting, and faster root cause analysis. Learn how enterprises are integrating AI/ML into their observability stacks to surface signals earlier, reduce noise, and accelerate response times.
Culture is as critical to reliability as any tool or dashboard. This session explores how leading organisations are building SRE cultures rooted in real-time observability - where teams are empowered to detect, respond, and learn from incidents proactively. We’ll look at how observability is shaping everything from on-call practices and team structures to incident reviews and cross-functional collaboration.
Modern Site Reliability Engineers (SREs) are drowning in alert fatigue, repetitive toil, and late night pages. But what if you could flip the script using AI? In this talk, we explore how a solo engineer empowered their SRE practice by building an AI-driven ChatOps bot and anomaly detector - automating the triage of alerts, generating root-cause hints, and triggering runbooks autonomously. What started as a survival tactic evolved into a transformative shift: from reactive firefighting to proactive reliability engineering. Alongside this, the speaker delves into how solo side-projects using code-generation agents and no-code orchestrators helped fast-track innovation,
proving that individual contributors can ship impactful tools without a team. The session rounds out with an honest discussion of what it takes to bring these innovations into a larger engineering culture—navigating the dynamics of engineer-led experiments, middle-management resistance, and the need for CTO-level sponsorship.