cotalks.dev
Login
SREcon25 Americas
Videos
1 — SREcon25 Americas - Safe Evaluation and Rollout of AI Models
2 — SREcon25 Americas - An SRE Approach to Monitoring ML in Production
3 — SREcon25 Americas - Using Statistical Techniques to Automatically Detect Game-Breaking Issues
4 — SREcon25 Americas - Live, Laugh, Log
5 — SREcon25 Americas - Mapping a Better Future with STPA
6 — SREcon25 Americas - Improving the SRE Experience for 10 Years as a Free, Open, and Automated...
7 — SREcon25 Americas - Transformers in SRE Land: Evolving to Manage AI Infrastructure
8 — SREcon25 Americas - Case Study: A Thundering Herd in the Wild
9 — SREcon25 Americas - Techniques Netflix Uses to Weather Significant Demand Shifts
10 — SREcon25 Americas - Distributed Tracing in Action: Our Journey with OpenTelemetry
11 — SREcon25 Americas - The Search for Speed
12 — SREcon25 Americas - Is the S in SRE for “Security”?
13 — SREcon25 Americas - Tackling Slow Queries: A Practical Approach to Prevention and Correction
14 — SREcon25 Americas - Lies Programmers Believe about Memory
15 — SREcon25 Americas - “On-Call Is Ruining My Life” and Other Tales about Holding the Pager as an SRE
16 — SREcon25 Americas - Maturing Your Data Architecture in a Week: How Bluesky Survived
17 — SREcon25 Americas - The Perverse Incentives of Reliability
18 — SREcon25 Americas - Inclusive SRE: Best Practices for Working with a Visually Impaired Incident...
19 — SREcon25 Americas - Learning from Incidents at Scale; Actually Doing Cross-Incident Analysis
20 — SREcon25 Americas - SRE & Complexification: Where Verbs and Nouns Do Battle
21 — SREcon25 Americas - Running DRP Tabletop Exercises
22 — SREcon25 Americas - Optimizing Machine Learning Training Infrastructure: A Governance Approach
23 — SREcon25 Americas - Beyond Sequential: A Recipe for Async Pipeline Observability and Alerting
24 — SREcon25 Americas - Handling the Largest Domains Migration, Ever!
25 — SREcon25 Americas - Chaos Experiments - Datacenter Stress Testing
26 — SREcon25 Americas - Please Give Me Back My Network Cables! On Networking Limits in AWS
27 — SREcon25 Americas - Fully Automated HW SKU Selection System to Optimize Apache Pinot’s Cost-to...
28 — SREcon25 Americas - No Time to Do It All! Approaching Overload on DevOps Teams
29 — SREcon25 Americas - Taming the Beast: Understanding and Harnessing the Power of HTTP Proxies
30 — SREcon25 Americas - Measuring Availability the Player Focused Way: How Riot Games Changed Its...
31 — SREcon25 Americas - OpenTelemetry Semantic Conventions and How to Avoid Broken Observability
32 — SREcon25 Americas - Lightning Talks
33 — SREcon25 Americas - Stopping Performance Regression via Changepoint Detection
34 — SREcon25 Americas - Incident Management Metrics That Matter
35 — SREcon25 Americas - Per Aspera ad Productum: Turning Processes into Products
36 — SREcon25 Americas - Production Engineering When Trading Billions of Dollars a Day
37 — SREcon25 Americas - Systems Thinking with Poisoned Systems
38 — SREcon25 Americas - Securing Distributed Cache: Achieving Secure-by-Default with Key Challenges &...
39 — SREcon25 Americas - Cattle vs. Pets - A Cost-Effective Elasticsearch Architecture to Scale-Out...
40 — SREcon25 Americas - Going Multi Cloud in a Hurry with Quality and Style
41 — SREcon25 Americas - Mitigating Against Large Scale Systemic Failures in E-Trading
42 — SREcon25 Americas - Network Flow Data in the Cloud
43 — SREcon25 Americas - Technical Debt as Theory Building and Practice
44 — SREcon25 Americas - OLTP SQL Database Query Tracing and Linting
45 — SREcon25 Americas - From HAR to OpenTelemetry Trace: Redefining Browser Observability
46 — SREcon25 Americas - Hijacking Service Discovery to Simulate Dependency Degradation
47 — SREcon25 Americas - “How’s the App Doing?” Bringing Mobile Into Your Reliability Picture
48 — SREcon25 Americas - AIOps: Prove It! An Open Letter to Vendors Selling AI for SREs