InfoQ Homepage Resilience Content on InfoQ
-
Built to Outlast: Cultivating a Culture of Resilience
Kathleen Vignos explains key strategies for software leaders to navigate uncertainty and build lasting careers.
-
Slack's Migration to a Cellular Architecture
Cooper Bethea explains the journey of converting Slack's monolithic production services to cellular, highlighting the challenges and key success factors.
-
Designing Cloud Applications for Elasticity and Resilience
The panelists explore elasticity and resilience, discussing how architects can design systems that withstand workload variations, user traffic fluctuations, and infrastructure failures.
-
Resilience and Chaos Engineering in a Kubernetes World
The panelists discuss the tools, knowledge, and resources that can help achieve faster incident response and recovery times.
-
Building Organizational Resilience through Documentation and InnerSource Practices
David Grizzanti discusses how communication is more effective through writing, documentation helping drive clarity and alignment across teams, and where InnerSource practices can speed up development.
-
Generative AI and Organizational Resilience
Alex Cruikshank discusses where GenAI is likely to have the greatest impact, steps to manage this change, and ways to leverage the shift to AI mediated work to better understand business processes.
-
Multiplying Engineering Productivity in Face of Constant Change
Shweta Saraf discusses harnessing the collective intelligence of a team to not only multiply productivity, but also cultivate organizational resilience in the face of unceasing changes.
-
How Do We Talk to Each Other? How Surfacing Communication Patterns in Organizations Can Help You Understand and Improve Your Resilience
Nora Jones discusses how communication patterns in organizations can reveal how systems actually work in practice, vs how we think they work in theory.
-
Two Years of Incidents at Six Different Companies: How a Culture of Resilience Can Help You Accomplish Your Goals
Vanessa Huerta Granda looks at real-life examples of companies she has worked with who chose to invest in improving their incident programs and have seen it pay dividends.
-
Resilience Hides in Plain Sight
John Allspaw describes what resilience is, and how it's incredibly hard to recognize it.
-
Orchestrating Resilience: Building Modern Asynchronous Systems
Sai Pragna Etikyala discusses her journey at Twilio, sharing practical examples from their projects, the challenges they faced, and how they overcame them.
-
Comparing Apples and Volkswagens: the Problem with Aggregate Incident Metrics
Courtney Nash presents data from the Verica Open Incident Database (VOID) to demonstrate how aggregate incident metrics (MTTR) aren't representative of systems' resilience.