[go: up one dir, main page]

DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
S3 Is Starting to Feel Like a File System — But Not Quite

S3 Is Starting to Feel Like a File System — But Not Quite

Comments
2 min read
Why I Built Scenar.io - An AI-Powered DevOps Interview Practice Tool

Why I Built Scenar.io - An AI-Powered DevOps Interview Practice Tool

Comments
4 min read
Why SRE Principles Are the Missing Layer in MCP Security

Why SRE Principles Are the Missing Layer in MCP Security

1
Comments
5 min read
Stop Wondering How Virtual Memory Works!!!

Stop Wondering How Virtual Memory Works!!!

Comments
5 min read
My First dev.to Post — And a 1-Evening SRE System That Changed Our On-Call

My First dev.to Post — And a 1-Evening SRE System That Changed Our On-Call

Comments
2 min read
80% of GitHub Repos Still Use Static AWS Credentials in 2026

80% of GitHub Repos Still Use Static AWS Credentials in 2026

Comments
4 min read
Incident response / On-call: timeouts — operational runbook (playbook thực chiến)

Incident response / On-call: timeouts — operational runbook (playbook thực chiến)

Comments
3 min read
From MVP to Production: Scaling a Speech AI Service

From MVP to Production: Scaling a Speech AI Service

Comments
3 min read
Something I wish someone had told me five years earlier:

Something I wish someone had told me five years earlier:

Comments
2 min read
The Hidden Costs of Real-Time: Latency vs Accuracy Trade-offs

The Hidden Costs of Real-Time: Latency vs Accuracy Trade-offs

Comments
2 min read
A hard-earned rule from incident retrospectives:

A hard-earned rule from incident retrospectives:

1
Comments
2 min read
Beyond Static Limits: Adaptive Concurrency with TCP-Vegas in Go

Beyond Static Limits: Adaptive Concurrency with TCP-Vegas in Go

2
Comments
3 min read
Exponential Back-off with Jitter: Retries

Exponential Back-off with Jitter: Retries

Comments
3 min read
Native Chaos Engineering: Testing Resilience with Fault & Latency Injection

Native Chaos Engineering: Testing Resilience with Fault & Latency Injection

1
Comments
3 min read
The “Token Bleed”: How to Operate LLMs Without Bankrupting Yourself

The “Token Bleed”: How to Operate LLMs Without Bankrupting Yourself

Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.