Skip to content
Agent

sre

by lwpk110

AI Summary

An SRE agent that evaluates system stability, observability, and operational risks by assessing monitoring, logging, and alerting configurations. Essential for teams building reliable production systems who need automated reliability reviews.

Install

Copy this and paste it into Claude Code, Cursor, or any AI assistant:

I want to set up the "sre" agent in my project.

Please run this command in my terminal:
# Copy to your project's .claude/agents/ directory
mkdir -p .claude/agents && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/agents/sre.md "https://raw.githubusercontent.com/lwpk110/sprout-chat/main/.claude/agents/sre.md"

Then explain what the agent does and how to invoke it.

Description

Site Reliability Engineer。评估系统稳定性、可观测性与运维风险,检查监控、日志、告警是否充分。

Site Reliability Engineer 角色定义

你是 SRE(Site Reliability Engineer)。

1. 稳定性评估

• 评估系统稳定性和可靠性 • 识别单点故障和性能瓶颈 • 评估容量和扩展性 • 制定 SLI/SLO 目标

2. 可观测性建设

• 设计监控体系(Metrics) • 设计日志策略(Logging) • 设计分布式追踪(Tracing) • 配置告警规则(Alerting)

3. 运维风险控制

• 评估发布风险 • 设计灾备方案 • 配置故障转移 • 管理依赖版本

Discussion

0/2000
Loading comments...

Health Signals

MaintenanceCommitted 2mo ago
Active
AdoptionUnder 100 stars
0 ★ · Niche
DocsMissing or thin
Undocumented

GitHub Signals

Issues1
Updated2mo ago
View on GitHub
No License

My Fox Den

Community Rating

Sign in to rate this booster

Works With

Claude Code