Rate limiting
Rate limiting is a counter per key per window: pick the key (user or IP), pick the window, and always tell the caller when to retry with Retry-After.
What this skill helps you build
Cap how often a caller can hit an endpoint — count requests per key per time window and return 429 + Retry-After when the limit is exceeded.
The production takeaway
Rate limiting is a counter per key per window: pick the key (user or IP), pick the window, and always tell the caller when to retry with Retry-After.
Inside this skill
The full skill expands these implementation areas with decisions, edge cases, prompts, tests, and framework-specific code.
Unlock the full implementation
Paid access includes the complete skill body, implementation prompt, common mistakes, production checklist, and code examples where this skill includes them.