Practical software security from an engineer's perspective — secrets handling, threat modelling, least privilege, input validation, prompt injection, sandboxing, and the AI-specific attack surfaces that change the threat model. Each post focuses on how to think about risk before it bites in production: which mitigations actually move the needle, which ones are theatre, and how to design systems so a single bug doesn't cascade into catastrophic failure.
The coverage spans the boring-but-essential (rotating credentials, locking down server access, sanitising user input) and the AI-era unknowns (prompt-injected agents, untrusted tool outputs, exfiltrating data through innocent-looking model responses). Written for engineers shipping code, not security consultants writing reports — every recommendation is something you can apply in your next pull request.
1 post below, newest first.
Securing AI Agents from Doing Bad Things
Show notes for AI Explained Part 31 — sandboxing, permission scoping, instruction hierarchy, and the metrics that tell you whether your agent is safe to ship.
Subjects that frequently appear alongside #security. Click through to see every post on each one.
How LLMs actually work — tokenization, embeddings, RAG, fine-tuning, agents — explained for engineers who ship production code, not papers.
How autonomous AI agents reason, plan, use tools, and stay aligned with your intent — the ReAct loop, agentic RAG, and multi-agent orchestration.
The AI Explained series: short, focused episodes on individual AI building blocks — transformers, attention, tokenization, memory, tool use, multi-agent systems, and more.
Large language models — how they think, why they fail, what RAG fixes, and how to evaluate them. The fundamentals every engineer building on top of an LLM should internalise.