Originally reported by Schneier on Security
TL;DR
Researchers demonstrate three distinct side-channel attacks against LLMs that exploit timing patterns and network metadata to infer conversation topics and leak sensitive data.
Three research papers demonstrate side-channel attacks achieving up to 98% accuracy in inferring LLM conversation topics through network metadata analysis. The findings carry significant privacy implications for all LLM users, though exploitation requires network-level access to the victim's traffic.
Researchers have identified three distinct side-channel attack classes that compromise LLM privacy despite TLS encryption protecting message content.
The first attack exploits timing variations introduced by efficiency optimizations like speculative sampling and parallel decoding. By monitoring encrypted network traffic between users and remote LLMs, attackers can infer sensitive properties of the conversation, such as its topic, without breaking TLS.
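To make the timing channel concrete, here is a minimal sketch (all names, thresholds, and traces are invented, not taken from the papers) of what an on-path observer can compute: each streamed token typically arrives in its own TLS record, so packet timestamps reveal inter-token gaps, and optimizations that emit several tokens nearly at once leave a distinctive burst signature.

```python
# Hypothetical illustration: recover per-token timing statistics from an
# encrypted token stream using only packet arrival times.

def inter_arrival_gaps(timestamps):
    """Gaps between consecutive packet arrival times (seconds)."""
    return [b - a for a, b in zip(timestamps, timestamps[1:])]

def burst_fraction(gaps, threshold=0.005):
    """Fraction of gaps short enough to suggest tokens emitted together,
    e.g. when an efficiency optimization produced several tokens at once."""
    if not gaps:
        return 0.0
    return sum(g < threshold for g in gaps) / len(gaps)

# Two synthetic traces: one with frequent multi-token bursts, one without.
bursty = [0.00, 0.001, 0.002, 0.040, 0.041, 0.042, 0.080]
steady = [0.00, 0.040, 0.080, 0.120, 0.160, 0.200, 0.240]

print(burst_fraction(inter_arrival_gaps(bursty)))  # high burst fraction
print(burst_fraction(inter_arrival_gaps(steady)))  # no bursts
```

Because the burst pattern depends on how well the optimization's guesses match the text being generated, the statistic is input-dependent, which is exactly what makes it a side channel.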
The second attack targets speculative decoding mechanisms that generate and verify multiple candidate tokens in parallel. Researchers demonstrated that input-dependent patterns of correct and incorrect speculations leak through observable timing and packet patterns in the encrypted traffic.
Testing across four speculative-decoding schemes (REST, LADE, BiLD, EAGLE) served via vLLM, the attack proved effective against all four configurations.
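The mechanism can be sketched with a toy model (this is an illustration of the principle, not the papers' actual method): a draft model proposes several tokens per step, the target model accepts the longest correct prefix, and the per-step accepted counts surface on the wire as response-chunk sizes. The "memorized phrase" draft below is a fabricated stand-in for a real draft model.

```python
# Toy speculative decoder: per-step accepted-token counts depend on the
# text being generated, so chunk sizes fingerprint the content.

MEMORIZED = "the cat sat on the mat".split()

def draft_guess(i, k):
    """Fabricated draft model: always predicts the memorized phrase."""
    return MEMORIZED[i:i + k]

def accepted_per_step(target_tokens, k=4):
    """Tokens emitted per decoding step: the longest matching draft
    prefix, plus the one token the target model produces itself."""
    counts, i = [], 0
    while i < len(target_tokens):
        drafted = draft_guess(i, k)
        accept = 0
        while (accept < len(drafted) and i + accept < len(target_tokens)
               and drafted[accept] == target_tokens[i + accept]):
            accept += 1
        step = min(accept + 1, len(target_tokens) - i)
        counts.append(step)
        i += step
    return counts

# Different inputs yield different chunk-size sequences.
print(accepted_per_step("the cat sat on the mat".split()))  # [5, 1]
print(accepted_per_step("a dog ran in the park".split()))   # [1, 1, 1, 1, 2]
```

Text the draft model predicts well streams in large chunks; text it predicts poorly streams token by token, and an observer who sees only encrypted packet sizes can tell the difference.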
The third attack, dubbed "Whisper Leak," analyzes packet size and timing patterns in streaming responses to classify user prompt topics. Testing across 28 popular LLMs from major providers revealed widespread susceptibility, with topic-inference accuracy reaching 98% in some configurations.
These vulnerabilities affect LLMs deployed across sensitive domains including healthcare, legal services, and confidential communications. The attacks pose particular risks for users whose traffic is subject to network-level surveillance.
Researchers evaluated three defensive approaches against these attacks.
While each mitigation reduces attack effectiveness, none provides complete protection against all three attack vectors. The research teams have engaged in responsible disclosure with LLM providers to implement initial countermeasures.
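One class of mitigation commonly discussed for metadata side channels, assumed here as an illustration rather than quoted from the papers, is padding: forcing every streamed chunk to a fixed bucket size so payload length no longer tracks token length. A minimal sketch:

```python
# Pad each streamed payload up to the next multiple of `bucket` bytes so
# an observer sees uniform chunk sizes on the wire.

def pad_to_bucket(payload: bytes, bucket: int = 128) -> bytes:
    """Pad payload with NUL bytes to the next multiple of `bucket`."""
    if bucket <= 0:
        raise ValueError("bucket must be positive")
    padded_len = -(-len(payload) // bucket) * bucket  # ceiling division
    return payload.ljust(padded_len, b"\x00")

# Chunks of different true length become indistinguishable by size.
print(len(pad_to_bucket(b"short token")))                        # 128
print(len(pad_to_bucket(b"a much longer token chunk of text")))  # 128
```

Note the trade-off this sketch makes visible: padding hides sizes but not timing, which is one reason no single mitigation closes all three attack vectors.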