
#adversarial-ml

9 posts tagged adversarial-ml.

Attack Techniques

LLM Jailbreak Techniques Explained: Eight Attack Patterns and What Defenders Do About Them

A technical breakdown of the eight most-used LLM jailbreak techniques — persona hijacking, many-shot flooding, adversarial suffixes, indirect injection
June 18, 2026
adversarial-ml

Evasion Attacks on Production Classifiers: Malware, Spam, and Fraud

Deployed ML classifiers in malware, spam, and fraud detection face evasion attacks where the attacker has a clear payoff.
May 22, 2026
adversarial-ml

Web-Scale Dataset Poisoning: Split-View and Frontrunning

Web-scale dataset poisoning is practical: split-view and frontrunning attacks corrupt training data through mutable URLs and timed edits. Also covers defenses.
May 22, 2026
adversarial-ml

Adversarial Examples Against Vision Models in 2025

Where physical-world adversarial patches and digital attacks stand against modern vision models — what still works, what's been hardened, and where the
May 9, 2026
attack-patterns

Jailbreaking Multimodal Models: Visual Prompt Injection Attacks

How attackers use images, typography, and adversarial visual inputs to bypass safety guardrails in GPT-4V, Claude, and Gemini — and why multimodal inputs
May 9, 2026
adversarial-ml

Model Extraction via Black-Box Query Attacks

How attackers reconstruct private model weights and decision boundaries through query-only access — the techniques, the economics, and what extracted
May 9, 2026
adversarial-ml

Model Inversion and Membership Inference: Extracting LLM Data

How membership inference attacks determine whether specific data was used to train a model, and how model inversion techniques reconstruct private
May 9, 2026
attack-patterns

Supply Chain Attacks on AI Models: Poisoning and Backdoors

How attackers compromise AI models before they reach production — through malicious fine-tuning, dataset poisoning, serialization exploits, and the unique
May 9, 2026
adversarial-ml

Training Data Poisoning and Backdoor Attacks on LLMs

A technical deep-dive into how adversaries manipulate training datasets and introduce hidden backdoors into LLMs — covering poisoning mechanics, stealthy
May 9, 2026