🐢 Open-Source Evaluation & Testing library for LLM Agents
-
Updated
Jun 19, 2026 - Python
🐢 Open-Source Evaluation & Testing library for LLM Agents
The Python Risk Identification Tool for generative AI (PyRIT) is an open source framework built to empower security professionals and engineers to proactively identify risks in generative AI systems.
AI Red Teaming playground labs to run AI Red Teaming trainings including infrastructure.
Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪
A powerful tool for automated LLM fuzzing. It is designed to help developers and security researchers identify and mitigate potential jailbreaks in their LLM APIs.
An offensive/defense security toolset for discovery, recon and ethical assessment of AI Agents
A comprehensive guide to adversarial testing and security evaluation of AI systems, helping organizations identify vulnerabilities before attackers exploit them.
LLM | Agentic | Security | Operations in one github repo with good links and pictures.
AI Red Teaming Range
AI Security Platform: Defense (61 Rust engines + Micro-Model Swarm) + Offense (39K+ payloads)
AspGoat is an intentionally vulnerable ASP.NET Core application for learning and practicing web application security.
The Python Risk Identification Tool for generative AI (PyRIT) is an open source framework built to empower security professionals and engineers to proactively identify risks in generative AI systems.
An open source plugin for enabeling claude to gain offensive pentesting capabilities
LMAP (large language model mapper) is like NMAP for LLM, is an LLM Vulnerability Scanner and Zero-day Vulnerability Fuzzer.
🤖🛡️🔍🔒🔑 Tiny package designed to support red teams and penetration testers in exploiting large language model AI solutions.
Open-Source autonomous security operations and red teaming agent built to help defenders investigate threats, analyze vulnerabilities, assess indicators of compromise, generate hardening guidance, and execute security research through an auditable agent workflow.
SOC-in-a-Box for AI purple teaming
🛡️ Safe AI Agents through Action Classifier
90-day learning path from ML fundamentals to production AI security systems
AI/LLM Red Team Suite — Automated security testing toolkit for probing language models against prompt injection, jailbreaks, data extraction, and guardrail bypasses
Add a description, image, and links to the ai-red-team topic page so that developers can more easily learn about it.
To associate your repository with the ai-red-team topic, visit your repo's landing page and select "manage topics."