Prevent unsafe prompts
from reaching your LLM

Protect your application from prompt injection, data leakage, and unsafe outputs - before they happen.

For teams building AI-powered applications in production that need enforcement, not just monitoring.

WrapSec is an AI security gateway - multi-layer threat detection, real-time policy enforcement, and a full audit trail. Deploys on your own infrastructure in minutes.

View on GitHub Getting started

Request flow

Your Applicationorigin

inspect -> ALLOW / BLOCK / SANITIZE

WrapSec Gateway

enforced

proxied via HTTPS - inspect output

LLM Providerproxied

ALLOWBLOCKSANITIZE

The problem

Without an enforcement layer

Most AI applications ship features without a dedicated security layer between user input and the model. The consequences are predictable.

Models get manipulated

Without enforcement, a crafted prompt can override your system instructions, change model behaviour, or extract data your application never intended to expose.

Sensitive data leaks

PII entered by users - or retrieved through RAG pipelines - can pass through to LLM providers or appear in responses without detection or redaction.

Outputs cannot be trusted

Without output inspection, your application passes LLM responses directly to users. There is no enforcement point between the model and your customers.

This is not a detection problem - it is an enforcement problem.

~5ms

Detection latency

Rule + ML Tier 1. Transformer optional (~20-50ms), LLM analysis optional.

PII entity types

Enforced on input and output, before reaching the model

Threat categories

Full AI attack surface coverage in one layer

368

Tests passing

Validated across unit, integration, security, and E2E

What it detects

Six threat categories.
One enforcement pipeline.

Each detection method catches what the others miss. Rule-based detection is fast but rigid. ML classification generalises. LLM semantic analysis catches intent that neither rules nor classifiers can codify.

Covers both known attack patterns and emerging threats beyond static rules.

Prompt Injection

Detects attempts to override system instructions through crafted user input - including indirect injections from retrieved documents and tool outputs.

Jailbreak Attempts

Multi-layer detection across rule heuristics, ML classification, and semantic LLM analysis catches novel bypass techniques that single-method approaches miss.

PII Leakage

Identifies and redacts 22 personally identifiable entity types from both input and output - SSNs, card numbers, emails, IBANs, passports, API keys, and more.

Toxic Content

Independent guardrail reading ML toxicity confidence directly - bypassing detection weight dilution so a high-confidence signal always triggers enforcement.

Data Exfiltration

Flags prompts designed to extract training data, internal context, or confidential documents surfaced through RAG pipelines or tool-augmented agents.

Malicious Intent

Catches prompts attempting to weaponise your model for fraud, social engineering, or generating harmful content at scale through your application.

How detection works

Multi-layer pipeline.
Independent guardrails.

WrapSec does not rely on a single detection method. It combines independent layers to ensure no attack passes through a single failure point.

Each layer adds a different protection mechanism. Guardrails always override the detection pipeline, and any detector failure returns an explicit error rather than a silent pass.

Guardrail-first - Guardrails run ahead of the detection pipeline with no bypass path.

Fail-safe design - Any detector failure blocks the request - never a silent pass to the LLM.

Weighted scoring - Risk score reflects agreement across all active detection layers.

Two detection modes - Fast or full mode, selectable per request.

detection pipeline
Input
PII Guard<1ms
Rule Detector~1ms
ML Pipelinetwo-tier
Tier 1 - Base Model~5ms
Tier 2 - Transformeroptional
Toxicity Guardindependent
LLM Detectorfull mode
Policy EngineALLOW / BLOCK / SANITIZE

Deployment modes

Two modes. One decision surface.

Integrate without changing your application architecture. Both modes produce the same structured decision and full audit trail.

Scan-Only- minimal integration, you forward to your LLM

Proxy Mode- full enforcement, WrapSec handles the LLM request end-to-end

POST /v1/ai/request

Scan-Only

Send a prompt to WrapSec, receive a structured decision - ALLOW, BLOCK, or SANITIZE - then forward to your own LLM. Minimal integration, zero change to your provider setup.

Decision + confidence score
Risk score per detector
Sanitized input on SANITIZE
Full trace ID for audit

Recommended

POST /v1/chat/completions

AI Interaction Firewall

OpenAI-compatible drop-in proxy. WrapSec forwards to your LLM provider with your provider key stored encrypted at rest, never returned to your app - enforcing policy on both input and output before your app sees any response.

Input + output enforcement
Provider key never exposed to app
Decision headers on response
Full proxy interaction audit

Why WrapSec instead of provider moderation or DIY filters?

Provider moderation (OpenAI, Anthropic)

-Post-generation - the model sees the prompt first
-No input enforcement, no PII redaction
-No audit trail you own
-Breaks on provider change

DIY filters / Guardrails libraries

-No ML-based detection out of the box
-No output inspection
-No audit logging or policy scoping
-Maintenance burden on your team

WrapSec

+Pre-enforcement - unsafe prompts never reach the model
+Input + output, PII + toxicity + injection
+Full audit trail, SIEM-ready export
+Works across any provider or self-hosted model

For security teams, not just developers

Full visibility. Full control.

Every request is traceable. Every decision is explainable. Designed for environments where compliance, auditability, and data residency are requirements - not afterthoughts.

Audit-ready logging

Every request logged with decision, severity, risk score, trace ID, and threat categories. CSV export endpoint included - structured for SIEM ingestion.

On-premises data control

Your prompts, responses, and audit logs are stored in your own PostgreSQL instance. No data is transmitted to external services. You own and control all audit data.

Department-level policy

Scope detection thresholds and guardrail settings per business unit independently. Finance and support can operate under separate policies.

Role-based access

ADMIN, DEVELOPER, and VIEWER roles with JWT, token versioning, session invalidation, and forced password rotation built in.

Integration

Works with every provider

OpenAI-compatible API. Point WrapSec at your existing integration and every request is protected immediately.

Drop-in compatibility - no changes required to your existing LLM integration.

OpenAI

GPT-4o, GPT-4 Turbo, GPT-3.5

Groq

Llama 3, Mixtral, Gemma

Ollama

Any self-hosted model

OpenAI-compatible

Any OpenAI-compatible API endpoint

Python SDK

Sync + async, full CLI

Node.js SDK

Native fetch, zero dependencies

FastAPI

Middleware + explicit scan

Prometheus

14 security metrics + Grafana

Start in under 5 minutes

Self-hosted. No account required. Clone the repo, run Docker Compose, and your first scan is one API call away.

View on GitHub Getting started guide

Prevent unsafe promptsfrom reaching your LLM