AI inside your perimeter. Under your controls.

Private LLMs in your VPC or on-prem. PII redaction, RBAC + SSO, mTLS, audit logs, and the operational guardrails your security team needs to say yes — first review, not the third.

Book a discovery call How it deploys

Production AI, private by design

AI inside your perimeter, under your controls.

For teams that can't ship sensitive data to public APIs. We deploy private LLMs into your AWS, GCP, or Azure account, or onto on-prem hardware. Inference happens inside your perimeter; data never crosses it.

The deployment stack — PII redaction, RBAC, SSO/SAML, audit logs, encryption — is wired to the identity and observability tools your security team already runs. We hand you a compliance dossier, not a "trust us" checklist.

Llama, Mistral, GPT-4-via-Azure, or your private model — your choice
Inference in your VPC or on-prem hardware
PII redaction + DLP rules at the prompt boundary
SAML / OIDC SSO, RBAC, per-tenant isolation
mTLS service-to-service; encryption at rest
Audit logs, SOC 2 / HIPAA / GDPR-ready dossier templates

Book a discovery call

security.console · prod-vpc

All systems nominal

Transport

mTLS

cert auto-rotated

Access

RBAC + SSO

SAML via Okta

PII redact

Pre-inference DLP

Egress

No external calls

Recent events

live

[14:32]✓Prompt redacted (PII match: SSN)
[14:28]✓user@acme.com signed in via SSO
[14:21]✓KMS key rotated · automatic
[14:15]✓Audit log exported to S3 bucket

SOC 2HIPAAGDPRISO 27001

How we deploy

From security questionnaire to audited production.

01
Security workshop
Map your compliance regime, data classifications, identity provider, and threat model. The deploy plan flows from this conversation.
02
Architect the deploy
Pick the inference target (VPC, on-prem, hybrid), wire the identity provider, draft the data-flow diagram. All reviewed before any apply.
03
Build + harden
Stand up the inference layer, plug in PII redaction + audit logging + RBAC, run the pen-test, document everything in the compliance dossier.
04
Audit + go-live
Hand your security team a finished dossier. After sign-off, traffic flips. Monitoring already lit up; on-call documented; SOC 2 / HIPAA / GDPR templates included.

By the numbers

Compliance posture, not vibes.

0
Data egress
Inference happens inside your perimeter; data never crosses it.
100%
Audit-logged
Every prompt, response, and retrieval — append-only, exportable.
4–6 wk
Review timeline
From kickoff to ATO sign-off for teams with prior LLM reviews.
SOC 2 · HIPAA · GDPR
Compliance
Dossier templates included; FedRAMP via your ATO sponsor.

What you get

Compliance-grade AI without sacrificing capability.

VPC or on-prem
Deploy into your AWS, GCP, Azure account, or on-prem hardware. Data never crosses your perimeter.
Private LLM
Llama, Mistral, or GPT-4 via Azure OpenAI — your choice, behind your firewall.
PII redaction
Pre-inference DLP pass redacts PII before any prompt hits the model. Configurable per class.
RBAC + SSO
SAML / OIDC SSO, role-based access, per-tenant isolation — tied into your identity provider.
mTLS everywhere
Service-to-service traffic mTLS-encrypted. Certificate rotation automated; key material in your KMS.
Encryption at rest
All vector stores, conversation logs, and model artifacts encrypted with keys you control.
Audit logs
Every prompt, every response, every retrieval — append-only, exportable, compliance-ready.
Compliance dossier
Templates for SOC 2, HIPAA BAA, GDPR — the paperwork your legal team needs, drafted.
Threat-modeled
Designed against your specific threat model — not a generic vendor checklist.

Built for these teams

Where "we can’t ship that to OpenAI" is non-negotiable.

Healthcare
PHI-bound clinical, claims, and patient workflows
HIPAA-compliant private deploy; PHI never leaves the VPC; BAA in place.
Financial Services
PII-bound KYC, fraud, compliance workflows
SOC 2 + GLBA-ready; bank IT signs off before traffic flips.
Government + Defense
FedRAMP / IL-bound workloads
Air-gapped or on-prem deploys; cleared-personnel-only operational access.
Legal
Privileged client documents, contract review
Attorney-client privilege preserved; no data egress to model providers.
Enterprise IT
Internal copilots for SOC2 / ISO-bound orgs
Production AI inside the enterprise perimeter, audited end-to-end.
EU / GDPR-bound teams
Data-residency-constrained AI workloads
EU-region deployments with full data-residency controls and DPA in place.

Common questions

What security teams ask before they sign off on a private deploy.

Which models can run privately?
Open-weight (Llama 3, Mistral, Qwen, Phi) on your hardware or in your VPC; commercial (GPT-4 via Azure OpenAI, Claude via AWS Bedrock) without data egress to the model provider. We pick the model based on your accuracy, latency, and compliance requirements — and we'll show you the eval-suite comparison so the choice is defensible.
Do you support on-prem deploys?
Yes. Fully on-prem on your hardware, or air-gapped on classified networks. We can also do hybrid — inference on-prem, vector store + observability in your VPC. The deployment topology is part of the security workshop in week 1.
How is PII redaction enforced?
A DLP layer runs between user input and the model. Configurable per data class (SSN, account number, PHI, custom regex). PII is redacted to placeholder tokens before any prompt reaches inference; the original values are stored in your encrypted store, and the model never sees them. Audit logs capture which classes were redacted per request.
What compliance frameworks are covered?
Templates and operational guardrails for SOC 2 Type II, HIPAA (BAA signed), GDPR (DPA + EU-region deploys + data-residency controls), CCPA, and ISO 27001. For FedRAMP / IL-class deploys we work in collaboration with your ATO sponsor.
How long does a security review take?
For teams who have done LLM reviews before, 4–6 weeks from kickoff to ATO/sign-off. For first-time AI reviews, 8–12 weeks. The longest phase is always the security questionnaire + documentation, which we draft for you instead of leaving it as homework.
What about prompt injection and jailbreaks?
Multi-layer defense: input sanitization, instruction-hierarchy prompt design, refusal-rate eval suite, an output filter for sensitive content patterns, and runtime monitoring that flags suspicious request patterns. We test against a curated red-team set every release.

Have a security review to clear?

AI your security team actually approves.

Book a discovery call. We'll review your compliance regime, your identity provider, and your data classifications, then come back with a deploy architecture and a paperwork plan.

Book a discovery call See products