Overview / Description

Mask is an open-source AI Data Loss Prevention (DLP) tool that intercepts and encrypts sensitive data flowing through LLM agent pipelines, designed for engineering teams building compliant AI applications. It sits between the LLM context window and tool execution environments, applying Format-Preserving Encryption (FPE) so that PII is replaced with format-identical ciphertext tokens before reaching the model — and silently restored only when an authorized backend function actually needs the real value.

The core mechanism is a three-phase local-first, just-in-time (JIT) workflow: masking replaces detected entities with HMAC-derived tokens; a pre-tool decryption hook unmaskes values before calling downstream tools; and a post-tool re-masking hook catches any new PII returned in tool output before it flows back to the LLM. Because tokenization is HMAC-based and deterministic within a session, the LLM's reasoning context stays coherent without ever seeing raw personal data.

Detection uses a two-tier waterfall: a fast deterministic tier (registry lookups, checksums, context rules) handles high-confidence structured PII such as SSNs, IBANs, credit card numbers, and passport IDs; a slower probabilistic tier using transformer-based Named Entity Recognition (NER) catches fuzzy entities like names, locations, and organizations. The system supports 50+ PII entity types across financial, contact, identity, healthcare, and vehicle categories in English and Spanish.

SDKs are available for Python (with LangChain, LlamaIndex, and Google ADK integrations) and TypeScript (Node.js). Vault state can be synchronized across clusters via Redis, DynamoDB, or Memcached. Structured JSON audit logs are emitted asynchronously for ingestion into SIEM platforms such as Datadog and Splunk. The library is released under the Apache-2.0 license and is intended to help teams meet SOC2, HIPAA, and PCI-DSS obligations.

Used For

Preventing PII leakage in LLM agent pipelines, Achieving SOC2 and HIPAA compliance for AI applications, PCI-DSS compliant credit card handling in AI workflows, Encrypting sensitive data before it enters LLM context windows, Just-in-time decryption for authorized tool calls in agentic systems, Audit logging for AI data flows into Datadog or Splunk, Building GDPR-aware AI agents handling EU personal data, Protecting healthcare identifiers and medical IDs in AI pipelines, Securing financial data (IBANs, SSNs, routing numbers) in LLM applications, Multi-language PII detection (English and Spanish) in AI systems

Pricing

Plan

Free

Mask

Overview / Description

Used For

Pricing

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Plan

Pros & Cons

Pros

Cons

Alternatives

Reviews & Ratings