Accuracy
Up to 99.999% detection accuracy across supported entity types, combining AI model recognition with pattern-based rules.
Pricing
Starting at $0.10 per million tokens processed. Only detected and transformed tokens count toward usage.
- Detection — identify sensitive entities and surface them for review, without modifying the data
- Anonymization (redaction) — replace detected entities with mask tokens to remove sensitive information
- Augmentation — transform detected entities into synthetic but realistic replacements, preserving data utility while eliminating real sensitive values
- PII — Personally Identifiable Information (names, dates, contact details, demographics)
- PHI — Protected Health Information (diagnoses, medications, medical record numbers)
- PCI — Payment Card Industry data (credit cards, bank accounts, routing numbers)
- Financial Data — Tax IDs, IBANs, and other financial identifiers
- Credentials / Secrets — Passwords, API keys, tokens
- Government ID Data — Passports, driver licenses, national IDs, voter IDs
- Device / Digital Identifiers — IP addresses, MAC addresses, device IDs, URLs
- Employment / Professional Data — Employee IDs, salaries, job titles, company names

Creating a job
Step 1: Select dataset
Choose a seed dataset from your library as the input for the job. Give the job a descriptive name so you can identify it later.
Step 2: Detection configuration
Configure how sensitive entities are detected and which model evaluates the results.
Detection methods
AIMon-PII-M1 (Recommended)
AIMon-PII-M1 (Recommended)
Combines the AIMon AI detection model with pattern-based rules. Offers the best balance of precision and recall for most use cases.
AIMon-PII-M1 (Model Only)
AIMon-PII-M1 (Model Only)
Uses the AIMon PII detection model exclusively, relying on learned entity recognition without rule-based augmentation.
LLM + AIMon-PII-Simple
LLM + AIMon-PII-Simple
Combines an LLM for contextual detection with fast pattern-based rules. Useful when you want LLM judgment alongside deterministic patterns.
LLM Only
LLM Only
Delegates all detection to an LLM. The most flexible option for unusual or domain-specific entity types.
AIMon-PII-Simple
AIMon-PII-Simple
Pattern-based detection only. Fastest option with deterministic behavior, but lower recall on context-dependent entities.
All Methods
All Methods
Combines AIMon-PII-M1, LLM, and AIMon-PII-Simple in a union. Best for maximum coverage when false negatives are unacceptable.
Confidence threshold
The confidence threshold controls the trade-off between recall and precision. Lower values (e.g., 0.1) produce more detections with more potential false positives. Higher values (e.g., 0.9) produce fewer detections but with higher certainty. The default of 0.30 works well for most datasets.Evaluation judge model
After the transform completes, an LLM evaluates the quality of the anonymization or augmentation. Select the model you want to use for this post-processing evaluation step.Step 3: Entity types & masks
Select which entity types to detect and configure how each one is handled in the output—either replaced with a mask token (anonymization) or substituted with a synthetic value (augmentation).
PII – Personal
PII – Personal
Person Name, Date of Birth, Date, Age, Gender, Nationality, Ethnicity, Marital Status
PII – Contact
PII – Contact
Email, Phone Number, Address, ZIP Code, City, State, Country
PHI – Medical
PHI – Medical
Medical Record Number, Diagnosis, Medication, Health Plan Number, Patient ID, Lab Result
PCI – Payment Card
PCI – Payment Card
Credit Card, Bank Account, Routing Number
Financial Data
Financial Data
Social Security Number, Tax ID, IBAN
Credentials / Secrets
Credentials / Secrets
Password, Username, API Key, Token
Government ID Data
Government ID Data
Passport Number, Driver License, National ID, Voter ID
Device / Digital Identifiers
Device / Digital Identifiers
IP Address, URL, MAC Address, Device ID
Employment / Professional Data
Employment / Professional Data
Company Name, Job Title, Employee ID, Salary
person_name → <NAME> or date_of_birth → <DOB>. You can customize the mask token for each type. For augmentation, detected values are replaced with synthetic equivalents that preserve the format and context of the original.
Step 4: Review & submit
Review your full configuration before submitting. The summary shows the job name, dataset, detection method, confidence threshold, evaluation judge, and all selected entity types with their configured transforms (mask tokens for anonymization, or synthetic replacement rules for augmentation).

