Features

Stop redacting documents by hand. Let AI do the heavy lifting.

PII Anomalyzer pairs dual AI models for PII detection with a manual draw-to-redact toolbar, so you can catch what automated detection misses without ever uploading a file.

Detection

Context-aware AI, not just regex

PII Anomalyzer runs two independent AI models on every scan and combines their results. The dual-model approach catches what single-model tools miss, and because both models read the surrounding context, "Jordan" is recognized as a person in one sentence and a country in the next.
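
How the two models' results are combined is not described here. As a rough sketch of one common approach, the example below takes the union of both models' detections and, where two spans overlap, keeps the one with the higher confidence. The `Span` type, the labels, and the scores are all hypothetical and are not PII Anomalyzer's actual data model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    start: int    # inclusive character offset
    end: int      # exclusive character offset
    label: str    # entity type, e.g. "PERSON" (hypothetical label names)
    score: float  # model confidence in [0, 1]


def overlaps(a: Span, b: Span) -> bool:
    """True when the two spans share at least one character."""
    return a.start < b.end and b.start < a.end


def merge_detections(model_a: list[Span], model_b: list[Span]) -> list[Span]:
    """Union both models' spans; on overlap, keep the higher-scoring span."""
    merged: list[Span] = []
    # Visit the most confident spans first so they claim their text range.
    for span in sorted(model_a + model_b, key=lambda s: s.score, reverse=True):
        if not any(overlaps(span, kept) for kept in merged):
            merged.append(span)
    return sorted(merged, key=lambda s: s.start)


text = "Jordan flew to Jordan."
model_a = [Span(0, 6, "PERSON", 0.91)]
model_b = [Span(0, 6, "LOCATION", 0.55), Span(15, 21, "LOCATION", 0.88)]

for span in merge_detections(model_a, model_b):
    print(text[span.start:span.end], span.label)
```

In this made-up input the first "Jordan" stays a person because that detection is more confident, while the second "Jordan", found by only one model, is still kept as a location.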

  • 55 built-in entity types across 11 categories, detected automatically
  • Dual-model detection for higher accuracy
  • Pattern recognizers for US, UK, AU, and NZ postal formats
  • Surface form propagation — detect once, match everywhere
  • Form field PII detection for fillable PDFs
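
"Detect once, match everywhere" can be pictured as a second pass over the text: once a string has been flagged, every other whole-word occurrence of that exact string is flagged too. The sketch below illustrates the idea under that assumption. The function name and the `detected` mapping are hypothetical, and the product's real propagation rules are not documented here.

```python
import re


def propagate_surface_forms(text: str, detected: dict[str, str]) -> list[tuple[int, int, str]]:
    """Find every whole-word occurrence of each already-detected surface form.

    `detected` maps a surface form (the exact string a model flagged) to its
    entity label. Returns (start, end, label) tuples sorted by position.
    """
    matches = []
    for form, label in detected.items():
        # Lookarounds stop "Ann" from matching inside "Annual".
        pattern = re.compile(r"(?<!\w)" + re.escape(form) + r"(?!\w)")
        for m in pattern.finditer(text):
            matches.append((m.start(), m.end(), label))
    return sorted(matches)


text = "Contact Maria Lopez today. Maria Lopez signed on page 3."
hits = propagate_surface_forms(text, {"Maria Lopez": "PERSON"})
print(hits)  # [(8, 19, 'PERSON'), (27, 38, 'PERSON')]
```

A purely string-based pass like this would also tag every "Jordan" the same way, so a real implementation presumably weighs propagation against the context-aware results rather than applying it blindly.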

Entity Categories

  • Government IDs (12)
  • Financial (10)
  • Healthcare (6)
  • Digital Presence (6)
  • Travel & Transportation (5)
  • Location (4)
  • Insurance & Compliance (3)
  • Contact Information (2)
  • Organization & Work (2)
  • Personal Identifiers (2)
  • Assets & Devices (3)
Entity Reference

Built-in entity types

PII Anomalyzer detects personal identifiers, financial data, healthcare records, government IDs, and more — all using context-aware AI, not just regex.

Government IDs (12)
  • US Social Security Number
  • US Driver's License
  • US Passport Number
  • Passport Expiration Date
  • Identity Card Number
  • Identity Document Number
  • National ID Number
  • Birth Certificate Number
  • Visa Number
  • Tax Identification Number
  • Brazil CPF
  • Brazil CNPJ

Financial (10)
  • Credit Card Number
  • Credit Card Expiration
  • Card Security Code (CVV)
  • Credit Card Brand
  • Cryptocurrency Wallet
  • Currency Amount
  • IBAN
  • Transaction Number
  • US Bank Account Number
  • US ITIN

Healthcare (6)
  • Health Insurance ID
  • Medical Condition
  • Medical License
  • Medication
  • National Health ID
  • UK NHS Number

Digital Presence (6)
  • IP Address
  • MAC Address
  • Web URL
  • Username
  • Social Media Handle
  • Digital Signature

Travel & Transportation (5)
  • Flight Number
  • License Plate
  • Reservation Number
  • Train Ticket Number
  • Vehicle Registration

Location (4)
  • Address
  • Location
  • Postal Code
  • Facility

Insurance & Compliance (3)
  • Insurance Policy Number
  • Insurance Provider
  • Registration Number

Contact Information (2)
  • Email Address
  • Phone Number

Organization & Work (2)
  • Organization
  • Student ID

Personal Identifiers (2)
  • Person Name
  • Date / Time

Assets & Devices (3)
  • Serial Number
  • Nationality / Religious / Political
  • Generic PII

Supported Formats

  • PDF: native text, forms, scanned
  • DOCX: Word documents
  • XLSX: Excel spreadsheets
  • XLS / XLSM / XLSB: legacy and macro-enabled workbooks
  • Plain Text: paste or load directly
  • Scanned Docs: built-in OCR (multiple engines)

Document Mode

Import any document, see results side by side

Import a PDF, Word document, or Excel spreadsheet. For PDFs, PII Anomalyzer auto-detects the document type (native text, fillable form, or scanned image) and applies the matching processing pipeline. A side-by-side viewer shows the original next to the de-identified version with native vector PDF rendering. New to this? Follow the step-by-step guide to redacting a PDF.

  • Auto-detects PDF type: native, form, or scanned
  • Non-PDF formats converted automatically
  • Side-by-side original vs. de-identified view
  • Coordinate-aware text processing
  • Built-in OCR for scanned documents
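
The exact rules behind PDF type detection are not published. As a hedged illustration, a classifier of this kind might look at two signals: whether the file has fillable fields, and how much extractable text each page carries. The `PdfSummary` fields and the 25-characters-per-page threshold below are assumptions made for the sketch, not values taken from the product.

```python
from dataclasses import dataclass
from enum import Enum


class PdfKind(Enum):
    NATIVE = "native"
    FORM = "form"
    SCANNED = "scanned"


@dataclass
class PdfSummary:
    # Hypothetical statistics gathered by whatever PDF parser is in use.
    page_count: int
    extractable_chars: int  # characters found in the text layer
    form_field_count: int   # fillable form fields


def classify_pdf(summary: PdfSummary, min_chars_per_page: int = 25) -> PdfKind:
    """Pick a processing pipeline from coarse document statistics."""
    if summary.form_field_count > 0:
        return PdfKind.FORM
    chars_per_page = summary.extractable_chars / max(summary.page_count, 1)
    if chars_per_page < min_chars_per_page:
        return PdfKind.SCANNED  # almost no text layer, so route to OCR
    return PdfKind.NATIVE


print(classify_pdf(PdfSummary(page_count=4, extractable_chars=0, form_field_count=0)))
print(classify_pdf(PdfSummary(page_count=2, extractable_chars=5200, form_field_count=0)))
print(classify_pdf(PdfSummary(page_count=1, extractable_chars=900, form_field_count=14)))
```

Checking form fields first is a design choice in this sketch: a fillable form usually has a text layer as well, so it would otherwise be classified as native.
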
De-identification

Four ways to protect sensitive data

Choose the right method for your workflow — from permanent removal to non-destructive annotation.

  • Redact (destructive): Black boxes permanently cover PII, and the underlying text is removed from the PDF text layer.
  • Replace (destructive): Numbered identifiers like <<PERSON1>> replace the original text. A legend page is appended for reference.
  • Highlight (non-destructive): Semi-transparent overlays in entity-specific colors mark detected PII for review.
  • Mask (destructive): Character-level replacement that preserves text length while hiding the value.
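
To make Replace and Mask concrete, here is a minimal sketch that works on plain text. It assumes that a repeated value reuses its identifier and that masking keeps separators such as hyphens. Both are illustrative assumptions, and the function names are hypothetical.

```python
def replace_with_identifiers(text: str, spans: list[tuple[int, int, str]]):
    """Swap each (start, end, label) span for a numbered identifier.

    Returns the rewritten text and a legend mapping identifier -> original.
    Assumes the spans do not overlap.
    """
    legend: dict[str, str] = {}
    assigned: dict[tuple[str, str], str] = {}  # (label, original) -> identifier
    counters: dict[str, int] = {}
    parts, cursor = [], 0
    for start, end, label in sorted(spans):
        original = text[start:end]
        key = (label, original)
        if key not in assigned:  # first time this value is seen
            counters[label] = counters.get(label, 0) + 1
            assigned[key] = f"<<{label}{counters[label]}>>"
            legend[assigned[key]] = original
        parts += [text[cursor:start], assigned[key]]
        cursor = end
    parts.append(text[cursor:])
    return "".join(parts), legend


def mask(value: str, mask_char: str = "*") -> str:
    """Hide letters and digits but keep length and separators."""
    return "".join(mask_char if ch.isalnum() else ch for ch in value)


text = "Ana Ruiz emailed Ben Cho. Ana Ruiz replied."
spans = [(0, 8, "PERSON"), (17, 24, "PERSON"), (26, 34, "PERSON")]
replaced, legend = replace_with_identifiers(text, spans)
print(replaced)          # <<PERSON1>> emailed <<PERSON2>>. <<PERSON1>> replied.
print(legend)            # {'<<PERSON1>>': 'Ana Ruiz', '<<PERSON2>>': 'Ben Cho'}
print(mask("555-0142"))  # ***-****
```

The legend returned here plays the role of the appended legend page: it is the only link between an identifier and the value it replaced.
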
Draw-to-Redact

Manual precision for what AI misses

Handwritten text, logos, signatures, and visual PII in images — some content needs a human eye. Draw redaction rectangles directly on the PDF, resize with drag handles, and nudge with arrow keys for pixel-perfect placement.

  • Select, Draw, Delete, and Clear All toolbar modes
  • Click-and-drag to create redaction boxes on any page
  • Resize by dragging edges, nudge with arrow keys
  • Always renders as solid black — explicit "remove this" intent
  • Persists between de-identification runs for iterative refinement
  • Zoom-independent — coordinates stored in PDF points
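
Storing rectangles in PDF points (1 point = 1/72 inch) is what makes them zoom-independent: the zoom level only matters when converting to and from screen pixels. The sketch below shows that conversion under two simplifying assumptions: a 96 DPI display, and a shared top-left origin (PDF's bottom-left origin and any page rotation are ignored). `RedactionRect` and the helper names are hypothetical.

```python
from dataclasses import dataclass, replace

POINTS_PER_INCH = 72.0  # default PDF user-space unit: 1 point = 1/72 inch


@dataclass(frozen=True)
class RedactionRect:
    page: int
    x: float       # all geometry is stored in PDF points
    y: float
    width: float
    height: float


def screen_to_points(px: float, zoom: float, screen_dpi: float = 96.0) -> float:
    """Convert a pixel distance at the given zoom (1.0 = 100%) to points."""
    return px * POINTS_PER_INCH / (screen_dpi * zoom)


def points_to_screen(pt: float, zoom: float, screen_dpi: float = 96.0) -> float:
    """Convert a distance in points to pixels at the given zoom."""
    return pt * screen_dpi * zoom / POINTS_PER_INCH


def nudge(rect: RedactionRect, dx_pt: float = 0.0, dy_pt: float = 0.0) -> RedactionRect:
    """Return a copy of the rectangle shifted by the given offset in points."""
    return replace(rect, x=rect.x + dx_pt, y=rect.y + dy_pt)


# A box drawn at 200% zoom, starting at pixel (192, 96), 384 px wide and 48 px high.
zoom = 2.0
rect = RedactionRect(
    page=1,
    x=screen_to_points(192, zoom),
    y=screen_to_points(96, zoom),
    width=screen_to_points(384, zoom),
    height=screen_to_points(48, zoom),
)
print(rect)                           # x=72.0, y=36.0, width=144.0, height=18.0
print(points_to_screen(rect.x, 1.0))  # 96.0 px from the page edge at 100% zoom
print(nudge(rect, dx_pt=1.0).x)       # 73.0
```

Because the stored values never change with zoom, the same rectangle lands on the same part of the page whether it is drawn at 50% or 400%.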

Toolbar

  • Select: click to select, drag to move
  • Draw: click and drag to create redaction boxes
  • Delete: remove the selected rectangle
  • Clear All: remove all rectangles

Batch Processing

Process multiple documents at once

Queue multiple documents for processing in a single workflow. Each document gets its own de-identified output, modified PDF, and results table — combined into a single exportable dataset.

Re-identification & Export

De-identify, share safely, translate back

Export a de-identified document using the Replace method, share it with AI tools or external reviewers, then use re-identification mode to translate their responses back to the original context. You can also use re-identification mode to verify redactions before final release. Export de-identified text, results tables (XLSX), and modified PDFs.

  • .txt: de-identified text
  • .xlsx: results table with entity type, confidence, and method
  • .pdf: modified PDF with redactions applied
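
Re-identification can be thought of as the Replace step run in reverse: identifiers such as <<PERSON1>> in a reviewer's or AI tool's response are looked up in the legend and swapped back. The sketch below assumes the legend is available as a simple identifier-to-original mapping and that identifiers follow the <<LABEL + number>> shape shown above. The regex and function name are illustrative, not the product's implementation.

```python
import re

# Matches identifiers shaped like <<PERSON1>> or <<PHONE_NUMBER12>> (assumed format).
PLACEHOLDER = re.compile(r"<<[A-Z_]+\d+>>")


def reidentify(text: str, legend: dict[str, str]) -> str:
    """Swap each known identifier back to its original value.

    Identifiers missing from the legend are left untouched so they stay
    visible for review instead of being silently dropped.
    """
    return PLACEHOLDER.sub(lambda m: legend.get(m.group(0), m.group(0)), text)


legend = {"<<PERSON1>>": "Ana Ruiz", "<<ORGANIZATION1>>": "Acme Corp"}
response = "<<PERSON1>> should notify <<ORGANIZATION1>> and <<PERSON2>>."
print(reidentify(response, legend))
# Ana Ruiz should notify Acme Corp and <<PERSON2>>.
```
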
Privacy by Design

100% offline PII detection, redaction, and anonymization.

Every AI model, every algorithm, every computation runs on your machine. No document data leaves your computer. No telemetry, no analytics, no usage data sent back to us. There is no server holding your documents because there is no server.

  • Offline processing: detection, redaction, and anonymization work without an internet connection
  • All models bundled: NLP and OCR engines ship with the app, with nothing to download separately
  • No telemetry: no analytics, and no usage data leaves your machine

See it in action

Download the free trial and start detecting PII in your documents today.

7-day free trial · $249/year · Windows & macOS