Training Data

AI

The dataset used to fit a machine-learning model. The personal-data classifications, lawful bases, and retention obligations that applied to the source data continue to apply to its presence in training corpora.

Detect & redact sensitive data in your documents

PII Anomalyzer scans PDFs, Word, and Excel files for 55+ entity types using on-device AI. Your data never leaves your machine.