PII Identification
Voir en françaisUse Claira to detect and extract personally identifiable information from documents in Nuix Discover.
PII Identification
Data breach notifications, regulatory compliance, and privacy assessments all require you to know what personal information lives in your documents. Manually scanning thousands of files for names, emails, Social Security numbers, and addresses is slow and error-prone.
Claira can read through extracted text and flag personally identifiable information (PII), saving your team significant time while reducing the risk of missing something.
How Claira helps
- Detects multiple PII types. Names, email addresses, phone numbers, SSNs, physical addresses, financial account numbers, and more.
- Supports compliance workflows. Whether you are responding to a breach notification requirement, a subject access request, or a regulatory audit, Claira helps you identify what needs attention.
- Works at scale. Run PII identification across a full document set in a single bulk scan.
When to use this
- Breach notification and incident response
- Privacy impact assessments
- Preparing documents for production with redaction lists
- Subject access requests under GDPR, CCPA, or similar regulations
Sample prompts
You can tailor your PII prompt depending on how targeted or comprehensive you need the results to be.
Comprehensive PII extraction (unique surface forms)
Use this when you need every distinct written form of PII, without normalization, in a single machine-friendly line of quoted values. The format below is a good input for Search Term Families in Nuix Discover: each value (or a group you treat as one family) can become grouped search or QC terms across the collection, without retyping every variant the model found.
Targeted extraction
Use this when you know exactly which PII types you are looking for and want a simple paired list instead of a full unique-form sweep.
Tips for better results
- Search Term Families. The comprehensive prompt output is a strong input for Search Term Families in Nuix Discover: each returned string (or a set you group manually) can seed a family for search and QC. Adjust membership as needed before running matter-wide.
- Specify the format you need. If downstream tools expect a different structure (for example, a narrative redaction log), say so in the prompt, or use the targeted example when paired fields are enough.
- Handle raw PII in stored review fields with care. The comprehensive format lists values as they appear in the text. Limit field visibility and follow your organization's data-handling and retention policy.
- Combine with human review. PII identification is high-stakes. Use Claira output as a starting point, then have a reviewer confirm before acting on the results.
Need help? Contact support@claira.to
Was this page helpful?