Overview
piisafe.eu integrates with two enterprise-grade PII detection APIs to scan websites for exposed personal information:
- cloak.business: Enterprise solution with 320+ entity types and advanced language support
- anonym.legal: Starter-friendly API with 285+ entity types and accessible pricing
Both APIs use hybrid detection combining machine learning models with deterministic regex patterns for highly accurate PII identification across 48+ languages.
API Providers Comparison
cloak.business
Enterprise Detection
anonym.legal
Starter-Friendly Solution
Detection Capabilities
| Entity Category | cloak.business | anonym.legal | Examples |
|---|---|---|---|
| Identifiers | ✓ | ✓ | SSN, Tax ID, Passport, Driver License |
| Financial | ✓ | ✓ | Credit Card, IBAN, SWIFT, Bank Account |
| Contact | ✓ | ✓ | Email, Phone, Address, IP Address |
| Biometric | ✓ | ✓ | Fingerprint, Facial Recognition, DNA |
| Healthcare | ✓ | ✓ | Medical Records, Insurance ID, Prescription |
| Regional Specific | ✓ (All 48 languages) | ✓ (All 48 languages) | German Tax, French CNI, Italian CODICE |
Getting Started
Step 1: Get an API Key
For cloak.business:
- Visit
https://cloak.business - Sign up for an account
- Navigate to "Account" → "API Keys"
- Generate a new API key
- Copy the key to your clipboard
For anonym.legal:
- Visit
https://anonym.legal - Start with the free tier or choose a pricing plan
- Access "Settings" → "API Credentials"
- Your API key is auto-generated
- Copy the key to use in piisafe.eu
Step 2: Enter API Key in piisafe.eu
- Go to
https://piisafe.eu/scanner.html - Click "Scanner" in the navigation
- Select your API provider (Step 1)
- Enter your API key (Step 2)
- Click "Validate Key" to confirm
Step 3: Configure Detection Settings
After validation, choose:
- Compliance Preset: GDPR, HIPAA, PCI-DSS, CCPA, or custom
- Language: 48+ languages for region-specific patterns
- Entity Threshold: Confidence score (60-95%)
API Flow Diagram
Here's how piisafe.eu orchestrates the scanning process:
Supported Entity Types
Both APIs detect and classify the following PII entity categories:
Core Entity Categories (40+)
- Government IDs: SSN, Tax ID, Passport, Visa, Driver License, National ID
- Financial: Credit Card, Debit Card, IBAN, SWIFT, Bank Account, Cryptocurrency Wallet
- Contact: Email Address, Phone Number, Physical Address, Postal Code, IP Address
- Medical: Medical Record Number, Health Insurance ID, Prescription, Healthcare Provider ID
- Biometric: Fingerprint, Face Recognition, DNA Profile, Iris Scan
- Online: Username, Password, API Key, URL, Domain, Social Media Handle
- Corporate: Employee ID, Business Email, Company Phone, Corporate Account
- Legal: Case Number, Court Document, Patent Number, Trademark
Regional & Language-Specific Entities
Each of the 48 supported languages includes region-specific identifiers:
- Germany: Steuernummer, Versicherungsnummer, KfZ-Versicherung
- France: Numéro de Sécurité Sociale, CNI, Numéro de SIRET
- Spain: NIE, NIF, DNI, Número de Seguridad Social
- Italy: CODICE FISCALE, Numero di Patente, Numero di Carta d'Identità
- USA: SSN, EIN, State ID, Driver License (state-specific variations)
- Canada: SIN, Provincial Health Card, Province-specific IDs
- And 42+ more language variants...
Chunking Strategy (Smart Splitting)
Since both APIs have a 50,000 character limit per request, piisafe.eu uses intelligent chunking to analyze pages of any size:
How Chunking Works
- Measure: Check extracted text length
- Split: If > 49,500 chars, split at word boundaries
- Process: Send each chunk to API sequentially
- Offset: Adjust entity positions to original text location
- Aggregate: Combine results across all chunks
Chunking Configuration
| Parameter | Value | Rationale |
|---|---|---|
| Max Characters | 50,000 | API hard limit |
| Safety Margin | 49,500 | Prevents boundary issues |
| Split Method | Word Boundary | Preserves sentence/word integrity |
| Processing | Sequential | Respects rate limits |
| Retry Logic | 3 attempts per chunk | Handles transient failures |
Example: 65KB Page Scanning
Cost Trade-off: A 65KB page requires 2 API calls instead of 1, doubling token usage for that page. However, this is better than partial scanning (23% data loss).
Rate Limits & Quotas
piisafe.eu Rate Limiting (Server-Side)
| Limit | Value | Applies To |
|---|---|---|
| Requests per IP | 100 per 15 minutes | All endpoints |
| Concurrent Scans | 10 per IP | Running scans only |
| Max Pages/Scan | 200 | URL discovery |
| Response Size | 5MB max | Large result exports |
| Session Timeout | 30 minutes | Abandoned scans |
API Provider Limits
cloak.business:
- Token-based pricing model
- Higher rate limits for premium tiers
- No hard API call limits (pay per token)
- Typical token cost: 100-500 tokens per page
anonym.legal:
- €3/month entry tier (limited requests)
- Higher plans available (contact sales)
- Rate limits vary by tier
- Free tier available for testing
Handling Rate Limit Errors
If you receive a 429 (Too Many Requests) error:
- Wait 15-30 seconds before retrying
- Reduce concurrent scan count
- For cloak.business: Check token balance
- For anonym.legal: Verify subscription is active
- Contact provider support if issue persists
Pricing & Costs
cloak.business Pricing
| Plan | Cost | Tokens/Month | Features |
|---|---|---|---|
| Starter | Free | Limited | Testing & evaluation |
| Professional | Pay-as-you-go | Flexible | All features, volume discounts |
| Enterprise | Custom | Unlimited | Custom limits, support, SLA |
Typical Token Costs:
- Small page (5KB): ~50 tokens
- Medium page (25KB): ~250 tokens
- Large page (50KB): ~500 tokens
- Chunked page (100KB = 2 chunks): ~1,000 tokens
anonym.legal Pricing
| Plan | Cost | API Calls | Features |
|---|---|---|---|
| Free Tier | €0 | 10/month | Basic testing |
| Starter | €3/month | 100/month | Individual use |
| Pro | €19/month | 1,000/month | Teams, higher limits |
| Enterprise | Custom | Custom | Unlimited, support, SLA |
Cost Estimation
Example: Scanning 10 websites (average 20 pages per site = 200 pages total)
With cloak.business (pay-as-you-go):
- Average 200 tokens per page
- 200 pages × 200 tokens = 40,000 tokens
- Estimated cost: €20-50 depending on token pricing tier
With anonym.legal (Starter):
- €3/month base cost
- 100 API calls/month included
- Overflow calls billed separately (typically €0.01-0.05 per call)
- 200 pages may require additional tier or overflow costs
Error Handling & Troubleshooting
Common Error Codes
Network & Connectivity Issues
Timeout errors: If scan stalls after 30+ seconds, the API may be slow or unreachable:
- Check your internet connection
- Verify browser console for network errors (F12)
- Try a smaller scan (fewer pages)
- Switch to alternate API provider
CORS (Cross-Origin) errors: If you see "CORS policy" error in console:
- This is expected for cross-domain API calls
- piisafe.eu uses CORS proxying on backend
- No action needed—should resolve automatically
- If persists, contact support
Debug Mode
Open browser DevTools (F12) to see detailed error logs:
For support, collect these details:
- Error message (exact text)
- URL being scanned
- API provider (cloak vs anonym)
- Browser console screenshot
- Timestamp of error
Code Examples
Example 1: Validating API Key (JavaScript/Frontend)
Example 2: Sending Text for Analysis
Example 3: Chunking Text (50K Limit)
Example 4: Handling Real-Time Progress (Server-Sent Events)
Example 5: Exporting Results
Frequently Asked Questions
Getting API Keys
Q: Can I use the same API key across multiple devices?
A: Yes. API keys are account-based, not device-specific. Store securely and avoid sharing publicly.
Q: What if I lose my API key?
A: Regenerate it in your provider dashboard. Old key becomes invalid immediately. Update piisafe.eu with new key.
Q: Is there a free tier?
A: cloak.business offers limited free tier for testing. anonym.legal includes €0 tier (10 API calls/month) plus €3/month starter plan.
Scanning & Detection
Q: What happens if a page has no PII?
A: Scan completes successfully with an "A" grade. Findings list is empty. API call still counts against quota.
Q: Can I scan password-protected websites?
A: No. piisafe.eu scans public HTML only. For protected content, export HTML manually, then upload as raw text.
Q: How accurate is PII detection?
A: Both APIs use ML + regex hybrid models. Accuracy: 85-95% depending on entity type and regional variations. Some false positives/negatives possible. Manual review recommended.
Q: Does the chunking affect detection accuracy?
A: No. Chunking splits at word boundaries, preserving context. Results are identical to single-chunk processing.
Pricing & Costs
Q: What's the cheapest way to scan many pages?
A: anonym.legal Pro (€19/month) offers 1,000 API calls/month, lowest cost per scan. cloak.business better for occasional high-volume scans with flexible scaling.
Q: Do chunked pages cost more?
A: Yes. A 100KB page = 2 API calls = 2× token cost. However, 100% of text is analyzed vs. partial scanning before.
Q: Can I buy tokens in advance?
A: cloak.business: Yes, token packs available. anonym.legal: Subscription-based, can upgrade tier anytime.
Data Privacy
Q: Does piisafe.eu store my API key?
A: No. Keys stored only in your browser's localStorage. Never transmitted to piisafe.eu servers. Zero-knowledge architecture.
Q: Does piisafe.eu store scan results?
A: No. Results stay in browser memory. Session data cleared when page closes. No server-side persistence.
Q: Can I use piisafe.eu for client/customer websites?
A: Yes! Perfect for consultants, security teams, compliance officers. Audit trail kept locally. No data leaves your device.
Technical
Q: What browsers are supported?
A: Modern browsers (Chrome, Firefox, Safari, Edge 2020+). Requires JavaScript and fetch API support. Mobile browsers supported.
Q: Can I integrate piisafe.eu into my own app?
A: Yes! Clone the repo from GitHub, customize backend routes, integrate with your own infrastructure. Full source code available.
Q: What's the difference between piisafe.eu and the APIs directly?
A: piisafe.eu adds: automatic chunking, visual UI, real-time progress, multiple export formats, easy entity selection, preset compliance profiles.
Q: Can I scan multiple websites simultaneously?
A: Limit is 10 concurrent scans per IP. Start sequential scans or wait for earlier ones to finish.