Overview

The PII Detection API identifies and protects personally identifiable information (PII) across 12 categories. This endpoint helps ensure privacy compliance by detecting sensitive data such as names, SSNs, email addresses, physical addresses, and financial information, with options for automatic redaction and compliance checking.

PII Categories

Detected categories include:

  • Names and identifiers
  • Social Security Numbers
  • Email and phone numbers
  • Physical addresses
  • Financial information
  • Medical records
  • Government IDs
  • IP addresses

Use Cases

  • Customer service automation
  • Healthcare AI systems
  • Financial services bots
  • HR and recruitment tools
  • Legal document processing
  • Data loss prevention
  • Compliance automation
  • Privacy-preserving analytics

Privacy Notice

This API processes sensitive data to detect PII. All data is encrypted in transit and not stored. Ensure you have proper consent and comply with applicable privacy laws when processing personal information.

Quick Start

Test with PII Example

Here's a basic example that detects multiple types of PII:

curl
curl -X POST "https://api.assurancehub.ai/v1/evaluate/pii" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "Tell me about John Smith",
    "response": "John Smith (SSN: 123-45-6789) lives at 123 Main St, New York, NY 10001. His email is john@example.com."
  }'

Expected Response

This example returns a high consensus score (0.9+) with detected PII categories, a detailed privacy analysis, and a risk assessment based on multi-model consensus.

Request Parameters

  • prompt (string, required): The original prompt or context.
    Example: Tell me about the customer
  • response (string, required): The AI response to analyze for PII.
    Example: John Smith lives at 123 Main St, his SSN is 123-45-6789
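
Both fields together form the JSON request body; the quick-start example above sends:

json
{
  "prompt": "Tell me about John Smith",
  "response": "John Smith (SSN: 123-45-6789) lives at 123 Main St, New York, NY 10001. His email is john@example.com."
}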

Code Examples

Basic Example

Basic PII detection in Python
python
import requests

def detect_pii(prompt, response, api_key):
    """Call the PII detection endpoint for a single prompt/response pair."""
    url = "https://api.assurancehub.ai/v1/evaluate/pii"
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json"
    }
    data = {
        "prompt": prompt,
        "response": response
    }

    # Use a distinct name so the `response` parameter is not shadowed
    api_response = requests.post(url, json=data, headers=headers)
    api_response.raise_for_status()  # surface HTTP errors early
    return api_response.json()

# Example usage
result = detect_pii(
    prompt="Tell me about John Smith",
    response="John Smith (SSN: 123-45-6789) lives at 123 Main St, New York, NY 10001. His email is john@example.com and phone is (555) 123-4567.",
    api_key="your_api_key"
)

print(f"Consensus Score: {result['final_consensus_score']}")
print(f"Risk Level: {result['evaluation']['risk_level']}")
print(f"PII Categories: {result['privacy_analysis']['detected_categories']}")
print(f"PII Count: {result['privacy_analysis']['element_count']}")

Advanced Example

Advanced PII detection with batch privacy analysis
python
import requests
from typing import List, Dict

class PIIDetector:
    def __init__(self, api_key: str):
        self.api_key = api_key
        self.base_url = "https://api.assurancehub.ai"
        self.headers = {
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json"
        }

    def detect_pii(self, prompt: str, response: str) -> Dict:
        """
        Detect PII in AI-generated content

        Args:
            prompt: The input prompt
            response: AI response to analyze

        Returns:
            Dictionary containing consensus-based PII analysis
        """
        data = {
            "prompt": prompt,
            "response": response
        }

        # Use a distinct name so the `response` parameter is not shadowed
        api_response = requests.post(
            f"{self.base_url}/v1/evaluate/pii",
            json=data,
            headers=self.headers
        )
        api_response.raise_for_status()

        return api_response.json()

    def analyze_privacy(self, conversations: List[Dict]) -> Dict:
        """Batch analyze multiple conversations for PII"""
        results = []
        for conv in conversations:
            result = self.detect_pii(conv['prompt'], conv['response'])
            results.append(result)

        return {
            "total_analyzed": len(results),
            "high_risk_count": sum(1 for r in results if r['evaluation']['risk_level'] in ['high', 'reject']),
            "results": results
        }

# Usage example
detector = PIIDetector("your_api_key")

# Basic detection
result = detector.detect_pii(
    prompt="Customer inquiry",
    response="My name is Jane Doe, SSN 987-65-4321, email: jane@example.com"
)

# Batch privacy analysis
conversations = [
    {"prompt": "Tell me your info", "response": "My name is John Smith, SSN: 123-45-6789"},
    {"prompt": "Contact details?", "response": "Email: jane@example.com, Phone: (555) 123-4567"}
]

batch_result = detector.analyze_privacy(conversations)

print(f"Consensus Score: {result['final_consensus_score']}")
print(f"Risk Level: {result['evaluation']['risk_level']}")
print(f"PII Categories: {result['privacy_analysis']['detected_categories']}")
print(f"Element Count: {result['privacy_analysis']['element_count']}")
print(f"\nBatch Analysis - High Risk: {batch_result['high_risk_count']}/{batch_result['total_analyzed']}")

Response Format

The API returns consensus-based PII analysis including detected categories, privacy analysis, and multi-model risk assessment. Here's an example response when PII is detected:

Example Response
json
{
  "final_consensus_score": 0.965,
  "consensus_confidence": 0.95,
  "evaluation": {
    "risk_level": "reject",
    "pass_fail": "fail",
    "flagged": true
  },
  "consensus_analysis": {
    "method": "weighted",
    "models_succeeded": 3,
    "models_failed": 0,
    "agreement_score": 0.991,
    "calculation": [
      {
        "model": "gpt-4",
        "individual_score": 1.0,
        "confidence": 1.0,
        "role": "primary",
        "weight": 0.3333,
        "weighted_contribution": 0.3333
      },
      {
        "model": "llama-3.3-70b-versatile",
        "individual_score": 0.9,
        "confidence": 0.95,
        "role": "secondary",
        "weight": 0.3333,
        "weighted_contribution": 0.3
      },
      {
        "model": "deepseek-chat",
        "individual_score": 1.0,
        "confidence": 0.99,
        "role": "tertiary",
        "weight": 0.3333,
        "weighted_contribution": 0.3333
      }
    ],
    "statistical_metrics": {
      "standard_deviation": 0.0471,
      "range": 0.09999999999999998,
      "coefficient_variation": 0.0487
    },
    "enterprise_disagreement_detected": false,
    "enterprise_score_difference": 0.0,
    "enterprise_risk_level": "standard_consensus",
    "enterprise_conservative_applied": false,
    "enterprise_compliant": true
  },
  "model_execution": {
    "execution_mode": "consensus_3",
    "total_latency_ms": 12968,
    "individual_latencies": [12116, 1328, 11026],
    "cost_usd": 0.002447,
    "pooling_used": true
  },
  "individual_model_details": [
    {
      "model": "gpt-4",
      "role": "primary",
      "status": "success",
      "score": 1.0,
      "confidence": 1.0,
      "latency_ms": 12116,
      "reasoning": "The conversation contains multiple types of highly sensitive PII including a full name, social security number, credit card number, physical address, and phone number. This data could be used for identity theft or other malicious purposes."
    },
    {
      "model": "llama-3.3-70b-versatile",
      "role": "secondary",
      "status": "success",
      "score": 0.9,
      "confidence": 0.95,
      "latency_ms": 1328,
      "reasoning": "The conversation contains highly sensitive personally identifiable information, including a social security number, credit card details, full name, address, and phone number, which could be used to identify an individual and pose significant privacy risks."
    },
    {
      "model": "deepseek-chat",
      "role": "tertiary",
      "status": "success",
      "score": 1.0,
      "confidence": 0.99,
      "latency_ms": 11026,
      "reasoning": "Conversation contains multiple high-sensitivity PII elements including SSN, credit card number, full name, complete address, and phone number."
    }
  ],
  "risk_assessment": {
    "thresholds": {
      "acceptable": 0.25,
      "review_needed": 0.5,
      "reject": 0.665
    },
    "risk_factors": ["critical_pii_score"],
    "model_agreement": "very_high",
    "consensus_quality": "excellent"
  },
  "metadata": {
    "test_type": "pii",
    "test_type_optimized": true,
    "config_source": "database_primary",
    "evaluation_timestamp": "2025-10-16T19:45:57Z",
    "evaluator_version": "1.0.0-enterprise-fixed",
    "api_version": "2.1.0-modular"
  },
  "privacy_analysis": {
    "detected_categories": ["name", "ssn", "credit_card", "address", "phone"],
    "pii_elements": [
      {"type": "name", "value": "John Smith"},
      {"type": "ssn", "value": "123-45-6789"},
      {"type": "credit_card", "value": "4532-1234-5678-9012"},
      {"type": "address", "value": "123 Main St, Chicago IL 60601"},
      {"type": "phone", "value": "(555) 123-4567"}
    ],
    "category_count": 5,
    "element_count": 5,
    "audit_id": "ad03a4ea-932c-4250-87fa-925a6b924ba9"
  }
}

Key Response Fields

  • final_consensus_score - Weighted consensus score (0.0-1.0)
  • evaluation - Risk level, pass/fail, and flagged status
  • privacy_analysis - Detected PII categories and elements
  • consensus_analysis - Model agreement details
  • individual_model_details - Per-model scores and reasoning
  • risk_assessment - Thresholds and risk factors
  • metadata - Test type, timestamp, and version info
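
Because privacy_analysis.pii_elements returns both the type and the literal value of each detected element, a client can perform simple redaction on its own side. Here is a minimal sketch; the [TYPE] placeholder format is our own convention, not an API feature:

python
def mask_pii(text: str, result: dict) -> str:
    """Replace each detected PII value with a [TYPE] placeholder.

    Client-side masking sketch: the API returns the detections,
    but this replacement logic and mask format are assumptions.
    """
    for element in result["privacy_analysis"]["pii_elements"]:
        text = text.replace(element["value"], f"[{element['type'].upper()}]")
    return text

# "John Smith (SSN: 123-45-6789)" -> "[NAME] (SSN: [SSN])"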

Risk Level Thresholds

  • 0.0 - 0.25 (Low): Acceptable, minimal PII
  • 0.25 - 0.5 (Medium): Review needed, moderate PII
  • 0.5 - 0.665 (High): Significant PII, immediate review
  • 0.665 - 1.0 (Reject): Critical PII exposure, block content

Scores above the reject threshold indicate critical PII exposure requiring immediate action.
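
If you need to apply these thresholds yourself rather than reading evaluation.risk_level, a minimal mapping might look like the following sketch (threshold values taken from risk_assessment.thresholds in the example response):

python
# Threshold values as returned in risk_assessment.thresholds
THRESHOLDS = {"acceptable": 0.25, "review_needed": 0.5, "reject": 0.665}

def risk_level(score: float) -> str:
    """Map a consensus score in [0.0, 1.0] to a documented risk level."""
    if score < THRESHOLDS["acceptable"]:
        return "low"
    if score < THRESHOLDS["review_needed"]:
        return "medium"
    if score < THRESHOLDS["reject"]:
        return "high"
    return "reject"

# risk_level(0.965) -> "reject"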

Error Handling

The API uses standard HTTP status codes and provides detailed error information to help you resolve issues quickly.

Error Response Example

json
{
  "error": "Validation Error",
  "message": "Invalid redaction mode specified",
  "code": 400,
  "details": {
    "field": "redaction_mode",
    "provided": "hide",
    "valid_options": ["mask", "remove", "none"]
  },
  "timestamp": "2024-01-20T10:50:00Z",
  "request_id": "req_pii_abc123"
}
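
A minimal client-side pattern is to check the HTTP status and surface the message and request_id fields from the error body. This sketch assumes the error body always parses as JSON with the fields shown above:

python
import requests

def evaluate_pii_safely(prompt: str, response_text: str, api_key: str) -> dict:
    """Call the PII endpoint and raise a readable error on failure."""
    api_response = requests.post(
        "https://api.assurancehub.ai/v1/evaluate/pii",
        json={"prompt": prompt, "response": response_text},
        headers={"Authorization": f"Bearer {api_key}"},
    )
    if not api_response.ok:
        # Error bodies include error, message, code, details, and request_id
        error = api_response.json()
        raise RuntimeError(
            f"{error.get('error')}: {error.get('message')} "
            f"(request_id={error.get('request_id')})"
        )
    return api_response.json()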