Privacy-Preserving Authentication Audit Data Platform
Privacy Preserving Authentication Audit Data Platform
A governed event-ingestion project for authentication telemetry. The platform demonstrates queue intake, contract validation, PII tokenization, curated persistence, quarantine handling, and batch audit evidence through a working technical surface.
This page is backed by the same transformation module used by the ingestion worker. The sample records below are loaded from repository data, transformed by Python at request time, and displayed through API-backed previews.
Platform Overview
Source Registry
The platform is now wired for a real authentication research dataset. The running page uses the local fixture until the RBA dataset is downloaded and normalized, then switches to the normalized RBA sample automatically.
End-to-End Data Flow
The flow is intentionally small but complete: source contract, queue intake, validation, privacy transform, database persistence, and public operational surface.
Live Transform Preview
The source sample includes direct identifiers. The curated output replaces sensitive fields with deterministic HMAC tokens and records the source payload hash for replay and audit alignment.
Source Payload
Waiting for source sample...
Curated Output
Waiting for transform...
Data Tables
The browser shows compact 10-row previews only. The full 1.1 GB RBA source dataset is linked in Source Registry and is not hosted inside this page.
Schema and Source Contract
PostgreSQL Schema
Loading schema...
Login Event Contract
Loading contract...
Quality and Privacy Gates
These controls are implemented in code and mapped to explicit failure behavior. Invalid records are not silently corrected; they are separated for investigation and replay governance.
Engineering Toolchain
Implemented Stack
Planned Enterprise Expansion
Project Knowledge Bank
The documentation below is the same content prepared for the internal wiki page. It is displayed here so the public project surface contains the architecture narrative, not only the demo output.