Today, Dataiku, the Platform for AI Success, announced the general availability of Kiji Privacy Proxy, a new open-source privacy layer that ensures personally identifiable information (PII) never leaves an organization's control, even when using third-party AI services such as OpenAI or Anthropic. Kiji Privacy Proxy is the latest innovation from Dataiku's 575 Lab, the company's open-source initiative focused on advancing transparency, privacy, and responsible AI.
Automatically detects and replaces sensitive data (names, email addresses, financial details) with realistic placeholders before sending requests.
Securely restores the original data when responses return.
Requires no changes to prompts, applications, or infrastructure.
Allows domain-level customization for proprietary identifiers and industry-specific formats.
Available as an open-source project on GitHub.
Runs as a desktop application for individual users or as a server-based solution for enterprise teams.
As organizations scale AI, the risk of exposing sensitive data has become a critical barrier to adoption. Kiji Privacy Proxy addresses this challenge directly, acting as a seamless intermediary between enterprise applications and external AI models. It automatically detects and replaces sensitive data, such as names, email addresses, or financial details, with realistic placeholders before any request is sent. When a response returns, the original data is securely restored, ensuring a frictionless user experience without compromising privacy.
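The replace-and-restore round trip described above can be illustrated with a minimal Python sketch. This is not Kiji Privacy Proxy's implementation or API: the regular expression, the `mask` and `restore` function names, and the `userN@example.com` placeholder format are all assumptions made for illustration, and only email addresses are handled.

```python
import re

# Hypothetical, simplified email matcher; a real detector would cover many more PII types.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def mask(text):
    """Replace each email address with a realistic placeholder.

    Returns the masked text and a placeholder -> original mapping
    that stays on the caller's side and is never sent out.
    """
    mapping = {}

    def _substitute(match):
        original = match.group(0)
        # Reuse the same placeholder when a value appears more than once.
        for placeholder, known in mapping.items():
            if known == original:
                return placeholder
        placeholder = f"user{len(mapping) + 1}@example.com"
        mapping[placeholder] = original
        return placeholder

    return EMAIL_RE.sub(_substitute, text), mapping


def restore(text, mapping):
    """Swap placeholders in a model response back to the original values."""
    for placeholder, original in mapping.items():
        text = text.replace(placeholder, original)
    return text


prompt = "Email alice@corp.com and bob@corp.com, cc alice@corp.com"
masked_prompt, mapping = mask(prompt)
# masked_prompt is what would be sent to the external model.
response = "Reply to user2@example.com"  # stand-in for a model response
print(restore(response, mapping))
```

In this sketch the mapping is held only in local memory, which mirrors the idea that the original values stay inside the organization while the external service sees placeholders.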
Kiji Privacy Proxy integrates directly into existing workflows with minimal disruption, whether running as a desktop application for individual users or scaling as a server-based solution for enterprise teams. Unlike traditional approaches, it requires no changes to prompts, applications, or infrastructure, making privacy protection both immediate and practical. Beyond standard PII detection, Kiji Privacy Proxy allows for domain-level customization. Enterprises can tailor detection to their own data, from proprietary identifiers to industry- and region-specific formats, enabling the level of precision and control required in even the most regulated sectors.
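As a rough illustration of what domain-level customization could look like, the Python sketch below extends a default detector list with an organization-specific pattern. Everything here is hypothetical: the `Detector` class, the `find_entities` function, and the `EMP-` employee ID format are invented for the example and do not reflect how Kiji Privacy Proxy is actually configured.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Detector:
    """A labeled pattern for one kind of sensitive value (hypothetical type)."""
    label: str
    pattern: re.Pattern


# Assumed default: a single, simplified email detector.
DEFAULT_DETECTORS = [
    Detector("EMAIL", re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")),
]


def find_entities(text, detectors):
    """Return (label, value) pairs for every match, in detector order."""
    hits = []
    for detector in detectors:
        for match in detector.pattern.finditer(text):
            hits.append((detector.label, match.group(0)))
    return hits


# An enterprise adds its own proprietary identifier format (invented here).
custom_detectors = DEFAULT_DETECTORS + [
    Detector("EMPLOYEE_ID", re.compile(r"\bEMP-\d{6}\b")),
]

text = "Ticket from EMP-004211, contact jo@corp.com"
print(find_entities(text, DEFAULT_DETECTORS))
print(find_entities(text, custom_detectors))
```

With only the default list, the employee ID passes through undetected; once the custom detector is added, both values are flagged.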
Kiji Privacy Proxy also signals a broader shift toward open, collaborative AI infrastructure, where trust, transparency, and governance are built in rather than bolted on. Built alongside contributors across the machine learning ecosystem, the project brings together capabilities in workflow orchestration, data annotation, and synthetic data generation to help raise the industry standard for responsible AI.
As Hannes Hapke, Head of Dataiku's 575 Lab, stated: "With the rise of AI using closed-source models, employees are pouring sensitive data into AI systems faster than enterprises can secure it. That's not just a technical issue, it's a governance failure waiting to happen. Kiji Privacy Proxy gives organizations a way to scale AI responsibly, without putting sensitive data and ultimately customer trust on the line."
About Dataiku
Dataiku is the Platform for AI Success, the enterprise orchestration layer for building, deploying, and governing AI. In a single environment, teams design and operate analytics, machine learning, and AI agents with the transparency, collaboration, and control enterprises require. Sitting above data platforms, cloud infrastructure, and AI services, Dataiku connects the full enterprise AI stack — empowering organizations to run AI across multi-vendor environments with centralized governance.