OpenAI has unveiled the OpenAI Privacy Filter, an open-source model designed to detect and redact personally identifiable information (PII) in text. The model, featuring 1.5 billion total parameters and 50 million active parameters, supports a context window of up to 128,000 tokens. It utilizes a bidirectional token classification architecture to identify eight categories of PII, including names, addresses, and email addresses, achieving a 96% F1 score on the PII-Masking-300k benchmark.
The OpenAI Privacy Filter is now accessible on Hugging Face and GitHub under the Apache 2.0 license, allowing developers to deploy and fine-tune the model locally. This release aims to enhance privacy protection in text processing applications by providing a robust tool for anonymizing sensitive information.
OpenAI Launches Open-Source Privacy Filter for PII Detection
Disclaimer: The content provided on Phemex News is for informational purposes only. We do not guarantee the quality, accuracy, or completeness of the information sourced from third-party articles. The content on this page does not constitute financial or investment advice. We strongly encourage you to conduct you own research and consult with a qualified financial advisor before making any investment decisions.
