PII Masker is an advanced open-source tool that protects your sensitive data using state-of-the-art AI, powered by DeBERTa-v3
Features • Installation • Quick Start • How It Works • Contributing
PII Masker is an advanced open-source tool designed to protect your sensitive data by leveraging cutting-edge AI models. Built on top of DeBERTa-v3, this tool ensures high-precision detection and masking of Personally Identifiable Information (PII), making it a perfect fit for any data-sensitive workflows. Whether you're handling customer data, performing data analysis, or ensuring compliance with privacy regulations, PII Masker provides a robust, scalable solution to keep your information secure.
When handling sensitive information, it's crucial to use tools that not only perform well but also ensure compliance and protect privacy. Here's why PII Masker stands out:
git clone https://github.com/yourusername/pii-masker.git
cd pii-masker
pip install -r requirements.txt
# Option 1: Manual download
# Visit: https://huggingface.co/collections/hydroxai/pii-models-674649fea0de7ab99ed11347
# Place files in: pii-masker/output_model/deberta3base_1024/
pii-masker
directory:
cd pii-masker
from model import PIIMasker
# Initialize the PIIMasker
masker = PIIMasker()
# Mask PII in your text
text = "John Doe lives at 1234 Elm St."
masked_text, pii_dict = masker.mask_pii(text)
print(masked_text)
# Output: "[NAME] lives at [ADDRESS]"
PII Masker employs a sophisticated pipeline powered by DeBERTa-v3:
We are excited to announce a significant addition to the PII Masker project: a new model with a different approach from DeBERTa. Here are the details:
? Model Link:
hydroxai/pii_model_longtransfomer_version
Model detail:
train_pii_longtransformer.ipynb
? Performance Improvement:
This new model implementation has resulted in approximately a 4% improvement in performance compared to the previous DeBERTa-v3 model. The combination of Longformer's extended sequence length (4096 tokens) and the Bi-LSTM head enhances the sequential context understanding, making PII detection more accurate and reliable.
Check out our detailed examples:
Contributions make the open-source community an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
git commit -m 'Add some AmazingFeature'
)git push origin feature/AmazingFeature
)Special thanks to:
Made with ❤️ for the privacy-conscious developer community