#自然语言处理#An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
#大语言模型#Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSO...
#自然语言处理#What's in your data? Extract schema, statistics and entities from datasets
#安全#Secure Vault for Customer PII/PHI/PCI/KYC Records
#自然语言处理#An AI-powered Personal Identifiable Information (PII) scanner.
A powerful scanner to scan your Filesystem, S3, MySQL, Redis, Google Cloud Storage and Firebase storage for PII and sensitive data.
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
#自然语言处理#This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or ...
🚨 slog: Attribute formatting
Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP
KloudDB Shield is a comprehensive Postgres Security Tool - PII Scanner , CIS Benchmarks , SSL audit , 12+ features .. Supports Postgres, RDS ,Aurora, MySQL
A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)
#大语言模型#🛡️ PII Guard is an LLM-powered tool that detects and manages Personally Identifiable Information (PII) in logs — designed to support data privacy and GDPR compliance
Scala library and compiler plugin that prevent inadvertent leakage of sensitive fields in `case classes` (such as credentials, personal data, and other confidential information)
The open source PII and PHI redaction and de-identification engine
#自然语言处理#A package to build an end-to-end pipeline for detecting personally identifiable information from text.
#自然语言处理#Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets.