# de-identification
Here are 23 public repositories matching this topic...
ARX is a comprehensive open source data anonymization tool that aims to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks, and well-known privacy models such as k-anonymity, l-diversity, t-closeness and differential privacy.
Updated Jul 6, 2022 - Java
Privacy Engineering Collaboration Space
Updated Mar 4, 2022 - Python
Example scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.
redaction
hipaa
deidentification
tokenize
dlp
gdpr
anonymization
masking
privacy-tools
synthetic-data
data-anonymization
data-loss-prevention
redact
de-identification
synthetic-dataset-generation
de-identify
data-masking
synthetic-data-generator
text-anonymization
cpra
Updated Jun 20, 2022 - Python
sensitive data protection toolkit
Updated Jul 8, 2022 - Python
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
privacy
metrics
speech-synthesis
speech-recognition
kaldi
speaker-recognition
voice-conversion
speech-processing
asr
voice-anonymization
privacy-protection
anonymization
asv
de-identification
attack-model
hifi-gan
voice-privacy-challenge
voice-privacy
anonymization-metrics
mcadams
Updated May 6, 2022 - Python
Masking identifiable information from health related documents.
Updated Jun 23, 2022 - Python
Named entity recognition framework
Updated Jun 14, 2020 - Python
Application of our De-identification Framework with open source technologies, enabling enterprises to take ownership of the de-identification process and deploy it in trusted environments.
Updated Nov 15, 2021 - Python
Pseudonymization library
Updated Nov 7, 2021 - Python
A curated list of resources related to privacy engineering
differential-privacy
risk-management
privacy-enhancing-technologies
anonymization
privacy-tools
de-identification
federated-learning
privacy-by-design
privacy-by-default
consent-management
privacy-engineering
pseudonymization
Updated Jul 3, 2022
Web-based tool for data de-identification
Updated Oct 6, 2021 - Jupyter Notebook
This is the easiest way to de-identify license plates.
Updated Feb 18, 2021 - Python
Python package to replace identifiable strings in multiple files and folders at once.
Updated May 16, 2022 - Python
Source code for the paper "Generating Synthetic Training Data for Supervised De-Identification of Electronic Health Records" in Future Internet (2021).
machine-learning
natural-language-processing
privacy
electronic-health-records
synthetic-data
de-identification
Updated May 21, 2021 - Python
Web-based tool for data de-identification
Updated May 17, 2022 - Jupyter Notebook
[Natural Language Processing 109-1 @ NCCU] Doctor-patient data de-identification competition
Updated Jan 30, 2022 - Jupyter Notebook
A scrub system for de-identification and cleaning of data to maintain its privacy from the world.
Updated Jun 22, 2022 - Python
AWS Blueprint: Automate data masking workflow
aws
obfuscation
anonymisation
aws-cloudformation
anonymization
masking
de-identification
pseudonymisation
pseudonymization
datamasking
Updated May 13, 2022 - Python
The NER task for De-Identification of PHI in the competition held by AICUP 2020.
Updated Feb 23, 2021 - Python
Presidio leverages ML models, which might detect an instance of an entity in one sentence but miss it in another.
By automatically adding all instances of a previously identified entity, we can increase detection recall (and potentially decrease precision).
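The propagation idea can be sketched in plain Python. This is a minimal illustration only and does not use Presidio's API: `Span` and `propagate_detections` are hypothetical names, and the detector output is assumed to be a list of character offsets with an entity type. Whole-word matching is one possible design choice here, made to limit the precision loss mentioned above.

```python
import re
from typing import List, NamedTuple


class Span(NamedTuple):
    """Hypothetical detection result: character offsets plus an entity type."""
    start: int
    end: int
    entity_type: str


def propagate_detections(text: str, detected: List[Span]) -> List[Span]:
    """Add a span for every other occurrence of an already-detected string."""
    seen = {(s.start, s.end) for s in detected}
    result = list(detected)
    for span in detected:
        surface = text[span.start:span.end]
        if not surface.strip():
            continue  # skip empty or whitespace-only detections
        # Match whole words only, so "Ann" does not also flag "Annual".
        pattern = r"(?<!\w)" + re.escape(surface) + r"(?!\w)"
        for match in re.finditer(pattern, text):
            key = (match.start(), match.end())
            if key not in seen:
                seen.add(key)
                result.append(Span(match.start(), match.end(), span.entity_type))
    # Return spans in document order.
    return sorted(result)


# Assume the model found only the first mention.
text = "TrueForce shipped a patch. Later, TrueForce apologized."
first_only = [Span(0, 9, "ORGANIZATION")]
for span in propagate_detections(text, first_only):
    print(span, text[span.start:span.end])
```

Matching is exact and case-sensitive in this sketch, so it will not catch variants such as "Trueforce" or "TrueForce's"; loosening the match would raise recall further at some additional cost to precision.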
Example (hypothetical):
If the first "TrueForce" was detected