#
pii
Here are 93 public repositories matching this topic...
Secure SDK/vault for personal records/PII built to comply with GDPR
security
privacy
encryption
database
vault
application-server
compliance
passportjs
tokenization
gdpr
data-protection
legaltech
anonymization
pii
data-anonymization
secure-storage
privacy-by-design
user-consent
piidata
ccpa
-
Updated
Jul 21, 2022 - Go
What's in your data? Extract schema, statistics and entities from datasets
python
nlp
security
data-science
privacy
csv
avro
tabular-data
pandas
dataset
data-analysis
gdpr
npi
sensitive-data
pii
dataprofiling
data-profiler
data-labels
-
Updated
Aug 8, 2022 - Python
vrajat
commented
Feb 14, 2020
It is not surprising that deep and shallow scan show different results. Shallow scan only looks at column names. Deep scan looks at a sample of the data. I've even noticed that two different runs of deep scan show different results as sample rows are different. This is the challenge with not scanning all of the data. Its a trade-off between performance/cost and accuracy. There is no right answer.
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
-
Updated
Aug 5, 2022 - Python
A Mongoose plugin that lets you transparently cipher stored PII and use securely-hashed passwords
-
Updated
Feb 11, 2022 - JavaScript
Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets.
-
Updated
Aug 19, 2021 - Python
Hides personal information from pages, similar to Discord's Streamer mode.
-
Updated
Feb 18, 2022 - JavaScript
A package to build an end-to-end pipeline for detecting personally identifiable information from text.
-
Updated
Jun 2, 2019 - Python
Library for identification, anonymization and de-anonymization of PII data
-
Updated
Jun 22, 2022 - Python
Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform
python
api
sdk
data-privacy
data-protection
pii
data-loss-prevention
data-classification
secrets-detection
nightfall
-
Updated
Jul 19, 2022 - Python
Project Matt: Scan your AWS S3 Buckets for PII Data to Guard against GDPR
redis
elasticsearch
machine-learning
scala
aws-s3
aws-cloudformation
aws-batch
gdpr
pii
piii-filters
-
Updated
May 25, 2018 - Scala
Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully customizable and flexible rules
-
Updated
Aug 8, 2022 - Python
Search for PII in Python
-
Updated
Jul 6, 2022 - Python
A project to build a machine learning pipeline to detect personal identifiable information (PII)
-
Updated
Jun 27, 2022 - Jupyter Notebook
Mirror of Gitloab repo PostgreSQL Anonymizer
-
Updated
Mar 23, 2020 - PLpgSQL
Case study using dotfurther's Open Discover Platform with the RavenDB document store to rapidly create a full-text search/eDiscovery/information governance capable demonstration application.
metadata
text-extraction
full-text
full-text-search
ravendb
ediscovery
indexing-engine
file-format-detection
data-breach
file-deduplication
pii
information-governance-catalog
personally-identifiable-information
archive-extractor
pii-detection
file-identification
full-text-extraction
document-ingestion
information-governance
-
Updated
May 7, 2021
.NET API for document file format identification, text/metadata/attachment/embedded object/sensitive item (PII/PHI)/entity extraction.
metadata
sdk
csharp
dotnet
email
text
extractor
extraction
indexing
text-extraction
archive
pst
embedded-objects
file-format-detection
file-deduplication
pii
microsoft-office
pdf-text-extract
pii-detection
file-identification
-
Updated
Jul 26, 2022 - C#
PureKit SDK allows developers to protect users' passwords and sensitive personal information in a database from data breaches and both online and offline attacks and make stolen passwords useless even if a database is breached.
cryptography
sdk
encryption
password
hipaa
gdpr
pii
password-hardened-encryption
phe
passw0rd
piidata
-
Updated
May 8, 2020 - Java
PureKit PHP SDK allows developers to protect users' passwords and sensitive personal information in a database from data breaches and both online and offline attacks and make stolen passwords useless even if a database is breached.
cryptography
sdk
encryption
password
hipaa
gdpr
pii
password-hardened-encryption
phe
passw0rd
piidata
-
Updated
Jun 21, 2022 - PHP
An example demonstrating how Very Good Security can secure a Rails application without any code changes and instantly make it PCI DSS Level 2 compliant.
security
pci-dss
hipaa
kyc
sensitive-data-security
sensitive-data
data-security
pii
pci-payment-processor
credit-card-payments
personally-identifiable-information
platform-eng-team
-
Updated
Aug 2, 2022 - Ruby
PureKit SDK allows developers to protect users' passwords and sensitive personal information in a database from data breaches and both online and offline attacks and make stolen passwords useless even if a database is breached.
cryptography
sdk
encryption
password
hipaa
gdpr
pii
password-hardened-encryption
phe
passw0rd
piidata
-
Updated
Feb 11, 2022 - C#
A project to build a machine learning pipeline to detect personal identifiable information (PII)
-
Updated
Oct 21, 2019 - Jupyter Notebook
Detects exploitable storage of private info in cookies
-
Updated
Dec 11, 2020 - JavaScript
This project demonstrate that how a government or any super authority can utilize the Ethereum blockchain for the attestation of citizen's PII data. By empowering the user and allowing user to verify his information on any third party website using his Ethereum account through Web.3.0 wallets.
nodejs
java
ethereum
dapp
decentralized
blockchain
verification
administrator
solidity
web3j
node-js
user
age
web3js
attestation
metamask
truffle-framework
ethereum-blockchain
pii
smartcontract
pii-data
3rd-parties
-
Updated
Feb 10, 2022 - Java
Export Dialogflow conversation logs to BigQuery with masking PII using DLP API
bot
export
bigquery
redaction
conversations
gae
chatbot
gcp
google-cloud
dataflow
masking
pii
gcp-pubsub
gcp-appengine-std
dialogflow
dialogflow-v2
gcp-dataflow
dialogflow-fulfillment
cloud-dlp
cloud-tasks
-
Updated
Apr 9, 2019 - JavaScript
Improve this page
Add a description, image, and links to the pii topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the pii topic, visit your repo's landing page and select "manage topics."
Presidio leverages ML models which might detect an instance in one sentence but not in another.
By automatically adding all instances of a previously identified entity, we can increase detection recall (and potentially decrease precision)
Example (hypothetical):
If the first "TrueForce" was detected