# de-identification

Here are 45 public repositories matching this topic...

microsoft / presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Updated Dec 4, 2025
Python

arx-deidentifier / arx

ARX is a comprehensive open source data anonymization tool aiming to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks, and well-known privacy models such as k-anonymity, l-diversity, t-closeness, and differential privacy.

open-source privacy cross-platform data-analytics arx data-anonymization de-identification

Updated Oct 1, 2025
Java

google / magritte

MediaPipe-based library to redact faces from videos and images

face-detection anonymization de-identification pii-detection

Updated Sep 29, 2023
C++

mplspunk / awesome-privacy-engineering

A curated list of resources related to privacy engineering

differential-privacy risk-management privacy-enhancing-technologies anonymization privacy-tools de-identification federated-learning privacy-by-design privacy-by-default consent-management privacy-engineering pseudonymization

Updated Sep 28, 2024

privateai / deid-examples

Example scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask, and synthesize PII in text.

Updated Oct 1, 2025
Jupyter Notebook

Voice-Privacy-Challenge / Voice-Privacy-Challenge-2022

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software

Updated Oct 17, 2024
Python

jftuga / deidentification

De-identify people's names and gender-specific pronouns

python nlp natural-language-processing python3 named-entity-recognition ner deidentification anonymization pii data-anonymization deidentify de-identification data-scrubbing text-anonymization pii-anonymization

Updated May 3, 2025
Python

OsiriX-Foundation / karnak

DICOM gateway for publishing images in Kheops and for de-identification

dicom deidentification de-identification pseudonymisation

Updated Dec 1, 2025
Java

nf-core / detaxizer

A pipeline to identify (and remove) certain sequences from raw genomic data. The default taxon to identify (and remove) is Homo sapiens. Removal is optional.

workflow nanopore pipeline nextflow shotgun filter metagenomics edna microbiome fastq metabarcoding long-reads taxonomic-classification de-identification nf-core short-reads taxonomic-profiling decontamination

Updated Nov 20, 2025
Nextflow

privateai / pai-thin-client

A Python client for interacting with the Private AI API

redaction deidentification dlp gdpr hippa anonymization synthetic-data redact de-identification

Updated Oct 15, 2025
Python

icescentral / MASK_public

Masking identifiable information in health-related documents.

natural-language-processing named-entity-recognition anonymization de-identification

Updated Jun 23, 2022
Python

Clinacuity / CliniDeID

CliniDeID automatically de-identifies clinical text notes according to the HIPAA Safe Harbor method. It accurately finds identifiers and tags or replaces them with realistic surrogates for better anonymity.

machine-learning privacy-protection de-identification

Updated Aug 13, 2023
Java

KeeplerIO / de-identification-framework

Application of our De-identification Framework with open source technologies, enabling enterprises to take ownership of the de-identification process and deploy it in trusted environments.

data privacy-protection data-security pii de-identification datalake-ingestion

Updated Nov 15, 2021
Python

kosatnkn / veil

A data de-identification library written in Go

golang pii de-identification

Updated Jul 30, 2024
Go

omers / pii-anonymizer-api

PII anonymizer service built with Python and FastAPI

data-privacy phi anonymization pii de-identification fastapi healthdata pii-data pii-anonymization

Updated Oct 3, 2025
Python

privateai / pai-pre-commit-hook

A pre-commit hook to check for PII in your code.

github git redaction security privacy pre-commit pci compliance hipaa gdpr phi anonymization pii privacy-tools de-identification data-masking cpra

Updated May 5, 2023
Python

nikolamilosevic86 / NERo

Named entity recognition framework

natural-language-processing text-mining named-entity-recognition de-identification de-identify

Updated Jun 14, 2020
Python

LRTK-CODER / DIL-Project

Pseudonymization library

privacy python3 privacy-protection de-identification pseudonymizatoin

Updated Nov 7, 2021
Python

nedap / mdpi2021-textgen

Source code for the paper "Generating Synthetic Training Data for Supervised De-Identification of Electronic Health Records" in Future Internet (2021).

machine-learning natural-language-processing privacy electronic-health-records synthetic-data de-identification

Updated May 21, 2021
Python

SwissFederalArchives / tcc-metadata-anonymization

A named-entity-recognition (NER) based anonymizer for archival document metadata.

nlp machine-learning information-extraction named-entity-recognition logistic-regression ner conditional-random-fields anonymization multilayer-perceptron de-identification ensembling

Updated Jan 16, 2023
Jupyter Notebook
