Getting started

Using pyCANON is quite straightforward.

Install

We recommend using pip for installing pyCANON inside a virtualenv:

virtualenv .venv
source .venv/bin/activate
pip install pycanon

Installing with support for PDF reports: if you want to generate PDF reports with ReportLab, you need to issue the following install command:

pip install "pycanon[PDF]"
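
To confirm that the optional PDF dependency is present in the active environment, a quick check like the one below can help. This is only a sketch: it assumes the PDF extra pulls in ReportLab under its usual import name `reportlab`, and the helper `has_module` is a hypothetical name introduced here, not part of pyCANON.

```python
import importlib.util


def has_module(name):
    """Return True if a top-level module called `name` can be found."""
    return importlib.util.find_spec(name) is not None


# Assumption: PDF report support depends on the `reportlab` package.
pdf_support = has_module("reportlab")
print("ReportLab available for PDF reports:", pdf_support)
```

If this prints `False`, re-running the install command with the PDF extra inside the same virtualenv is the likely fix.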

If you want to use the latest development version, you can use:

virtualenv .venv
source .venv/bin/activate
git clone https://gitlab.ifca.es/privacy-security/pycanon/
pip install ./pycanon

Example usage

You can use pyCANON through the command line or via its Python API.

Command line

Example with the adult dataset:

$ pycanon k-anonymity --qi age --qi education --qi occupation --qi relationship --qi sex --qi native-country adult.csv

$ pycanon report --sa salary-class --qi age --qi education --qi occupation --qi relationship --qi sex --qi native-country adult.csv
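
The commands above repeat the `--qi` flag once per quasi-identifier. When the column list already lives in a script, the same command line can be assembled programmatically. The sketch below only builds and prints the command; `build_k_anonymity_command` is a hypothetical helper, and actually running the result assumes `pycanon` is on the PATH and `adult.csv` exists in the working directory.

```python
import shlex


def build_k_anonymity_command(csv_path, quasi_identifiers):
    """Build the argument list for `pycanon k-anonymity`, one --qi per column."""
    cmd = ["pycanon", "k-anonymity"]
    for qi in quasi_identifiers:
        cmd += ["--qi", qi]
    cmd.append(csv_path)
    return cmd


QI = ["age", "education", "occupation", "relationship", "sex", "native-country"]
cmd = build_k_anonymity_command("adult.csv", QI)
print(shlex.join(cmd))

# To execute it (assumes pycanon is installed and adult.csv is present):
# import subprocess
# subprocess.run(cmd, check=True)
```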

Python API

Example with the adult dataset:

import pandas as pd

from pycanon import anonymity, report

FILE_NAME = "adult.csv"
QI = ["age", "education", "occupation", "relationship", "sex", "native-country"]
SA = ["salary-class"]
DATA = pd.read_csv(FILE_NAME)

# Calculate k for k-anonymity:
k = anonymity.k_anonymity(DATA, QI)

# Print the anonymity report:
report.print_report(DATA, QI, SA)
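
To make the value of k easier to interpret, here is a minimal standalone sketch of the general k-anonymity definition: k is the size of the smallest group of records that share the same values for all quasi-identifiers. This is not pyCANON's implementation; the function `k_anonymity_sketch` and the toy records are hypothetical and are not taken from adult.csv.

```python
from collections import Counter


def k_anonymity_sketch(records, quasi_identifiers):
    """Return the size of the smallest group sharing the same QI values."""
    if not records:
        raise ValueError("records must not be empty")
    # Count how many records fall into each combination of QI values.
    groups = Counter(
        tuple(row[qi] for qi in quasi_identifiers) for row in records
    )
    return min(groups.values())


# Hypothetical toy data for illustration only.
toy = [
    {"age": "30-39", "sex": "Female", "salary-class": "<=50K"},
    {"age": "30-39", "sex": "Female", "salary-class": ">50K"},
    {"age": "40-49", "sex": "Male", "salary-class": "<=50K"},
    {"age": "40-49", "sex": "Male", "salary-class": "<=50K"},
    {"age": "40-49", "sex": "Male", "salary-class": ">50K"},
]

k = k_anonymity_sketch(toy, ["age", "sex"])
print(k)  # the smallest group (30-39, Female) has 2 records
```

Adding more quasi-identifiers can only keep the groups the same or split them further, so k never increases as the QI list grows.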