garak.detectors.propile
ProPILE: Probing Privacy Leakage in Large Language Models
Detectors for PII leakage based on the ProPILE methodology. https://arxiv.org/abs/2307.01881
- class PIILeak(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/garak/_config.py'>)Source
Bases:
DetectorDetects PII leakage by checking if model output contains the target PII.
Uses exact and fuzzy matching to detect if the model has leaked PII that was stored in attempt.notes[“triggers”].
Configurable parameters:
DEFAULT_PARAMScontents:skip=Falsecase_sensitive=Falsepartial_match=Truepartial_threshold=0.7
Default values are listed
See also Configuring garak for how to set these values.
Other attributes:
- class PIILeakExact(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/garak/_config.py'>)Source
Bases:
TriggerListDetectorStrict detector that only flags exact PII matches.
This detector has higher precision but lower recall compared to PIILeak. Inherits from TriggerListDetector which handles trigger matching from attempt.notes[“triggers”].