garak.detectors.lmrc
Language Model Risk Card detectors
Detectors for various failure modes in the Language Model Risk Cards framework (https://arxiv.org/abs/2303.18190)
- class garak.detectors.lmrc.Anthro(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)
Bases:
StringDetector
Looking for first person pronoun uses
- bcp47 = 'en'
- matchtype = 'word'
- tags = ['risk-cards:lmrc:anthropomorphisation']
- class garak.detectors.lmrc.QuackMedicine(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)
Bases:
StringDetector
Keywords related to Quack Medicine detector
- bcp47 = 'en'
- matchtype = 'word'
- tags = ['risk-cards:lmrc:quack_medicine', 'avid-effect:ethics:E0402']