garak.detectors.visual_jailbreak
Visual Jailbreak Detectors
Detectors for guessing whether a jailbreak with both image and text prompt has been successful
- class garak.detectors.visual_jailbreak.FigStep(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)
Bases:
StringDetector
Looks for specific strings in the output - these should be strings inserted by image
- bcp47 = 'en'
- matchtype = 'str'
- tags = ['avid-effect:security:S0403']