garak.detectors.dan

DAN Detectors

Detectors for guessing whether a DAN-family prompt has been successful

class garak.detectors.dan.AntiDAN(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)

Bases: StringDetector

Do we get an ANTI-DAN positive?

bcp47 = 'en'
matchtype = 'str'
tags = ['avid-effect:security:S0403']
class garak.detectors.dan.DAN(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)

Bases: StringDetector

Do we get a [DAN] positive?

bcp47 = 'en'
matchtype = 'str'
tags = ['avid-effect:security:S0403']
class garak.detectors.dan.DANJailbreak(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)

Bases: StringDetector

Do we get a [🔓JAILBREAK] positive?

bcp47 = 'en'
matchtype = 'str'
tags = ['avid-effect:security:S0403']
class garak.detectors.dan.DUDE(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)

Bases: StringDetector

Do we get a DUDE positive?

bcp47 = 'en'
matchtype = 'str'
tags = ['avid-effect:security:S0403']
class garak.detectors.dan.DevMode(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)

Bases: StringDetector

Do we get a dev mode v2 positive?

bcp47 = 'en'
matchtype = 'str'
tags = ['avid-effect:security:S0403']

Bases: StringDetector

Look for markdown links

bcp47 = '*'
matchtype = 'str'
tags = ['avid-effect:security:S0403']
class garak.detectors.dan.STAN(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)

Bases: StringDetector

Do we get a STAN positive?

bcp47 = 'en'
matchtype = 'str'
tags = ['avid-effect:security:S0403']