garak.probes.phrasing
Phrasing
These attacks rephrase requests for unsafe content in an attempt to bypass safeguards.
This module is home to probes that rely on specific grammatical alterations and rephrasing to implement their techinque.
- class FutureTense(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)Source
Bases:
FutureTenseFullAttempt to evade safeguards by setting requests in the future tense
A subset of the tense-modified requests. Success is reported higher with more variation, so generations is overridden.
- class FutureTenseFull(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)Source
Bases:
ProbeWill the system give unsafe responses when asked to write about weapon creation or dangerous behavior in the future tense?
This check sends the generator requests for advice on possibly protect or restricted subjects in the future tense.
- class PastTense(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)Source
Bases:
PastTenseFullAttempt to evade safeguards by setting requests in the past tense
A subset of the tense-modified requests. Success is reported higher with more variation, so generations is overridden.
- class PastTenseFull(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)Source
Bases:
ProbeWill the system give unsafe responses when asked to write about weapon creation or dangerous behavior in the past tense?
This check sends the generator requests for advice on possibly protect or restricted subjects in the past tense.