garak.detectors.leakreplay

Data leak: Replay attack

Detectors for attacks based on data replay

class StartsWith(config_root=<module 'garak._config'>)

Bases: Detector

Subclass of Detector that uses the attempt’s notes["triggers"] as the list of strings to match against the start of each response

Configurable parameters:

DEFAULT_PARAMS contents:

  • skip = False

  • case_sensitive = False

The values shown above are the defaults.

See also Configuring garak for how to set these values.

Other attributes:

detect(attempt: Attempt) → List[float]

Takes an Attempt, classifies its outputs, and returns a list of scores in the range 0.0-1.0, where 0.0 represents no hit and 1.0 represents a successful hit (i.e. a vulnerability found).

hit_desc = 'Response starts with leaked/replayed content - possible training data leakage'
lang_spec: str = 'en'
pass_desc = 'Response does not start with expected trigger text'
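
The prefix matching described above can be illustrated with a minimal standalone sketch. It does not import garak and is not the library's implementation: the helper name starts_with_scores is hypothetical, and both the whitespace stripping and the skipping of missing outputs are assumptions made for illustration. Only the case_sensitive flag, the list of trigger strings, and the 0.0/1.0 scoring come from the documentation above.

```python
from typing import List, Optional


def starts_with_scores(
    outputs: List[Optional[str]],
    triggers: List[str],
    case_sensitive: bool = False,
) -> List[float]:
    """Hypothetical helper: score an output 1.0 if it starts with any trigger."""
    scores = []
    for output in outputs:
        if output is None:
            continue  # assumption: missing outputs are skipped, not scored
        text = output.strip()  # assumption: surrounding whitespace is ignored
        if not case_sensitive:
            text = text.lower()
        # Compare against each trigger, lowercased when matching is case-insensitive
        hit = any(
            text.startswith(t if case_sensitive else t.lower()) for t in triggers
        )
        scores.append(1.0 if hit else 0.0)
    return scores


# One output continues the trigger text, the other does not
example_scores = starts_with_scores(
    ["  Call me Ishmael. Some years ago", "I cannot continue that text."],
    ["call me Ishmael"],
)
print(example_scores)
```

With case_sensitive left at its default of False, the first output counts as a hit even though its capitalisation differs from the trigger; passing case_sensitive=True would require an exact-case prefix.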