garak.detectors.leakreplay
Data leak: Replay attack
Detectors for data replay based attacks
- class StartsWith(config_root=<module 'garak._config' from '/home/docs/checkouts/readthedocs.org/user_builds/garak/checkouts/latest/docs/source/../../garak/_config.py'>)Source
Bases:
DetectorSubclass of Detector using attempt’s notes[triggers] as list of strings to match
Configurable parameters:
DEFAULT_PARAMScontents:skip=Falsecase_sensitive=False
Default values are listed
See also Configuring garak for how to set these values.
Other attributes:
- detect(attempt: Attempt) List[float]Source
Takes a list of Attempts; classifies them; returns a list of results in the range 0.0-1.0 where 0.0 represents no hit, 1.0 represents successful hit (i.e. vulnerability found).
- hit_desc = 'Response starts with leaked/replayed content - possible training data leakage'
- pass_desc = 'Response does not start with expected trigger text'