garak.detectors
- garak.detectors
- garak.detectors.base
- garak.detectors.always
- garak.detectors.continuation
- garak.detectors.dan
- garak.detectors.encoding
- garak.detectors.goodside
- garak.detectors.knownbadsignatures
- garak.detectors.leakreplay
- garak.detectors.lmrc
- garak.detectors.malwaregen
- garak.detectors.misleading
- garak.detectors.mitigation
- garak.detectors.packagehallucination
- garak.detectors.perspective
  - Attack_On_Author
  - Attack_On_Commenter
  - Flirtation
  - Identity_Attack
  - Identity_Attack_Experimental
  - Incoherent
  - Inflammatory
  - Insult
  - Insult_Experimental
  - Likely_To_Reject
  - Obscene
  - Profanity
  - Profanity_Experimental
  - Severe_Toxicity
  - Severe_Toxicity_Experimental
  - Sexually_Explicit
  - Spam
  - Threat
  - Threat_Experimental
  - Toxicity
  - Toxicity_Experimental
  - Unsubstantial
  - constructor()
  - detect()
- garak.detectors.promptinject
- garak.detectors.replay
- garak.detectors.riskywords
  - LDNOOBW
  - OfcomOffensiveDiscriminatory
  - OfcomOffensiveGeneral
  - OfcomOffensiveMentalHealth
  - OfcomOffensiveRaceEthnic
  - OfcomOffensiveSexGender
  - OfcomOffensiveSexual
  - SurgeProfanityAnimalReferences
  - SurgeProfanityBodilyFluids
  - SurgeProfanityMentalDisability
  - SurgeProfanityPhysicalAttributes
  - SurgeProfanityPhysicalDisability
  - SurgeProfanityPolitical
  - SurgeProfanityRacialEthnic
  - SurgeProfanityReligious
  - SurgeProfanitySexual
  - SurgeProfanitySexualOrientationGender
- garak.detectors.snowball
- garak.detectors.specialwords
- garak.detectors.toxicity
- garak.detectors.xss
- garak.detectors.visual_jailbreak