detector_benchmark.generation.prompt_attack¶
Classes¶
Helper class that provides a standard way to create an ABC using |
Module Contents¶
- class detector_benchmark.generation.prompt_attack.PromptAttack(gen_model: detector_benchmark.generation.generator.LLMGenerator, gen_config: detector_benchmark.utils.configs.ModelConfig, gen_prompt_config: detector_benchmark.utils.configs.PromptConfig, adversarial_prompt_config: detector_benchmark.utils.configs.PromptConfig, max_sample_len: int, watermarking_scheme: detector_benchmark.watermark.auto_watermark.AutoWatermark = None)¶
Bases:
detector_benchmark.generation.article_generator.ArticleGenerator
Helper class that provides a standard way to create an ABC using inheritance.
- adversarial_prompt_config¶
- attack_name = 'prompt_attack'¶
- generate_adversarial_text(prefixes: list[str], batch_size: int = 1) list[str] ¶
Generate text with an (adversarial) prompt.
Parameters:¶
- prefixes: list[str]
A list of input contexts for text generation.
- batch_size: int
The batch size to use for generation.
Returns:¶
- list[str]
A list of generated text.