Showing Articles About
spikee
spikee
Thomas Cross
Donato Capitella A step-by-step guide to using the rejudging functionality within the open-source tool Spikee. This new functionality allows testers to re-perform judging on existing results datasets. Primarily this allows for LLM applications to be assessed within restrictive environments, where a tester does not have access to an LLM for judging.
Jordan Watson We evaluate the ability of LLMs to understand text with random noise, and examine how prompts with varying levels of noise could bypass LLM guardrails.
Donato Capitella A step-by-step guide using the open-source tool spikee (v0.2) for prompt injection testing in LLM applications. Explores a webmail summarization case study, covering custom dataset creation, testing with Burp Suite and spikee's custom targets, interpreting results, and noting key updates from v0.1 to v0.2 like the Judge system and dynamic attacks.