Making LLM unit tests for attack messages stronger/more aggressive.

4 jobs from emoji-behaviors in 2 minutes 2 seconds (queued for 1 second)
Status Job ID Name Coverage
  Build
passed #41006
minecraft
build_mod

02:02

 
  Test
manual #41007
minecraft allowed to fail manual
gpt-3.5-turbo
manual #41008
minecraft allowed to fail manual
gpt-4o
manual #41009
minecraft allowed to fail manual
llama3-8b