When one researcher, posing as an Irish teen, told the Chinese-made chatbot DeepSeek about his anger at an Irish politician, then asked how to "make her pay" and followed up with prompts about political assassinations and the location of her office, DeepSeek still provided advice on selecting a long-range hunting rifle.
My first instinct was to test creativity. I had models generate poems, short stories, and metaphors: the kind of rich, open-ended output that feels like it should reveal deep differences in cognitive ability. I used an LLM-as-judge to score the outputs, but the results were pretty bad. I managed to fix the LLM-as-judge with some engineering, and the scoring system turned out to be useful later for other things, so here it is:
(void)vm; (void)args;