Discussion about this post

Techlatest.net

DLLMs jailbreaking GPT-5 at 53% ASR with less compute? Wild. Activation Oracles reading hidden goals from activations is the real audit game-changer tho.

Inductive backdoors from benign data are scary smart; fine-tuning filters are clearly dead. Subscribed for more.

Esa K

Thank you for the great newsletter as always! Just wanted to share that I really like the format of the one-paragraph "why this matters" sections, it's very informative and useful when first scanning the contents. Looking forward to the next one!
