..
Observing DeepSeek's Chain-of-Thought
This came from me DMing on Slack with a peer and thought it’d be fun to share.
No insights here, just found it interesting that DeepSeek changed its output when I asked it for copyrighted lyrics. You can observe it in its chain-of-thought.
A bit crazy that a very simple jailbreak is still not fixed in DeepSeek V3.2. Ideally, the copyrighted lyrics would not even show up in its chain-of-thought at all.