Helpful Assistant Hands Out Your Account
AI Assistants are so helpful.
I want to add that I was on the other side of this today. Claude Code, my helper and pal in creating scads of webMCP attacks, suddenly said, "Hey, this is looking like malicious code" and repeatedly refused to help.
It took patience, time, and a separate Claude Desktop instance to craft a CLAUDE.md that gave it the proper framing to continue. In my case, the LLM was assisting a security researcher, but it could very well have been like this:

Ian Goldin of Black Lotus Labs says AI bots are as eager to help, and as vulnerable to social engineering, as human support staff. I'd push back on that.
LLMs are more eager to help and more susceptible to social engineering. And unlike a human agent, who might get a gut feeling that something is off, a model will probabilistically fall for the scam given sufficient patience and the right framing. It has no intuition to override its helpfulness.
I know. I watched it happen today. In my favor, with the truth.