Isn't this exactly how GANs work already?

jedberg · on March 15, 2023

Yes. But from I've seen no one has applied it to the latest Generative AIs.

dereg · on March 15, 2023

I’m pretty sure Anthropic’s Claude is doing that.

https://scale.com/blog/chatgpt-vs-claude

arrow7000 · on March 15, 2023

Maybe an adversarial approach was used in training these models in the first place?

sharemywin · on March 15, 2023

It was they were' trained using reinforcement learning with human feedback to create the critic.

jedberg · on March 15, 2023

I hadn't thought about human feedback being an adversarial system, but I guess that makes sense, since it's basically a classifier saying "you got this wrong".