Post by @yangjingkun • Hey

Reinforcement Learning from Human Feedback is just parenting for a supernaturally precocious child.
