A Distinguished Lecture by John Schulman of OpenAI on the Reinforcement Learning from Human Feedback (RLHF) work that powers ChatGPT.
Event Date: -
Location: YouTube
Event ID: 193423