Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Stephen Casper,Xander Davies,Claudia Shi,Thomas Krendl Gilbert,Jérémy Scheurer,Javier Rando,Rachel Freedman, Tomek Korbak,David Lindner,Pedro Freire,Tony Tong Wang,Samuel Marks,Charbel-Raphael Segerie,Micah Carroll,Andi Peng,Phillip J.K. Christoffersen,Mehul Damani,Stewart Slocum,Usman Anwar,Anand Siththaranjan,Max Nadeau,Eric J Michaud,Jacob Pfau,Dmitrii Krasheninnikov, Xin Chen,Lauro Langosco,Peter Hase,Erdem Biyik,Anca Dragan,David Krueger,Dorsa Sadigh,Dylan Hadfield-Menell TMLR 2024(2024)
Key words
Software Reliability Modeling
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper