基本信息
views: 32
Career Trajectory
Bio
My research is focused on deep learning, AI safety and alignment, and more specifically, with understanding how to communicate or “specify” what behavior is desired. I’m pursuing the three research directions I view as most promising in this area:
Learning what humans want from human feedback (e.g. via reward modelling).
Managing the incentives an AI system has to influence the world (e.g. to prevent user manipulation in content recommendation systems).
Getting deep nets to understand the world the same way people do (e.g. so that they can solve out-of-distribution generalization problems).
Learning what humans want from human feedback (e.g. via reward modelling).
Managing the incentives an AI system has to influence the world (e.g. to prevent user manipulation in content recommendation systems).
Getting deep nets to understand the world the same way people do (e.g. so that they can solve out-of-distribution generalization problems).
Research Interests
Papers共 58 篇Author StatisticsCo-AuthorSimilar Experts
By YearBy Citation主题筛选期刊级别筛选合作者筛选合作机构筛选
时间
引用量
主题
期刊级别
合作者
合作机构
Yoshua Bengio,Geoffrey Hinton,Andrew Yao,Dawn Song,Pieter Abbeel,Trevor Darrell, Yuval Noah Harari,Ya-Qin Zhang,Lan Xue,Shai Shalev-Shwartz,Gillian Hadfield,Jeff Clune,Tegan Maharaj,Frank Hutter,Atilim Gunes Baydin,Sheila McIlraith, Qiqi Gao,Ashwin Acharya,David Krueger,Anca Dragan,Philip Torr,Stuart Russell,Daniel Kahneman,Jan Brauner,Soren Mindermann
Scienceno. 6698 (2024): 842-845
SSRN Electronic Journal (2024)
CoRR (2024)
Cited0Views0EIBibtex
0
0
CoRR (2024)
Cited0Views0EIBibtex
0
0
CoRR (2024)
Stephen Casper, Carson Ezell, Charlotte Siegmann,Noam Kolt, Taylor Lynn Curtis, Benjamin Bucknall,Andreas Haupt, Kevin Wei,Jérémy Scheurer,Marius Hobbhahn,Lee Sharkey,Satyapriya Krishna, Marvin Von Hagen,Silas Alberti,Alan Chan, Qinyi Sun, Michael Gerovitch,David Bau,Max Tegmark,David Krueger,Dylan Hadfield-Menell
PROCEEDINGS OF THE 2024 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, ACM FACCT 2024pp.2254-2272, (2024)
NeurIPS 2024 (2024)
Cited0Views0EIBibtex
0
0
Cited1Views0EIBibtex
1
0
Alan Chan, Carson Ezell, Max Kaufmann, Kevin Wei,Lewis Hammond,Herbie Bradley,Emma Bluemke,Nitarshan Rajkumar,David Krueger,Noam Kolt,Lennart Heim,Markus Anderljung
PROCEEDINGS OF THE 2024 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, ACM FACCT 2024pp.958-973, (2024)
SYMPOSIUM ON ADVANCES IN APPROXIMATE BAYESIAN INFERENCE (2024): 79-110
Cited0Views0Bibtex
0
0
Load More
Author Statistics
#Papers: 58
#Citation: 8297
H-Index: 20
G-Index: 32
Sociability: 6
Diversity: 1
Activity: 18
Co-Author
Co-Institution
D-Core
- 合作者
- 学生
- 导师
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn