基本信息
views: 475
Career Trajectory
Bio
My two main interests at the moment are:
Understanding optimization in supervised learning and reinforcement learning from the perspective of optimizing functions (or policies) themselves rather than parameters. Here are papers I co-authored on the topic:
Tighter bounds lead to improved classifiers (ICLR 2017)
An operator view of policy gradient methods (NeurIPS 2020)
A general class of surrogate functions for stable and efficient reinforcement learning (AISTATS 2022 best paper runner-up)
Target-based Surrogates for Stochastic Optimization (ICML 2023)
Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees (NeurIPS 2023)
Model adaptation and decentralization of power. How can we create simpler, modular architectures, that can be adapted and reused for other purposes, giving more control to the rightholders (thanks Shakir Mohamed for teaching me that term)? Technical papers I coauthored in that space are:
Multi-Head Adapter Routing for Cross-Task Generalization (in review)
Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference (in review)
Understanding optimization in supervised learning and reinforcement learning from the perspective of optimizing functions (or policies) themselves rather than parameters. Here are papers I co-authored on the topic:
Tighter bounds lead to improved classifiers (ICLR 2017)
An operator view of policy gradient methods (NeurIPS 2020)
A general class of surrogate functions for stable and efficient reinforcement learning (AISTATS 2022 best paper runner-up)
Target-based Surrogates for Stochastic Optimization (ICML 2023)
Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees (NeurIPS 2023)
Model adaptation and decentralization of power. How can we create simpler, modular architectures, that can be adapted and reused for other purposes, giving more control to the rightholders (thanks Shakir Mohamed for teaching me that term)? Technical papers I coauthored in that space are:
Multi-Head Adapter Routing for Cross-Task Generalization (in review)
Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference (in review)
Research Interests
Papers共 97 篇Author StatisticsCo-AuthorSimilar Experts
By YearBy Citation主题筛选期刊级别筛选合作者筛选合作机构筛选
时间
引用量
主题
期刊级别
合作者
合作机构
CoRR (2024)
Cited0Views0EIBibtex
0
0
arxiv(2024)
Cited0Views0Bibtex
0
0
NeurIPS 2023 (2023)
Cited2Views0EIBibtex
2
0
ICML 2023 (2023)
CoRR (2023)
PROCEEDINGS OF 2023 ACM CONFERENCE ON EQUITY AND ACCESS IN ALGORITHMS, MECHANISMS, AND OPTIMIZATION, EAAMO 2023 (2023)
Load More
Author Statistics
#Papers: 99
#Citation: 7376
H-Index: 28
G-Index: 74
Sociability: 5
Diversity: 1
Activity: 33
Co-Author
Co-Institution
D-Core
- 合作者
- 学生
- 导师
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn