基本信息
views: 52
![](https://originalfileserver.aminer.cn/sys/aminer/icon/show-trajectory.png)
Bio
In model-based RL, my work provided a novel insight into probabilistic environment model ensemble, which is commonly used in model-based RL algorithm. Based on the insight, we can substitute the ensemble with a single model and Lipschitz regularized value function to make the learning algorithm much more computationally efficient. For transfer-RL, I have worked on transferring domain knowledge under the drastic change of observation spaces (e.g., from vector-based observation to image-based observation). For adversarial RL, I have worked on observation attacks for the deep RL policy, as well as designing efficient algorithms to improve the agent’s robustness under attack. In addition, I have also worked on communication attacks in multi-agent reinforcement learning and developed a certifiable defense mechanism.
Research Interests
Papers共 34 篇Author StatisticsCo-AuthorSimilar Experts
By YearBy Citation主题筛选期刊级别筛选合作者筛选合作机构筛选
时间
引用量
主题
期刊级别
合作者
合作机构
Ruijie Zheng,Yongyuan Liang,Xiyao Wang,Shuang Ma,Hal Daumé,Huazhe Xu,John Langford, Praveen Palanisamy, Kalyan Basu,Furong Huang
Cited0Views0EIBibtex
0
0
Tianying Ji,Yongyuan Liang,Yan Zeng,Yu Luo, Guowei Xu, Jiawei Guo,Ruijie Zheng,Furong Huang,Fuchun Sun,Huazhe Xu
ICML 2024 (2024)
Cited0Views0EIBibtex
0
0
CoRR (2023)
Guowei Xu,Ruijie Zheng,Yongyuan Liang,Xiyao Wang,Zhecheng Yuan,Tianying Ji,Yu Luo,Xiaoyu Liu, Jiaxin Yuan,Pu Hua, Shuzhen Li,Yanjie Ze,
ICLR 2024 (2023)
ICLR 2023 (2023)
Cited9Views0EIBibtex
9
0
ICLR 2023 (2023)
ICLR 2024 (2023)
Load More
Author Statistics
Co-Author
Co-Institution
D-Core
- 合作者
- 学生
- 导师
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn