Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment

Mingzhi Wang, Chengdong Ma, Qizhi Chen, Linjian Meng, Yang Han, Jiancong Xiao, Zhaowei Zhang, Jing Huo, Weijie J. Su, Yaodong Yang

CoRR (2024)
