
Some Things Are More CRINGE Than Others: Preference Optimization with the Pairwise Cringe Loss

CoRR (2023)

Keywords: Language Modeling, Topic Modeling, Reinforcement Learning, Pretrained Models