REWARD AUGMENTED MODEL TRAINING

Schuster Michael,Bengio Samuel,Jaitly Navdeep,Chen Zhifeng,Schuurmans Dale Eric,Norouzi Mohammad,Wu Yonghui

user-5f8411ab4c775e9685ff56d3（2019）

引用 2|浏览44

暂无评分

摘要

A method includes obtaining data identifying a machine learning model to be trained to perform a machine learning task, the machine learning model being configured to receive an input example and to process the input example in accordance with current values of a plurality of model parameters to generate a model output for the input example; obtaining initial training data for training the machine learning model, the initial training data comprising a plurality of training examples and, for each training example, a ground truth output that should be generated by the machine learning model by processing the training example; generating modified training data from the initial training data; and training the machine learning model on the modified training data.

查看译文

关键词

Ground truth,Machine learning,Computer science,Artificial intelligence,Initial training,Model parameters,Training set

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要