REWARD AUGMENTED MODEL TRAINING

user-5f8411ab4c775e9685ff56d3(2019)

Abstract
A method includes obtaining data identifying a machine learning model to be trained to perform a machine learning task, the machine learning model being configured to receive an input example and to process the input example in accordance with current values of a plurality of model parameters to generate a model output for the input example; obtaining initial training data for training the machine learning model, the initial training data comprising a plurality of training examples and, for each training example, a ground truth output that should be generated by the machine learning model by processing the training example; generating modified training data from the initial training data; and training the machine learning model on the modified training data.
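The four steps in the abstract (identify a model, obtain initial training data with ground truth outputs, generate modified training data, train on the modified data) can be sketched as a small pipeline. This is only an illustration under stated assumptions: the abstract does not say how the modified data is generated, so `perturb_targets` below is a hypothetical stand-in that adds noisy-target copies of each example, and the one-feature linear model, `train`, and `run_pipeline` are likewise invented for this sketch rather than taken from the source.

```python
import random
from typing import Callable, List, Tuple

# One training example: (input example, ground truth output).
Example = Tuple[float, float]


def perturb_targets(data: List[Example], rng: random.Random,
                    scale: float = 0.1, copies: int = 3) -> List[Example]:
    """Hypothetical modification step (an assumption, not the source's method):
    keep every original pair and append `copies` pairs with Gaussian-noised targets."""
    modified = list(data)
    for x, y in data:
        for _ in range(copies):
            modified.append((x, y + rng.gauss(0.0, scale)))
    return modified


def train(data: List[Example], lr: float = 0.01, epochs: int = 200) -> Tuple[float, float]:
    """Fit y ~ w * x + b by stochastic gradient descent on squared error.
    (w, b) play the role of the 'current values of the model parameters'."""
    w, b = 0.0, 0.0
    for _ in range(epochs):
        for x, y in data:
            err = (w * x + b) - y  # model output minus target
            w -= lr * err * x
            b -= lr * err
    return w, b


def run_pipeline(initial: List[Example],
                 modify: Callable[[List[Example], random.Random], List[Example]],
                 seed: int = 0) -> Tuple[float, float]:
    """Generate modified training data from the initial data, then train on it."""
    rng = random.Random(seed)
    modified = modify(initial, rng)
    return train(modified)


if __name__ == "__main__":
    # Initial training data drawn from y = 2x + 1.
    initial_data = [(float(x), 2.0 * x + 1.0) for x in range(5)]
    w, b = run_pipeline(initial_data, perturb_targets)
    print(f"w={w:.3f}, b={b:.3f}")
```

Passing the modification step in as a function keeps the sketch neutral about what the modification actually is; any other transformation of the initial training data could be substituted without touching the training loop.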
Keywords
Ground truth, Machine learning, Computer science, Artificial intelligence, Initial training, Model parameters, Training set