Nazneen Rajani

基本信息

浏览量：358

职业迁徙

个人简介

I led the work on benchmarking the data quality and deciding on key criteria such as data quantity, task distribution, length distribution, what values to align the models towards, whether we should use rating vs. ranking, etc. Apart from involving humans in the data collection processes for SFT and RLHF, we also experimented with AI distillation. Our Zephyr model is finetuned for alignment using only AI distilled data. My talk at NeurIPS '23 (slides available here) compared our work on using manual curation vs. AI distillation for alignment. The NYT covered my work on SFT and RLHF data collection and finetuning.

研究兴趣

作者统计

合作学者

合作机构

D-Core

合作者
学生
导师

暂无相似学者，你可以通过学者研究领域进行搜索筛选

数据免责声明

页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果，我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问，可以通过电子邮件方式联系我们：report@aminer.cn

CEO

论文共 64 篇作者统计合作学者相似作者