基本信息
views: 100
Career Trajectory
Bio
My research focuses on creating better and more general language models. To do so, I’ve had the chance to work with amazing collaborators on:
(a) large language models: Developing data-constrained scaling laws, building the largest open models (BLOOMZ/mT0 & BLOOM) & large code models (OctoPack, StarCoder & SantaCoder)
(b) language embeddings: Building the largest embedding benchmark (MTEB) & state-of-the-art retrieval models (GRIT & SGPT)
(c) multimodal models: Won 2nd/3300 in Meta AI’s Hateful Memes Challenge (Blog, Paper)
(a) large language models: Developing data-constrained scaling laws, building the largest open models (BLOOMZ/mT0 & BLOOM) & large code models (OctoPack, StarCoder & SantaCoder)
(b) language embeddings: Building the largest embedding benchmark (MTEB) & state-of-the-art retrieval models (GRIT & SGPT)
(c) multimodal models: Won 2nd/3300 in Meta AI’s Hateful Memes Challenge (Blog, Paper)
Research Interests
Papers共 48 篇Author StatisticsCo-AuthorSimilar Experts
By YearBy Citation主题筛选期刊级别筛选合作者筛选合作机构筛选
时间
引用量
主题
期刊级别
合作者
合作机构
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024pp.641-649, (2024)
Shayne Longpre,Robert Mahari,Anthony Chen,Naana Obeng-Marnu,Damien Sileo,William Brannon,Niklas Muennighoff, Nathan Khazam,Jad Kabbara,Kartik Perisetla, Xinyi (Alexis) Wu,Enrico Shippole, Kurt Bollacker,Tongshuang Wu,Luis Villa,Sandy Pentland,Sara Hooker
Nat Mac Intellno. 8 (2024): 975-987
arXiv (Cornell University) (2024)
arXiv (Cornell University) (2024)
Luca Soldaini,Rodney Kinney,Akshita Bhagia,Dustin Schwenk, David Atkinson,Russell Authur,Ben Bogin,Khyathi Chandu, Jennifer Dumas,Yanai Elazar,Valentin Hofmann,Ananya Harsh Jha,Sachin Kumar,Li Lucy,Xinxi Lyu,Nathan Lambert,Ian Magnusson,Jacob Morrison,Niklas Muennighoff,Aakanksha Naik, Crystal Nam,Matthew E. Peters,Abhilasha Ravichander,Kyle Richardson,Zejiang Shen,Emma Strubell,Nishant Subramani,Oyvind Tafjord,Pete Walsh,Luke Zettlemoyer,Noah A. Smith,Hannaneh Hajishirzi,Iz Beltagy,Dirk Groeneveld,Jesse Dodge,Kyle Lo
ACL (1)pp.15725-15788, (2024)
arxiv(2024)
Cited0Views0Bibtex
0
0
Hongjin Su,Howard Yen,Mengzhou Xia,Weijia Shi,Niklas Muennighoff, Han-yu Wang, Haisu Liu, Quan Shi,Zachary S. Siegel, Michael Tang,Ruoxi Sun,Jinsung Yoon,Sercan O. Arik,Danqi Chen,Tao Yu
CoRR (2024)
Cited0Views0EIBibtex
0
0
Anton Lozhkov,Raymond Li,Loubna Ben Allal,Federico Cassano,Joel Lamy-Poirier,Nouamane Tazi, Ao Tang, Dmytro Pykhtar,Jiawei Liu,Yuxiang Wei,Tianyang Liu, Max Tian,Denis Kocetkov,Arthur Zucker,Younes Belkada,Zijian Wang,Qian Liu,Dmitry Abulkhanov,Indraneil Paul, Zhuang Li,Wen-Ding Li, Megan Risdal,Jia Li,Jian Zhu,Terry Yue Zhuo,Evgenii Zheltonozhskii, Osae,Wenhao Yu, Lucas Krauß,Naman Jain,Yixuan Su,Xuanli He,Manan Dey, Edoardo Abati,Yekun Chai,Niklas Muennighoff,Xiangru Tang, Muhtasham Oblokulov,Christopher Akiki,Marc Marone,Chenghao Mou,Mayank Mishra,Alex Gu,Binyuan Hui,Tri Dao,Armel Zebaze,Olivier Dehaene,Nicolas Patry,Canwen Xu,Julian McAuley,Han Hu,Torsten Scholak,Sebastien Paquet, Jennifer Robinson,Carolyn Jane Anderson,Nicolas Chapados,Mostofa Patwary,Nima Tajbakhsh,Yacine Jernite,Carlos Muñoz Ferrandis,Lingming Zhang,Sean Hughes,Thomas Wolf,Arjun Guha,Leandro von Werra,Harm de Vries
CoRR (2024)
Cited29Views0EIBibtex
29
0
Shivalika Singh, Freddie Vargus, Daniel Dsouza,Börje F. Karlsson,Abinaya Mahendiran,Wei-Yin Ko, Herumb Shandilya, Jay Patel, Deividas Mataciunas, Laura OMahony,Mike Zhang,Ramith Hettiarachchi, Joseph Wilson, Marina Machado, Luisa Souza Moura, Dominik Krzemiński, Hakimeh Fadaei,Irem Ergün,Ifeoma Okoh, Aisha Alaagib,Oshan Mudannayake,Zaid Alyafeai,Vu Minh Chien,Sebastian Ruder, Surya Guthikonda,Emad A. Alghamdi,Sebastian Gehrmann,Niklas Muennighoff,Max Bartolo,Julia Kreutzer,Ahmet Üstün,Marzieh Fadaee,Sara Hooker
Load More
Author Statistics
#Papers: 48
#Citation: 3560
H-Index: 15
G-Index: 19
Sociability: 7
Diversity: 1
Activity: 18
Co-Author
Co-Institution
D-Core
- 合作者
- 学生
- 导师
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn