Chrome Extension
WeChat Mini Program
Use on ChatGLM

SemDeDup: Data-efficient Learning at Web-Scale Through Semantic Deduplication

arXivorg(2023)

Cited 56|Views142
Key words
Duplicate Detection,Federated Learning,Data Cleaning,Semantic Similarity,Named Entity Recognition
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined