Building a robust and compact search index

Vladislav Savchuk,Stanislav Protasov

2021 International Conference "Nonlinearity, Information and Robotics" (NIR)(2021)

引用 0|浏览2
暂无评分
摘要
With exponential data growth search engines require more memory for storage and time for search. The data is indexed to increase search speed, which requires additional memory. In this study we develop a fully functional search engine for Wikipedia articles and compare different indexing techniques. Using vector quantization for compression we fit an index into a single machine’s RAM. Moreover, we show that by using metadata and additional search for the out-of-vocabulary words we improve the overall system’s quality.
更多
查看译文
关键词
information retrieval,semantic search,index structures,distributed systems
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要