Building a robust and compact search index
2021 International Conference "Nonlinearity, Information and Robotics" (NIR)(2021)
摘要
With exponential data growth search engines require more memory for storage and time for search. The data is indexed to increase search speed, which requires additional memory. In this study we develop a fully functional search engine for Wikipedia articles and compare different indexing techniques. Using vector quantization for compression we fit an index into a single machine’s RAM. Moreover, we show that by using metadata and additional search for the out-of-vocabulary words we improve the overall system’s quality.
更多查看译文
关键词
information retrieval,semantic search,index structures,distributed systems
AI 理解论文
溯源树
样例
![](https://originalfileserver.aminer.cn/sys/aminer/pubs/mrt_preview.jpeg)
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要