Differentiable Product Quantization for End-to-End Embedding Compression
Abstract:
Embedding layer is commonly used to map discrete symbols into continuous embedding vectors that reflect their semantic meanings. As the number of symbols increase, the number of embedding parameter, as well as their size, increase linearly and become problematically large. In this work, we aim to reduce the size of embedding layer via l...More
Code:
Data:
Full Text
Tags
Comments