谷歌浏览器插件
订阅小程序
在清言上使用

Lossless Parallel Implementation of a Turbo Decoder on GPU.

HiPC(2018)

引用 5|浏览27
暂无评分
摘要
Turbo decoders use the recursive BCJR algorithm which is computationally intensive and hard to parallelise. The branch metric and extrinsic log-likelihood ratio computations are easily parallelisable, but the forward and backward metric computation is not parallelisable without compromising bit error rate. This paper proposes a lossless parallelisation technique for Turbo decoders on Graphics Processing Units (GPU). The recursive forward and backward metric computation is formulated as prefix (scan) matrix multiplication problem which is computed on the GPU using parallel prefix sum computation technique. Overall, this method achieves a throughput of 73 Mbps for a 3GPP LTE compliant turbo decoder without any BER loss and latency as low as 61 μs.
更多
查看译文
关键词
Decoding,Parallel processing,Graphics processing units,Measurement,Bit error rate,Turbo codes,3GPP
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要