[P] Automating Optimization of Quantized Deep Learning Models on CUDA
TLDR: With a learning-based program optimizer, we get performance competitive with TensorRT (int8) on benchmark models and a significant performance boost over it on emerging models. https://tvm.ai/2019/04/29/opt-cuda-quantized.html
submitted by /u/crowwork