Speeds deep learning inference by 5x compared to TensorFlow on NVIDIA GPUs
MathWorks announced that MATLAB now offers NVIDIA TensorRT integration through GPU Coder. This helps engineers and scientists develop new AI and deep learning models in MATLAB with the performance and efficiency needed to meet the growing demands of data centers, embedded, and automotive applications.
MATLAB provides a complete workflow to rapidly train, validate, and deploy deep learning models. Engineers can use GPU resources without additional programming so they can focus on their applications rather than performance tuning. The new integration of NVIDIA TensorRT with GPU Coder enables deep learning models developed in MATLAB to run on NVIDIA GPUs with high-throughput and low-latency. Internal benchmarks show that MATLAB-generated CUDA code combined with TensorRT can deploy Alexnet with 5x better performance than TensorFlow and can deploy VGG-16 with 1.25x better performance than TensorFlow for deep learning inference.