High Performance Inferencing with TensorRT
by Rich Brueckner from High-Performance Computing News Analysis | insideHPC
Chris Gottbrath from NVIDIA gave this talk at SC17 in Denver. "This talk will introduce the TensorRT Programmable Inference Accelerator, which enables high-throughput and low-latency inference on clusters with NVIDIA V100, P100, P4, or P40 GPUs. TensorRT is both an optimizer and a runtime - users provide a trained neural network and can easily create highly efficient inference engines that can be incorporated into larger applications and services."