DeepLearningExamples/PyTorch/Translation/GNMT/scripts/tests/reference_inference_performance
Przemek Strzelczyk a644350589 Updating models and adding BERT/PyT
Tacotron2+Waveglow/PyT
* AMP support
* Data preprocessing for Tacotron 2 training
* Fixed dropouts on LSTMCells

SSD/PyT
* script and notebook for inference
* AMP support
* README update
* updates to examples/*

BERT/PyT
* initial release

GNMT/PyT
* Default container updated to NGC PyTorch 19.05-py3
* Mixed precision training implemented using APEX AMP
* Added inference throughput and latency results on NVIDIA Tesla V100 16G
* Added option to run inference on user-provided raw input text from command line

NCF/PyT
* Updated performance tables.
* Default container changed to PyTorch 19.06-py3.
* Caching validation negatives between runs

Transformer/PyT
* new README
* jit support added

UNet Medical/TF
* inference example scripts added
* inference benchmark measuring latency added
* TRT/TF-TRT support added
* README updated

GNMT/TF
* Performance improvements

Small updates (mostly README) for other models.
2019-07-16 21:13:08 +02:00

6 lines
225 B
Text

fp16,128,5,Tesla V100-SXM2-16GB,18740
fp32,128,5,Tesla V100-SXM2-16GB,8610
fp16,128,5,Tesla V100-SXM2-32GB,17800
fp32,128,5,Tesla V100-SXM2-32GB,8180
fp16,128,5,Tesla V100-SXM3-32GB,20550
fp32,128,5,Tesla V100-SXM3-32GB,9810