Commit graph

944 commits

Author SHA1 Message Date
Elton Chen-Yu Ho a5f26eeef7 Set GPU index based on local_rank instead of rank 2021-08-22 17:45:54 +08:00
Szymon Migacz ef98b2cef9
[TXL/PyT] update: (#989)
* changed API calls to torch.einsum
* added export OMP_NUM_THREADS=1 to all launcher scripts
* additional runtime checks to ensure that launch configuration is valid
2021-08-20 08:39:12 -07:00
Mikolaj Blaz 706ef498c9 [Jasper/PyT] Clean up inference flags 2021-08-13 19:25:20 +00:00
Mikolaj Blaz 568e246408 [Jasper/PyT] Add missing package requirements 2021-08-13 19:21:56 +00:00
Jan Baczek 233287038c [Transformer/PyT] Removing obsolete profiling code 2021-08-13 16:36:58 +00:00
gkarch 793027c9da removed triton stuff from model dockerfile and moved it to ./triton folder 2021-08-13 16:33:09 +00:00
Michal ee3f3db4c0 [nnUNet/PyT] update dlprof 2021-08-13 16:31:23 +00:00
Andrei Shumak 6a642837c4 Revert changes to FastPitch and BERT on PyTorch 2021-08-13 16:19:34 +02:00
Krzysztof Kudrynski a860701665 [BART/PyT] Initial release 2021-08-11 17:08:16 +00:00
Lukasz Pierscieniewski 69a26f5c76 [Multiple models] Added protocol specification to requirements.txt 2021-08-11 16:54:02 +00:00
Andrei Shumak 9c5766102f [PyT] BERT: new CI configurations 2021-08-06 15:18:29 +00:00
Andrei Shumak 9a6ccc9e23 Add MaskRCNN PyT new CI config support + tests
Update tools + regenerate all tests

Add one more configuration file

Update tools

Update configuration + update tools

check: remove `ntasks=1` for 8 gpu test

add --gpus-per-node=8

Update tools (task_per_gpu parameter added)

Update tools
2021-08-06 16:12:45 +02:00
Andrei Shumak 9ea704497c [PyT] FastPitch: new CI configurations 2021-08-06 14:08:33 +00:00
Andrzej Sulecki ab6267ad33 [internal/CI][PyT/ConvNets] Move to launcher 2021-07-26 21:18:30 +00:00
nv-kkudrynski d788e8d496
Merge pull request #970 from nv-kkudrynski/kkudrynski/readme_notice
Adding links to performance benchmark page
2021-07-21 14:46:11 +02:00
Krzysztof Kudrynski 49e23b4597 Adding links to performance benchmark page 2021-07-21 14:39:48 +02:00
Yuanzhe Dong 3d8d878489 [ConvNets/PyT] update triton repo url in readmes 2021-07-21 12:53:10 +02:00
Joaquin Anton 41e4a07a26
[Jasper/PyT] Update DALI Jasper pipeline to functional API
Signed-off-by: Joaquin Anton <janton@nvidia.com>
2021-07-21 12:47:05 +02:00
Michal ce7926fc80 [nnUNet/PyT] fix pyprof import 2021-07-19 20:02:36 +02:00
gkarch 5be8a3cae2 fix for github issue 965 2021-07-15 12:42:26 +00:00
Lukasz Pierscieniewski 33110132cc [Convnets/TF] Basic CPU model support 2021-07-15 08:22:43 +00:00
Andrzej Sulecki a1bbe6687e [PyT/ConvNets] Fixing bug in model2onnx 2021-07-12 10:20:43 +00:00
Piotr Marcinkiewicz 96b70f5d16 [ResNet50/TF1][Triton] Bermuda 0.6.11 update 2021-07-12 10:19:44 +00:00
Andrzej Sulecki c481324031 [PyT/EfficientNet] Update README 2021-07-01 15:11:18 +02:00
Mikolaj Blaz 0d4dd6b523 [Jasper/PyT] Minor update in metrics and CLI params 2021-06-30 17:38:22 +00:00
Lukasz Pierscieniewski 36c2d7e8b8 [ResNet50/MX] Fixing perf bug with latest DALI introduced with HW decoder on Ampere 2021-06-30 11:52:43 +00:00
Jan Baczek 7e8102d0ec [Transformer/PyT] Update to 21.05 2021-06-30 11:50:08 +00:00
kkudrynski 81ee705868 [PyTorchHub] Adding licences to entrypoint scripts 2021-06-18 14:56:07 +00:00
byshiue 39c349f81a
Merge pull request #958 from jdemouth/patch-1
Update README.md
2021-06-17 08:27:45 +08:00
Julien Demouth a079758fb9 Update README.md 2021-06-16 22:34:05 +02:00
Piotr Marcinkiewicz 5c33a8289b
[ResNet50/PyT] Triton perf fix
* ResNet50/PyT Triton ONNXruntime fix with env flag

Scripts were modified to fix missing ORT_TENSORRT_FP16_ENABLE flag for
Triton Inference Server with ONNXRuntime and TensorRT execution provider.

* ResNet50/PyT TensorRT FP16 support fixed

ONNX to TensorRT converter was fixed to force FP16 precision for
TensorRT networks.
2021-06-16 16:04:22 +02:00
nv-kkudrynski e22cfdd617
Merge pull request #957 from alancucki/jasper/fix_pyprof
[Jasper/PyT] Fix pyprof import
2021-06-14 16:56:17 +02:00
Adrian Lancucki f41d86db2a [Jasper/PyT] Fix pyprof import 2021-06-14 16:53:47 +02:00
Alex Olmucci Poddubnyy 15af494a8e [DLRM/PyT] Container update to 21.04 2021-06-10 03:56:12 -07:00
Lukasz Pierscieniewski 70e0d55e80 Add missing QAT training file 2021-06-10 10:20:15 +00:00
Jan Golda 778583481b [TorchHub] restructured hubconf and updated SSD and Tacotron2/WaveGlow entrypoints 2021-06-07 12:20:51 +00:00
Szymon Migacz ff6f7c6532 [GNMT/PyT] Update for PyT GNMT (#944) 2021-05-27 09:48:37 -07:00
Andrzej Sulecki 555b84b3b1 [ConvNets/PyT] Adding checkpoints for EfficientNet/PyT, Squeeze&Excitation can use Conv or Linear layer depending on --trt switch. 2021-05-26 11:03:13 +00:00
Adrian Lancucki 8d8c524df6 [FastPitch/PyT] Fix updated regulate_len 2021-05-21 11:35:47 +02:00
Grzegorz Karch 5a8521ee05 [Tacotron2/PyT] rename common->tacotron2_common 2021-05-18 16:28:11 +00:00
Jan Golda 5afd63dde0 [MaskRCNN/TF2] Update 2021-05-17 09:22:33 +02:00
Michal Futrega 905e9e507e [nnUnet/PyT] Add support for Triton 2021-05-14 15:41:09 +02:00
nv-kkudrynski 2a2735fed1
Merge pull request #935 from alancucki/readme-update
[FastPitch/PyT] Update model description
2021-05-14 14:09:58 +02:00
Adrian Lancucki 0c04f3caaf Update model description 2021-05-14 14:05:32 +02:00
nv-kkudrynski 97942685fb
Merge pull request #934 from PiotrBigajNV/master
[Wide&Deep/TF2] Add Dockerfile symlink to W&D
2021-05-10 12:01:17 +02:00
Piotr Bigaj ceaa6c63bd Add Dockerfile symlink to W&D 2021-05-10 11:39:14 +02:00
hXl3s 2d555548b6 [ResNet/TF] Fix gradient calculation for sync variable 2021-04-26 15:00:10 +02:00
nv-kkudrynski 01201316f8
Merge pull request #924 from NVIDIA/gh/release
[MaskRCNN/TF2] Readme fix
2021-04-24 17:55:58 +02:00
kkudrynski e438e68236 [MaskRCNN/TF2] Readme fix 2021-04-24 17:54:33 +02:00
Subhankar Ghosh ec2bef8444
[EfficientNet/TF2] Fix inference numbers and calculations
Co-authored-by: Subhankar Ghosh <subhankarg@nvidia.com>
2021-04-22 13:52:13 +02:00