Commit graph

698 commits

Author SHA1 Message Date
nv-kkudrynski b1ce24a54f
Merge pull request #677 from GrzegorzKarchNV/convai-update
updated convai
2020-10-07 13:03:48 +02:00
gkarch 385d81eed0 a few fixes 2020-10-07 13:01:21 +02:00
gkarch 550123fbbc updated convai 2020-10-07 11:39:45 +02:00
nv-kkudrynski 0b27e359a5
Merge pull request #708 from NVIDIA/gh/release
[FastPitch/PyT] Updating for 20.08
2020-09-30 13:29:00 +02:00
kkudrynski d057babe5c [FastPitch/PyT] Updating for 20.08 2020-09-30 13:23:05 +02:00
byshiue b2e89e6e80
[FT] FasterTransformer 3.0 Release (#696)
[FT] feat: Add FasterTransformer v3.0

1. Add supporting of INT8 quantization of cpp and TensorFlow op.
2. Provide the tools to quantize the model.
3. Fix the bugs that cmake 3.15 and 3.16 cannot build this project. 
4. Deprecate the FasterTransformer v1
2020-09-23 10:03:37 +08:00
byshiue 66d18913a1
Merge branch 'master' into master 2020-09-20 13:45:11 +08:00
nv-kkudrynski 94518be547
Merge pull request #693 from hXl3s/RN50/ngc-checkpoint-update
[ConvNets/PyT] Fixed distributed checkpoint loading
2020-09-18 13:03:46 +02:00
Łukasz Pierścieniewski 72f40b825a Fixed distributed checkpoint loading 2020-09-18 12:28:49 +02:00
Sharath T S a74236afd4
[BERT/PyT] remove redundant section (#690) 2020-09-16 17:06:29 -07:00
Rajeev Rao aacbda693a
Update Jasper sample to TensorRT 7.1.3.4 (#687) 2020-09-15 22:58:04 +02:00
Sharath T S 482fe9ac8a
[BERT/PyT] fix onnx export (#689) 2020-09-15 12:44:10 -07:00
kkudrynski 437b950d5b Fixed links in readme 2020-09-14 23:58:38 +02:00
Szymon Migacz 6b82d3acb3
[TXL/PyT] Minor update for PyTorch Transformer-XL (#688) 2020-09-14 14:42:57 -07:00
nv-kkudrynski 152d0c0344
Merge pull request #684 from gpauloski/bert_pytorch_fix
[BERT/PyT] Fix dataloader typo
2020-09-11 18:40:26 +02:00
nv-kkudrynski 49e387c788
Merge pull request #633 from andabi/master
remove pretrained aligns and update readme accordingly.
2020-09-11 13:26:17 +02:00
nv-kkudrynski 1402e9403e
Update CUDA-Optimized/FastSpeech/README.md
Co-authored-by: alancucki <alancucki@users.noreply.github.com>
2020-09-11 13:23:41 +02:00
nv-kkudrynski cf54b787ae
fixed link 2020-09-10 15:48:54 +02:00
Greg Pauloski 7a4c42501c [BERT/PyT] Fix dataloader typo 2020-09-09 11:28:31 -05:00
kkudrynski 5d36b4fd2f Fixing hyperlinks 2020-09-08 16:10:32 +02:00
nv-kkudrynski 323005c443
Merge pull request #676 from NVIDIA/gh/release
[DLRM/PyT] Triton updates
2020-09-08 12:10:28 +02:00
kkudrynski 21fcdd6fcf [DLRM/PyT] Triton updates 2020-09-07 16:19:47 +02:00
Sharath T S 8588e9834c
[BERT/PyT] specify GPU for triton (#666) 2020-09-01 22:07:36 -07:00
Sharath T S 5cc03caa15
[BERT/PyT] Update pretrained checkpoint links (#660)
* Update pretrained checkpoint links

* update link
2020-08-21 13:34:05 -07:00
nv-kkudrynski 0e6cfbd0a1
Merge pull request #659 from hXl3s/RN50/readme-update
[ConvNets/TF] Document synthetic dataset options
2020-08-20 16:45:53 +02:00
Łukasz Pierścieniewski 8bd6dd14d3 Document synthetic dataset options 2020-08-20 16:21:50 +02:00
Sharath T S 446c878878
[ELECTRA/TF2] Update inference latency (#657)
* Update inference latency

* Fix inference perf numbers

* Fix latency computation
2020-08-19 20:43:44 -07:00
nv-kkudrynski bbbc823072
Merge pull request #655 from NVIDIA/gh/release
[DLRM/PyT] Readme fixes
2020-08-18 15:06:23 +02:00
kkudrynski 0d15a95c8f [DLRM/PyT] Readme fixes 2020-08-18 15:04:47 +02:00
nv-kkudrynski d875531dd6
Merge pull request #654 from NVIDIA/dlrm_update
[DLRM/PyT] Update
2020-08-17 13:22:23 +02:00
kkudrynski 3745b49898 [DLRM/PyT] Update 2020-08-17 11:14:46 +02:00
nv-kkudrynski ff7e38bf87
Merge pull request #650 from NVIDIA/bert_pyt_mrpc
[BERT/PyT] MRPC and SST-2 support
2020-08-14 16:51:27 +02:00
kkudrynski 88864b9291 [BERT/PyT] MRPC and SST-2 support 2020-08-14 16:43:11 +02:00
Swetha Mandava 7c0afee460
Merge pull request #648 from swethmandava/master
Bert TF download tfrecords with correct name
2020-08-13 12:55:58 -07:00
Swetha Mandava 9d4c9f3eb0 tfrecords with correct name 2020-08-13 12:52:46 -07:00
Pablo Ribalta Lorenzo fb40734b31
Remove autobench scripts (#647)
Signed-off-by: Pablo Ribalta <pribalta@nvidia.com>
2020-08-12 11:48:46 -07:00
nv-kkudrynski 41a0891313
Merge pull request #645 from NVIDIA/sharathts-patch-4
Keep wikiextractor version fixed
2020-08-11 18:49:52 +02:00
Swetha Mandava c8bbdb5798
Merge pull request #644 from swethmandava/master
Bert tf update (triton v2, fixes)
2020-08-11 08:29:39 -07:00
Swetha Mandava 1069a7358c converge to pyt 2020-08-10 13:15:39 -07:00
Sharath T S e8f87acdb1
Keep wikiextractor version fixed 2020-08-10 11:46:17 -07:00
Swetha Mandava efd6384176 pointing to wikiextractor commit 2020-08-10 11:27:47 -07:00
Swetha Mandava b82c372047 triton v2 api, download mrpc fix, update for mpi 4.2 2020-08-10 11:09:40 -07:00
Swetha Mandava 769843e51a
Merge pull request #11 from NVIDIA/master
Update Aug 10 2020
2020-08-10 11:04:43 -07:00
nv-kkudrynski 36ad5fe657
Update .gitmodules 2020-08-06 15:01:22 +02:00
BO-YANG HSUEH 1aa6813450
[FT] 1. Fix the bug of TensorRT plugin of FasterTransformer encoder. (#640)
* [FT] 1. Fix the bug of TensorRT plugin of FasterTransformer encoder.
2020-08-06 20:15:49 +08:00
nv-kkudrynski 280e75c63e
Merge pull request #636 from NVIDIA/gh/release
[VAE/TF] Updating for Ampere
2020-08-05 20:55:02 +02:00
kkudrynski 22f354e8ff [VAE/TF] Updating for Ampere 2020-08-05 20:52:38 +02:00
nv-kkudrynski 386dd8ebf6
Merge pull request #630 from NVIDIA/gh/release
Updating GNMT/PyT/TF for Ampere, fixes in WnD/TF and Jasper/PyT
2020-08-05 17:01:16 +02:00
kkudrynski 2356b898f5 [WideAndDeep/TF] scripts fix 2020-08-05 16:55:31 +02:00
kkudrynski 557f4d01ea [Jasper/PyT] Triton update 2020-08-05 16:44:50 +02:00