Commit graph

387 commits

Author SHA1 Message Date
Greg Pauloski 7a4c42501c [BERT/PyT] Fix dataloader typo 2020-09-09 11:28:31 -05:00
kkudrynski 5d36b4fd2f Fixing hyperlinks 2020-09-08 16:10:32 +02:00
kkudrynski 21fcdd6fcf [DLRM/PyT] Triton updates 2020-09-07 16:19:47 +02:00
Sharath T S 8588e9834c
[BERT/PyT] specify GPU for triton (#666) 2020-09-01 22:07:36 -07:00
Sharath T S 5cc03caa15
[BERT/PyT] Update pretrained checkpoint links (#660)
* Update pretrained checkpoint links

* update link
2020-08-21 13:34:05 -07:00
kkudrynski 0d15a95c8f [DLRM/PyT] Readme fixes 2020-08-18 15:04:47 +02:00
kkudrynski 3745b49898 [DLRM/PyT] Update 2020-08-17 11:14:46 +02:00
kkudrynski 88864b9291 [BERT/PyT] MRPC and SST-2 support 2020-08-14 16:43:11 +02:00
Sharath T S e8f87acdb1
Keep wikiextractor version fixed 2020-08-10 11:46:17 -07:00
kkudrynski 557f4d01ea [Jasper/PyT] Triton update 2020-08-05 16:44:50 +02:00
kkudrynski 9fa75813e6 Merge branch 'gh/master' into gh/release 2020-08-05 16:38:18 +02:00
Sharath T S 1f9226283e
[BERT/PyT] link gitlab -> DLE (#634) 2020-08-03 22:02:55 -07:00
kkudrynski 5d2914e3ff [GNMT/PyT] Updating for Ampere 2020-08-01 15:47:34 +02:00
Sharath T S 0ef5568a9d
[BERT/PyT] default train:test 9:1 (#616) 2020-07-22 14:07:50 -07:00
Przemek Strzelczyk 180382499f [Transformer/PyT] Removing unnecessary files 2020-07-16 15:29:24 +02:00
Sharath T S 3337f72cff
[BERT/PyT] Update DataPrep (#595)
* Update DataPrep

* Update create_datasets_from_start.sh

* Update README.md

* Update README.md

* Update README.md
2020-07-08 17:19:45 -07:00
Krzysztof Kudrynski aa36bc0fba [Jasper/PyT] Fix images in readme 2020-07-08 19:06:41 +02:00
GrzegorzKarchNV 25fe4d3856
Merge pull request #592 from tonmoay/tonmoay-tt2patch
[Tacotron2] fixed get_model in inference.py
2020-07-08 13:11:49 +02:00
Krzysztof Kudrynski 33bdf65b18 readme fixes 2020-07-08 12:55:56 +02:00
Tonmoay Deb c480fbfcf6 [Tacotron2] fixed get_model in inference.py 2020-07-08 11:14:53 +06:00
Krzysztof Kudrynski 31ca062d93 [SSD/PyT] Updating for Ampere 2020-07-07 23:41:00 +02:00
nv-kkudrynski b2b2d1cb1d
Merge pull request #590 from NVIDIA/jasper-ampere
[Jasper/PyT] Updating for Ampere
2020-07-07 23:20:27 +02:00
Krzysztof Kudrynski ae7fce1e34 [Jasper/PyT] Updating for Ampere 2020-07-07 22:58:04 +02:00
Sharath T S b2763ae273
[BERT/PyT] Update ampere perf params (#589) 2020-07-07 12:43:03 -07:00
Sharath T S d9050d6da0
[MaskRCNN/PyT] Fix indentation (#588) 2020-07-07 10:38:27 -07:00
Sharath T S 18db1c17f5
Update README.md 2020-07-07 02:23:36 -07:00
Przemek Strzelczyk f8b3a63f81 [BERT/PyT] Updating for Ampere and 20.06 container 2020-07-04 03:12:11 +02:00
Przemek Strzelczyk 2860d6fe04 [Transformer/PyT] Updating for 20.06 and Ampere 2020-07-04 02:28:25 +02:00
Przemek Strzelczyk 77dad060a2 [FastPitch] Updating for Ampere 2020-07-04 02:24:45 +02:00
Przemek Strzelczyk 36f3b1b670 [DLRM/PyT] Updates for Ampere 2020-07-04 01:27:39 +02:00
Przemek Strzelczyk f0c8bc571a [Tacotron2/PyT] Updating for Ampere 2020-07-04 01:15:57 +02:00
Przemek Strzelczyk 96138d5087 [BERT/TF] Updating for Ampere 2020-07-04 01:00:48 +02:00
PrzemekS 24b8c9c7fd
Merge pull request #586 from vinhngx/vinhn-jasper-file-location-fix
Jasper colab notebook: fix file location
2020-07-03 10:03:45 +02:00
Vinh Nguyen ac4c539dc4 fix file location 2020-07-02 18:33:26 -07:00
GrzegorzKarchNV 4f02f7af81
Update Dockerfile 2020-07-01 09:51:51 +02:00
Przemek Strzelczyk 17bc6aac81 [NCF/PyT] Ampere support added 2020-06-27 12:00:21 +02:00
Przemek Strzelczyk fa59dd505a [MaskRCNN/TF&PyT] Adding Ampere support 2020-06-27 11:52:08 +02:00
Przemek Strzelczyk f838cf3292 [Transformer-XL/PyT] Added Ampere support 2020-06-27 09:57:21 +02:00
Przemek Strzelczyk 46ff3707e0 [ConvNets/PyT] Adding support for Ampere and 20.06 container 2020-06-27 09:32:20 +02:00
Przemek Strzelczyk 4eaa4434de [ConvNets/TF] Adding support for Ampere 2020-06-27 09:24:41 +02:00
PrzemekS 5ca706264e
Merge pull request #523 from yzhang123/yzhang123-patch-6
fix jasper column name mixup again
2020-06-25 12:39:22 +02:00
PrzemekS 3b5fd6800e
Merge pull request #540 from vinhngx/vinhn-jasper-colab-fix
Jasper Colab TRT notebook fix
2020-06-25 12:38:49 +02:00
PrzemekS 5edd998b4b
Merge pull request #564 from narendasan/patch-1
Typo in README instructions
2020-06-25 12:38:15 +02:00
Elton Chen-Yu Ho accf26d1d8
Fix a typo in PyTorch ConvNets README.md 2020-06-22 14:32:37 +08:00
Naren Dasan 36e8b3e751
Typo in README instructions
the flag is `-it`
2020-06-18 20:38:10 -06:00
Krzysztof Kudrynski f11884b38a minor readme updates 2020-06-12 13:50:44 +02:00
Przemek Strzelczyk 23cc1cd5bb [FastPitch/PyT] Small README fixes 2020-06-12 12:13:05 +02:00
Przemek Strzelczyk 66ed01d1ac [FastPitch/PyT] README updates 2020-06-08 18:53:02 +02:00
Vinh Nguyen 709456cdd7 fix readme for colab notebook 2020-05-30 11:51:37 +10:00
gkarch fb6d73d8f5 updated README for Triton 2020-05-29 13:50:04 +02:00
Tomasz Grel 748b0d47aa
Merge pull request #517 from t-kusanagi/dlrm-device-bug
Fix [PyT/DLRM] bug of model.py
2020-05-29 13:09:49 +02:00
gkarch 5d792121ba removed tcmalloc from Dockerfile 2020-05-28 14:14:39 +02:00
t-kusanagi 7fe3d5be4f Fix [PyT/DLRM] bug of model.py
Pass device=base_device argument to self._interaction_padding.
2020-05-28 11:51:10 +00:00
gkarch 2480060f2a added tcmalloc in Dockerfile 2020-05-27 15:26:19 +02:00
PrzemekS 7a2f7d4a55
Merge pull request #516 from NVIDIA/nvptr/6d7283
Adding FastPitch/PyT (modified version of FastSpeech)
2020-05-26 18:55:40 +02:00
Ilya Shutov 680ccfcf89 Tacotron2 --cpu-run fix 2020-05-21 21:05:24 +07:00
Yang Zhang cbde541082
fix trt benchmark column names
2nd time to fix trt benchmark column names, since it was overwritten by another commit
1st time was https://github.com/NVIDIA/DeepLearningExamples/pull/267
2020-05-20 10:11:41 -07:00
gkarch 3bddd5df99 updates MR78 2020-05-19 15:06:13 +02:00
t-kusanagi a685b398bc Fix [PyT/DLRM] bug of model.py 2020-05-19 00:15:12 +00:00
Przemek Strzelczyk 6d72839a6c Adding FastPitch/PyT (modified version of FastSpeech) 2020-05-18 18:49:00 +02:00
gkarch 1cfa0ecffb small fixes 2020-05-18 12:08:59 +02:00
gkarch 53e7e4f130 updating trtis_cpp 2020-05-18 11:04:16 +02:00
Sharath T S 9df464f277
[BERT/PyT] stop and resume, single gpu and timing fixes. (#509)
* stop and resume, single gpu and timing fixes.

* Update utils.py

* accumulation features check
2020-05-17 12:46:53 -07:00
Sharath T S 3aae0204c3
1. stop and resume 2020-05-16 22:47:53 -07:00
gkarch bf2b7a0767 fixed trtis dockerfile 2020-05-14 22:24:54 +02:00
gkarch 8d8337196f fixed trtis 2020-05-14 19:58:45 +02:00
gkarch 86dc81241c [Tacotron2] fixed load_and_setup_model in ONNX exports 2020-05-13 11:24:39 +02:00
gkarch 1272f6fafa [Tacotron2] fixed get_model in train.py 2020-05-13 11:08:15 +02:00
PrzemekS af32ac2a23
Merge pull request #456 from NVIDIA/sharathts-patch-1
Fix to load Google's checkpoint
2020-05-07 11:09:20 +02:00
maggiezha 387f700c4c
add Intel Optimization for PyTorch
Intel's optimizations for PyTorch on CPU are added; you need to set "export OMP_NUM_THREADS=num physical cores" based on your CPU's core count
2020-05-07 15:57:07 +10:00
maggiezha 150f877e19
adding CPU optimization
export OMP_NUM_THREADS=num physical cores
export KMP_BLOCKTIME=0
export KMP_AFFINITY=granularity=fine,compact,1,0
https://software.intel.com/content/www/us/en/develop/articles/maximize-tensorflow-performance-on-cpu-considerations-and-recommendations-for-inference.html
2020-05-07 15:51:45 +10:00
GrzegorzKarchNV 67a7d9c4eb
Merge pull request #482 from maggiezha/cpu-run
Cpu run
2020-05-06 21:27:26 +02:00
maggiezha acf833f7e4
adding support for --cpu-run 2020-05-06 21:57:05 +10:00
maggiezha 3a6b667118
adding support for --cpu-run 2020-05-06 21:52:49 +10:00
maggiezha 342c4710fc
adding support for --cpu-run 2020-05-06 21:48:43 +10:00
maggiezha 0e986cc1f0
adding support for --cpu-run 2020-05-06 21:46:45 +10:00
maggiezha ca43a1b1e5
adding support for --cpu-run 2020-05-06 21:42:43 +10:00
gkarch d0d4df70a1 constant batch in TensorRT engine build 2020-05-06 13:01:08 +02:00
maggiezha a4ce69a5a7
add support for --cpu-run 2020-05-06 20:58:45 +10:00
maggiezha 9268095bb1
add support for --cpu-run 2020-05-06 20:57:10 +10:00
maggiezha 3a11f10bfe
add support for --cpu-run 2020-05-06 20:50:59 +10:00
gkarch b0ab215441 fixed trt inference 2020-04-27 16:05:33 +02:00
Sharath T S 4733603577
[BERT/PyT] Fix squad inference corner case (#462) 2020-04-20 21:02:18 -07:00
gkarch 063de87218 fixing trt tests 2020-04-20 17:01:35 +02:00
Sharath T S a9c997cd57
Fix to load Google's checkpoint 2020-04-13 11:34:56 -07:00
PrzemekS 1cad180164
Merge pull request #453 from NVIDIA/nvpstr/87ec80
Nvpstr/87ec80
2020-04-09 07:11:48 +02:00
Przemek Strzelczyk 15807b36bf Adding DLRM/PyT 2020-04-08 18:17:57 +02:00
Sharath T S 5626846924
[BERT/PyT] Revert from native gelu. Breaks ONNX export. (#447) 2020-04-07 10:28:45 -07:00
gkarch 1f7950aa9d fixed alerts from https://lgtm.com/projects/g/NVIDIA/DeepLearningExamples/rev/pr-8d221f6760830499933042c03fcd83605adbd98e 2020-04-03 18:28:20 +02:00
GrzegorzKarchNV 00ca1c4bcf
Update README.md 2020-04-03 13:44:16 +02:00
Przemek Strzelczyk 5e3b487b89 [Tacotron2/PyT] custom TensorRT backend on TensorRT Inference Server; Conversational AI demo; fixed checkpoints loading; fixed FP16 export to TensorRT 2020-04-02 17:18:26 +02:00
Przemek Strzelczyk 26c2676104 [BERT/PyT] Triton Inference Server support 2020-04-02 14:39:24 +02:00
Sharath T S 793b92dca7
[BERT/PyT] fp32 and allreduce_post_accumulation compatibility (#422) 2020-03-15 23:03:06 -07:00
PrzemekS b03375bd6c
Update README.md 2020-03-12 19:09:18 +01:00
PrzemekS adeed4fcbc
Update README.md 2020-03-12 19:06:10 +01:00
PrzemekS 1c77a04548
Merge pull request #403 from yzhang123/trt_dynamic_shape_update
Jasper TRT Fix
2020-03-09 17:28:31 +01:00
Przemek Strzelczyk 96ff411ce8 [BERT/PyT] Typo in README 2020-03-05 09:54:16 +01:00
Rajeev Rao 4f42950e36 Initialize CUDA state before loading TRT engines in Tacotron sample 2020-03-03 09:56:54 -08:00
Przemek Strzelczyk 77a1bb917a [Tacotron2/PyT] Updates: better perf, better trt7 support, new logging, bug fixes 2020-02-28 15:36:14 +01:00
Przemek Strzelczyk 155578a762 [BERT/PyT] New logging and some README updates 2020-02-28 13:21:20 +01:00
Yang Zhang c72df2be4a fix syntax error 2020-02-14 15:49:54 -08:00
Yang Zhang b738d1d1af fix trt regression bug
Signed-off-by: Yang Zhang <yangzhang@nvidia.com>
2020-02-14 15:44:26 -08:00
PrzemekS ce73b32068
Merge pull request #392 from NVIDIA/nvpstr/1def26
Updating BERT/TF, Transformer-XL and NCF/PyT
2020-02-06 19:47:30 +01:00
Przemek Strzelczyk a38deff61e [Transformer-XL/PyT] Large model support; multi-node training; inference with TorchScript 2020-02-05 22:38:46 +01:00
Przemek Strzelczyk 1def26d80c [NCF/PyT] Adding new logging 2020-02-05 21:58:04 +01:00
Sharath T S ad88003e13
[BERT/PyT] Glue(MRPC) fine-tuning with LAMB pretrained checkpoint
* LAMB checkpoint compatibility
* LAMB checkpoint compatibility; amp training
2020-02-03 16:27:11 -08:00
Sharath T S 119838f1f6
Bugfix in BertAdam for fp32 finetuning (#388) 2020-01-30 20:11:03 -08:00
PrzemekS aa061052c6
Merge pull request #345 from nvcforster/master
Updating BERT Readme Docs
2020-01-02 14:35:59 +01:00
Krzysztof Kudrynski f0ef8493eb ConvNets update 2019-12-20 14:54:58 +01:00
gkarch 480f1f9811 update perf table for taco2 trt 2019-12-18 11:45:17 +01:00
Rajeev Rao 42555b734b Minor README cleanup for TRT Tacotron2 example
Signed-off-by: Rajeev Rao <rajeevrao@nvidia.com>
2019-12-17 14:33:59 -08:00
gkarch e5bd631f77 fixed trt table 2019-12-17 16:54:41 +01:00
gkarch 684678086a fixed speedup factor 2019-12-17 16:51:22 +01:00
gkarch 82518bad2d fixed speedup in trt readme 2019-12-17 16:49:44 +01:00
gkarch 5dbb6c5b91 updated perf scripts 2019-12-17 16:48:08 +01:00
gkarch 3634315318 fixed tabs 2019-12-16 18:31:29 +01:00
gkarch 2992264d3a updated Tacotron2: included trt, new dllogger 2019-12-16 18:29:08 +01:00
nv-kkudrynski e0b365ce6e
Merge pull request #347 from NVIDIA/nvpstr/63cf4f
Nvpstr/63cf4f
2019-12-16 13:24:07 +01:00
Hyungon_Ryu fb2709e002 [Transformer-XL/PyT] Update train.py (#348)
bug fix for inv_sqrt scheduler
2019-12-16 10:06:42 +01:00
Przemek Strzelczyk 5562ab767a Adding SE-ResNext and ResNext / PyT 2019-12-15 05:13:59 +01:00
Chris Forster 67b7543feb
Update README.md
Adding link to our Medium.com article that provides details about implementing the LAMB optimizer.
2019-12-13 12:51:41 -08:00
skierat 2fb7c8f472
Merge pull request #149 from HanbumKo/HanbumKo-patch-1
Update inference.py
2019-12-09 11:32:36 +01:00
kkudrynski c9c3eaacf1 minor fixes: submodules, jasper readme 2019-12-06 20:38:09 +01:00
Przemek Strzelczyk e89dfbf536 [Jasper/PyT] Fixing notebooks/README.md 2019-12-06 14:52:20 +01:00
Przemek Strzelczyk 09622fa363 [Jasper/PyT] Added: inference support for TRT6 and TRT-IS with various backends; new Jupyter notebooks 2019-12-05 20:40:27 +01:00
PrzemekS dc63c016cf
Merge pull request #312 from sharathts/patch-6
Fix case with one training shard only
2019-12-04 15:30:13 +01:00
Przemek Strzelczyk ca28f55476 [Transformer-XL/PyT] renaming folders 2019-11-28 09:48:59 +01:00
Przemek Strzelczyk 547bd323a0 [Jasper/PyT] Adding Colab TRT notebook 2019-11-27 17:07:31 +01:00
Przemek Strzelczyk 3d46067af9 Adding TransformerXL/PyT 2019-11-27 17:00:18 +01:00
Sharath T S 657874ae09
Fix case with one training shard only 2019-11-20 10:52:32 -08:00
nvpstr 7d772b8bc9
Merge pull request #299 from cschaefer26/tacotron2-fix-val-lo
Fixed wrong val loss being logged
2019-11-19 10:16:21 +01:00
nvpstr 3d4cc84640
Merge pull request #297 from GrzegorzKarchNV/tacotron2-readme-update
updated tacotron2 trtis readme
2019-11-19 10:08:52 +01:00
nvpstr fdf7124915
Merge pull request #71 from vinhngx/patch-1
Include jupyter in this docker build
2019-11-19 10:04:16 +01:00
nvpstr ad4267cb70
Merge pull request #69 from vinhngx/jupyter
add this repo to docker build
2019-11-19 10:03:46 +01:00
Przemek Strzelczyk a70896405d Updating BERT/PyT
* Use LAMB from APEX
* Code cleanup
* Bug fix in BertAdam optimizer
2019-11-18 23:07:24 +01:00
Christian Schäfer f0eede6bf8
Update train.py
fixed logged val loss
2019-11-15 09:46:46 +01:00
gkarch 09706537d2 updated tacotron2 trtis readme 2019-11-14 15:24:14 +01:00
GrzegorzKarchNV 4e00153ab5 added TRTIS demo to Tacotron2 (#281)
* added TRTIS demo

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md
2019-11-06 19:46:35 +01:00
Sharath T S 78e97e324e
Fix incorrect perf numbers 2019-10-25 15:52:44 -07:00
Yang Zhang 81df05da1b update Jasper TRT Readme numbers (#267)
fix Readme table column name mix up
2019-10-23 12:38:43 +02:00
nvpstr 970a54b296
Merge pull request #261 from sharathts/patch-3
fix single gpu support
2019-10-21 19:59:14 +02:00
Przemek Strzelczyk b09ce9831b Updating GNMT/PyT 2019-10-21 19:41:32 +02:00
Przemek Strzelczyk e470c2150a Updating RN50/MxNet 2019-10-21 19:20:40 +02:00
Sharath T S f24491a940
fix single gpu support 2019-10-18 17:28:02 -07:00
Sharath T S 7121e21d11
fix logging total steps 2019-10-16 15:12:00 -07:00
Sharath T S 8cc635f638
Fix training perf calculation 2019-10-15 17:39:39 -07:00
Przemek Strzelczyk 81519d47a2 NCF/PyT - adjusting to API changes in PyTorch and APEX 2019-10-04 15:05:13 +02:00
Przemek Strzelczyk d29920a8b0 MaskRCNN/PyT update
* Jupyter notebooks added
* Updates for PyTorch 1.2
2019-10-04 14:34:26 +02:00
Przemek Strzelczyk 68c6d2321f Small updates to README.md (Jasper,notebooks) 2019-10-04 14:11:10 +02:00
maggiezha 075bca5742 update readme of notebook (#229)
* updating notebook
* updating wav files
* updating examples
2019-10-01 19:41:26 +02:00
Sharath T S 7fffeade4e
Update Dockerfile 2019-09-25 11:21:15 -07:00
Sharath T S 746cfe21dc
Update Dockerfile 2019-09-25 11:17:40 -07:00
Przemek Strzelczyk 10e805921a [Jasper/PyT] Updated notebooks 2019-09-23 16:08:53 +02:00
nvpstr 2fec868452
Merge pull request #221 from maggiezha/patch-2
update readme
2019-09-20 18:06:03 +02:00
nvpstr 85aa84467f
Merge pull request #220 from maggiezha/patch-1
Update README.md
2019-09-20 18:05:30 +02:00
yzhang123 5e126c7154
Update JasperTRT.ipynb 2019-09-19 11:02:01 -07:00
yzhang123 4db4d43829
Update JasperTRT.ipynb
updated default mount directory paths
2019-09-19 09:50:58 -07:00
maggiezha b13bcc9405
update readme
Running the notebook directly from /notebooks/ doesn't work: the later container mapping gets messed up and gives an error like "No such file or directory". The notebook needs to be copied to the root directory to work.
2019-09-19 15:39:55 +10:00
maggiezha c13e5f6704
Update README.md
move requirements from notebook to readme
2019-09-19 13:36:36 +10:00
maggiezha 6cbd0fa2a4
Update README.md 2019-09-19 13:31:17 +10:00
Przemek Strzelczyk f7b0a9c583 [Jasper/PyT] Small README and notebook fixes 2019-09-18 23:38:40 +02:00
Przemek Strzelczyk 2de99b5fa7 [Jasper/PyT] Adding TRT support + jupyter notebooks for inference 2019-09-18 22:05:24 +02:00
Szymon Migacz 3014f38a3f
[GNMT PyT] Fix for fp16 training w/o label smoothing (#210) 2019-09-16 10:09:28 +02:00
Przemek Strzelczyk 8b249efad6 Minor fixes to BERT/PyT 2019-09-13 15:23:39 +02:00
Przemek Strzelczyk 4cce4d88e6 Updating SSD/PyT 2019-09-12 14:33:49 +02:00
Przemek Strzelczyk 6fe463fe27 [BERT/PyT] Support for multi-node 2019-09-10 17:21:52 +02:00
Przemek Strzelczyk 02b49acead [Tacotron2] Added denoiser and inference stats, fixed typos 2019-09-10 16:22:53 +02:00
gkarch b8027d8914 added jupyter notebook 2019-09-03 18:40:29 +02:00
Chris Forster 71e2b22d4a Update bertPrep.py (#183) 2019-08-29 21:49:02 +02:00
Chris Forster e72ea6947b BERT-PyT subprocess for bzip in wikidownloader (#180)
* Removing unnecessary subprocess.communicate calls

* Updating Bookscorpus downloader to require less memory

* Renaming variable
2019-08-29 07:21:53 +02:00
Chris Forster 3d3ff3e168 Cleanup and Readme Update (#174)
* update perf tables

* remove ide files

* fix tokenizer

* copyrights

* remove .communicate()

* refine training scripts

* fix more typos
2019-08-27 21:44:21 +02:00
Sharath T S 3d59216cec [BERT] [PyTorch] Data prep fix (#171)
* add dgx1-16g and dgx2 specific pretraining instructions

* fix typo in readme

* fix data prep and reflect changes in pretraining

* remove .ide files

* remove data files

* Point to right SQUAD location

* remove garbage [[]]

* default accumulation in fp32

* remove ide files

* fix phase2 DATADIR path

* remove readme in data folder
2019-08-22 07:52:18 +02:00
Sharath T S b6fb9aa463 [BERT][PyTorch]: add dgx1-16g and dgx2 specific pretraining instructions (#164)
* add dgx1-16g and dgx2 specific pretraining instructions

* fix typo in readme
2019-08-21 09:49:32 +02:00
yzhang123 f84446675e
novograd default parameter fix 2019-08-16 14:07:05 -07:00
yzhang123 5419463c91
fix novograd 2019-08-16 14:05:18 -07:00
nv-kkudrynski 9f7616dc54
minor readme fix 2019-08-14 13:30:37 +02:00
Cliff Woolley b7bf42d76c
Update README.md
Fix typo
2019-08-13 16:12:52 -07:00
Cliff Woolley 608663f6ec Don't omit the data/ scripts from docker build 2019-08-13 15:41:48 -07:00
Cliff Woolley 8546c7a6df Cleanups 2019-08-13 15:33:32 -07:00
Cliff Woolley 7afcd73af1 Cleanup unneeded files 2019-08-13 15:32:00 -07:00
Krzysztof Kudrynski bae6e931bd updating BERT (single node LAMB support) 2019-08-13 23:27:54 +02:00
Krzysztof Kudrynski ab85e6cc3d Updating Tacotron2_pyt (BatchNorm init fix), Resnet_tf (cosine LR),
Transformer_pyt (bugfix)
2019-08-13 15:01:10 +02:00
HanbumKo 8d365d4b1f
Update inference.py
(mean, std) is used in normalize, not in load_image.
2019-08-06 14:30:04 -07:00
sharatht 803963408a remove directory check in data download 2019-08-02 22:28:02 -07:00
nvpstr d9f925cb9c
Merge pull request #130 from NVIDIA/nvpstr/release
Adding Jasper/PyT
2019-07-30 16:18:27 +02:00
gkarch b56f72bcd9 fixed audio 2019-07-30 15:01:15 +02:00
Przemek Strzelczyk fa400a7367 Adding Jasper/PyT 2019-07-26 20:08:16 +02:00
nvpstr 6c0b5e36b4
Merge pull request #125 from NVIDIA/nvpstr/release
Updating BERT with TRT-IS support and new results
2019-07-25 20:47:32 +02:00
nvpstr d3813fde80
Merge pull request #122 from GrzegorzKarchNV/update19.07
updated Tacotron2 version
2019-07-25 18:46:33 +02:00
Przemek Strzelczyk 8218872051 Updating BERT with TRT-IS support and new results 2019-07-25 16:53:05 +02:00
yzhang123 0af34d778c
fix launch.sh 2019-07-24 12:24:43 -07:00
yzhang123 2eb764b43c
fix build.sh 2019-07-24 12:23:34 -07:00
Grzegorz Karch d0c6294695 removed cudnn-benchmark from tacotron2 perf command lines in the readme 2019-07-24 01:08:20 -07:00
Grzegorz Karch 7e5013fff9 fixed script names in the readme 2019-07-24 00:30:07 -07:00
Grzegorz Karch bb7a4ac630 updated container version in the readme, changed order of benchmarking scripts 2019-07-23 15:50:02 -07:00
Grzegorz Karch 6c42c20948 updated readme, number of epochs in training scripts 2019-07-23 15:43:24 -07:00
Grzegorz Karch 87accc3073 updated performance benchmark command lines in the readme 2019-07-23 15:26:54 -07:00
Grzegorz Karch 25b3f2678a fixed inference results 2019-07-23 14:29:58 -07:00
Grzegorz Karch 979e291848 updated to 19.07 version 2019-07-23 12:45:37 -07:00
Przemek Strzelczyk a644350589 Updating models and adding BERT/PyT
Tacotron2+Waveglow/PyT
* AMP support
* Data preprocessing for Tacotron 2 training
* Fixed dropouts on LSTMCells

SSD/PyT
* script and notebook for inference
* AMP support
* README update
* updates to examples/*

BERT/PyT
* initial release

GNMT/PyT
* Default container updated to NGC PyTorch 19.05-py3
* Mixed precision training implemented using APEX AMP
* Added inference throughput and latency results on NVIDIA Tesla V100 16G
* Added option to run inference on user-provided raw input text from command line

NCF/PyT
* Updated performance tables.
* Default container changed to PyTorch 19.06-py3.
* Caching validation negatives between runs

Transformer/PyT
* new README
* jit support added

UNet Medical/TF
* inference example scripts added
* inference benchmark measuring latency added
* TRT/TF-TRT support added
* README updated

GNMT/TF
* Performance improvements

Small updates (mostly README) for other models.
2019-07-16 21:13:08 +02:00