Commit graph

922 commits

Author SHA1 Message Date
nvpstr 8c35140712
Merge pull request #106 from NVIDIA/nvpstr/master3
Updating models and adding BERT/PyT
2019-07-16 21:27:09 +02:00
Przemek Strzelczyk a644350589 Updating models and adding BERT/PyT
Tacotron2+Waveglow/PyT
* AMP support
* Data preprocessing for Tacotron 2 training
* Fixed dropouts on LSTMCells

SSD/PyT
* script and notebook for inference
* AMP support
* README update
* updates to examples/*

BERT/PyT
* initial release

GNMT/PyT
* Default container updated to NGC PyTorch 19.05-py3
* Mixed precision training implemented using APEX AMP
* Added inference throughput and latency results on NVIDIA Tesla V100 16G
* Added option to run inference on user-provided raw input text from command line

NCF/PyT
* Updated performance tables.
* Default container changed to PyTorch 19.06-py3.
* Caching validation negatives between runs

Transformer/PyT
* new README
* jit support added

UNet Medical/TF
* inference example scripts added
* inference benchmark measuring latency added
* TRT/TF-TRT support added
* README updated

GNMT/TF
* Performance improvements

Small updates (mostly README) for other models.
2019-07-16 21:13:08 +02:00
Przemek Strzelczyk 3b3d0f6a55 Merge branch 'gh/master' into nvpstr/master3 2019-07-16 19:56:26 +02:00
Michael O'Connor 2cfd880a6b
Merge pull request #103 from lxp121/master
Adding FasterTransformer
2019-07-13 09:07:49 -10:00
Xipeng Li 75502be814 Adding FasterTransformer: A faster transformer layer inference implementation for BERT and other transformer based models. 2019-07-14 00:29:45 +08:00
Przemek Strzelczyk 0663b67c1a Updating models 2019-07-08 22:51:28 +02:00
nvpstr f89dcca19d
Merge pull request #89 from GrzegorzKarchNV/readme-epochs
changed number of epochs in train scripts; removed number of epochs f…
2019-07-01 11:44:01 +02:00
nvpstr eabbedd1d4
Merge pull request #91 from GrzegorzKarchNV/readme-fix
fixed batches for mixed precision and fp32, added info when no input …
2019-07-01 11:43:29 +02:00
yaoyiran dae8711565
Fix the Validation in Transformer
In train.py, line 391: predictions = [hypo[1] + ('\n' if hypo[-1]!='\n' else '') for hypo in predictions], I think it should be predictions = [hypo[1] + ('\n' if hypo[1][-1]!='\n' else '') for hypo in predictions] because hypo[-1] is a whole sentence which will never be "\n" at all.
2019-06-28 19:07:48 +08:00
Grzegorz Karch d7c20e5bfc fixed batches for mixed precision and fp32, added info when no input is provided for inference, added units to expected training time 2019-06-27 01:48:46 -07:00
Grzegorz Karch 7eb5eece14 changed number of epochs in train scripts; removed number of epochs from readme for the audio samples 2019-06-26 08:57:23 -07:00
nvpstr 40e074257f
Merge pull request #65 from JanuszL/fix_RN50_pytorch_DALI
Fix typo in RN50 PyTorch readme, remove unneeded lines for DALI
2019-06-13 17:59:16 +02:00
nvpstr 34a01f7b72
Merge pull request #77 from GrzegorzKarchNV/warmup
updated readme: text.txt->phrase.txt, updated inference results
2019-06-13 17:35:33 +02:00
nvpstr 9df04c29c9
Merge pull request #79 from nv-kkudrynski/ssd_torchhub_exposure
ssd exposure via hubconf.py (together with ssd utils)
2019-06-13 17:35:16 +02:00
Krzysztof Kudrynski a8328ce169 ssd exposure via hubconf.py (together with ssd utils) 2019-06-12 17:41:57 +02:00
Grzegorz Karch cd284a62ec changed text to phrase as input to inference in the readme 2019-06-07 17:37:30 +02:00
Grzegorz Karch dc7cd1ae5b updated readme 2019-06-07 17:37:30 +02:00
gkarch 79b9a48bf1 added warmup parameter to tacotron2 inference; updated README 2019-06-07 17:37:30 +02:00
nvpstr 4aa6f17167
Merge pull request #76 from GrzegorzKarchNV/torchhub
added text_to_sequence in the hubconf model
2019-06-07 15:11:28 +02:00
gkarch 362dfe6b3b added text_to_sequnce in the hubconf model 2019-06-07 14:52:23 +02:00
nvpstr da165c3a52
Merge pull request #73 from GrzegorzKarchNV/master
updated readme - warmup for inference, updated results for 19.05-py3 container
2019-06-06 16:03:19 +02:00
nvpstr bf1b29d73e
Merge pull request #63 from khcs/bert/pretrain-fix
Fix FLAG.save_checkpoint_steps naming inconsistency.
2019-06-06 16:02:28 +02:00
nvpstr 698e775680
Merge pull request #72 from jbaczek/master
Add missing submodule for PyT Transformer
2019-06-06 16:01:51 +02:00
nvpstr c5bbd74f87
Merge pull request #62 from NVIDIA/nvpstr-patch-1
Update README.md
2019-06-06 15:58:28 +02:00
gkarch 2f34442146 update perf numbers 2019-06-06 15:27:21 +02:00
gkarch 4dd9ecfcfa updated readme - warmup for inference 2019-06-06 14:16:51 +02:00
jbaczek 2774e849be remove empty dir 2019-06-05 16:20:30 +02:00
jbaczek 743162906c fix submodule path 2019-06-05 16:17:24 +02:00
jbaczek 698516f8f7 add missing submodule 2019-06-05 16:14:41 +02:00
nvpstr 2619f172c7
Merge pull request #67 from GrzegorzKarchNV/master
Update models.py - fix fp16 inference
2019-06-05 14:45:27 +02:00
Vinh Nguyen 7d3d1a028b
Include jupyter in this docker build 2019-06-05 18:38:36 +10:00
Vinh Nguyen 1cac44e014 add this repo to docker build 2019-06-05 16:32:58 +10:00
GrzegorzKarchNV 867ab68fad
Update models.py 2019-06-04 15:44:05 +02:00
Janusz Lisiecki 20f09772f8 Fix typo in RN50 PyTorch readme, remove unneeded lines for DALI
Signed-off-by: Janusz Lisiecki <jlisiecki@nvidia.com>
2019-06-03 12:03:18 +02:00
Hoo Chang Shin 37c4083c4b change to BERT-base, and checkpoints-dir. 2019-05-31 10:31:26 -04:00
Hoo Chang Shin 1d8ac4b295 Merge branch 'bert/pretrain-fix' 2019-05-30 12:58:03 -04:00
Hoo Chang Shin 65bf7dbe48 initial commit on training PubMed-gvocab-base. 2019-05-30 12:55:57 -04:00
Hoo Chang Shin f3c1cc7204 Fix FLAG.save_checkpoint_steps naming inconsistency. 2019-05-30 11:37:46 -04:00
nvpstr e5a0a056d2
Update README.md 2019-05-30 13:42:31 +02:00
nvpstr b7eff3da41
Merge pull request #60 from swethmandava/smandava/bug_fix
num_gpus pretraining fix
2019-05-30 13:25:58 +02:00
Swetha Mandava b25181c9b8 num_gpus pretraining fix 2019-05-28 09:52:52 -07:00
nvpstr 3454305d61
Merge pull request #58 from NVIDIA/nvpstr/master
Updating PyTorch models
2019-05-28 11:19:41 +02:00
Przemek Strzelczyk 5eaebefcd0 Updating PyTorch models
* New training scripts & results for RN50 PyT
* Documentation updates for Trasformer, SSD and MaskRCNN
2019-05-27 15:24:14 +02:00
nvpstr c71a3fe05b
Merge pull request #52 from sharathts/master
Fix README
2019-05-25 01:43:24 +02:00
nvpstr 3306ea2ab9
Merge branch 'master' into master 2019-05-25 01:43:11 +02:00
nvpstr 12c7bc2302
Merge pull request #45 from eric-haibin-lin/patch-1
Fix a few typos, and add gluonnlp implementation link
2019-05-25 01:37:31 +02:00
nvpstr 9712cf8281
Merge pull request #54 from vinhngx/master
adding resume capability for wget in case data downloads stop early
2019-05-25 01:37:07 +02:00
nvpstr 94472422fa
Merge pull request #55 from NVIDIA/nvpstr/master
Adding UNet(medical)/TF & updating other TF models
2019-05-25 01:36:46 +02:00
Przemek Strzelczyk d2bc3da0a1 Changes in TF models:
* added UNet for medical image segmentation
* added TF-AMP support for RN50
* small updates for other models (READMEs, benchmark & testing scripts)
2019-05-25 01:23:11 +02:00
Vinh Nguyen 8662db35a7 fix readme: direct link for Apex. Fix download script location 2019-05-24 14:47:43 +10:00