Commit graph

355 commits

Author SHA1 Message Date
xjia
fd852b56a0 fix softmax max_value 2019-09-11 07:09:53 +00:00
Przemek Strzelczyk
6fe463fe27 [BERT/PyT] Support for multi-node 2019-09-10 17:21:52 +02:00
nvpstr
b07f501eff
Merge pull request #199 from NVIDIA/nvpstr/release19.08_1
[Tacotron2] Added denoiser and inference stats, fixed typos
2019-09-10 17:04:58 +02:00
Przemek Strzelczyk
02b49acead [Tacotron2] Added denoiser and inference stats, fixed typos 2019-09-10 16:22:53 +02:00
nvpstr
da8acb1288
Merge pull request #187 from GrzegorzKarchNV/notebook_19.08
added jupyter notebook to Tacotron2
2019-09-03 21:03:35 +02:00
gkarch
b8027d8914 added jupyter notebook 2019-09-03 18:40:29 +02:00
Chris Forster
71e2b22d4a Update bertPrep.py (#183) 2019-08-29 21:49:02 +02:00
Chris Forster
e72ea6947b BERT-PyT subprocess for bzip in wikidownloader (#180)
* Removing unnecessary subprocess.communicate calls

* Updating Bookscorpus downloader to require less memory

* Renaming variable
2019-08-29 07:21:53 +02:00
Chris Forster
3d3ff3e168 Cleanup and Readme Update (#174)
* update perf tables

* remove ide files

* fix tokenizer

* copyrights

* remove .communicate()

* refine training scripts

* fix more typos
2019-08-27 21:44:21 +02:00
IrishCoffee
4850598199
Merge pull request #176 from NVIDIA/xiaoying_nan
refine softmax
2019-08-26 17:10:49 +08:00
xjia
488cc11967 refine softmax 2019-08-26 09:08:59 +00:00
Sharath T S
3d59216cec [BERT] [PyTorch] Data prep fix (#171)
* add dgx1-16g and dgx2 specific pretraining instructions

* fix typo in readme

* fix data prep and reflect changes in pretraining

* remove .ide files

* remove data files

* Point to right SQUAD location

* remove garbage [[]]

* default accumulation in fp32

* remove ide files

* fix phase2 DATADIR path

* remove readme in data folder
2019-08-22 07:52:18 +02:00
Sharath T S
b6fb9aa463 [BERT][PyTorch]: add dgx1-16g and dgx2 specific pretraining instructions (#164)
* add dgx1-16g and dgx2 specific pretraining instructions

* fix typo in readme
2019-08-21 09:49:32 +02:00
Szymon Migacz
22f122183d
Merge pull request #165 from yzhang123/yzhang123-patch-3
novograd jasper hot fix
2019-08-21 08:26:08 +02:00
yzhang123
f84446675e
novograd default parameter fix 2019-08-16 14:07:05 -07:00
yzhang123
5419463c91
fix novograd 2019-08-16 14:05:18 -07:00
yzhang123
2308d9ff62
Merge pull request #2 from NVIDIA/master
pull from nvidia
2019-08-16 14:03:28 -07:00
nv-kkudrynski
e25c23e14a
Merge pull request #153 from tlkh/master
Update formatting in AMP demo notebook
2019-08-14 13:43:04 +02:00
nv-kkudrynski
9f7616dc54
minor readme fix 2019-08-14 13:30:37 +02:00
Cliff Woolley
b7bf42d76c
Update README.md
Fix typo
2019-08-13 16:12:52 -07:00
Cliff Woolley
1fbd997d9f
Merge pull request #158 from NVIDIA/jwoolley/bert-cleanup
Minor BERT PyTorch cleanups
2019-08-13 15:47:08 -07:00
Cliff Woolley
608663f6ec Don't omit the data/ scripts from docker build 2019-08-13 15:41:48 -07:00
Cliff Woolley
8546c7a6df Cleanups 2019-08-13 15:33:32 -07:00
Cliff Woolley
7afcd73af1 Cleanup unneeded files 2019-08-13 15:32:00 -07:00
nv-kkudrynski
1187980309
Merge pull request #157 from NVIDIA/nvpstr/release
updating BERT (single node LAMB support)
2019-08-13 23:33:10 +02:00
Krzysztof Kudrynski
bae6e931bd updating BERT (single node LAMB support) 2019-08-13 23:27:54 +02:00
nv-kkudrynski
7118f12b8a
Merge pull request #155 from NVIDIA/nvpstr/release
updating tacotron2_pyt, resnet_tf and transformer_pyt
2019-08-13 16:00:03 +02:00
Krzysztof Kudrynski
52138acee6 resnet_tf readme fix 2019-08-13 15:51:10 +02:00
Krzysztof Kudrynski
ab85e6cc3d Updating Tacotron2_pyt (BatchNorm init fix), Resnet_tf (cosine LR),
Transformer_pyt (bugfix)
2019-08-13 15:01:10 +02:00
yzhang123
c1c4012166
Merge pull request #1 from NVIDIA/master
update master
2019-08-12 17:22:28 -07:00
Timothy Liu
ee64eb2d9b Update formatting in AMP demo notebook 2019-08-10 12:17:14 +08:00
jconwayNV
55a03b8b4e
Added newline formatting 2019-08-09 17:13:20 -07:00
jconwayNV
0ed7d8a1d9
Added TF API to summary section
https://www.tensorflow.org/api_docs/python/tf/train/experimental/enable_mixed_precision_graph_rewrite
2019-08-09 17:12:20 -07:00
nvpstr
d51938b1e5
Merge pull request #145 from sharathts/master
remove directory check in data download
2019-08-09 23:09:50 +02:00
nvpstr
fb389e4431
Merge pull request #151 from tlkh/master
Added demo notebook for AMP (image classification)
2019-08-09 23:06:26 +02:00
Timothy Liu
564ddc8946 Added demo notebook for AMP (image classification) 2019-08-09 22:00:40 +08:00
HanbumKo
8d365d4b1f
Update inference.py
(mean, std) isn't used in load_image but normalize.
2019-08-06 14:30:04 -07:00
sharatht
803963408a remove directory check in data download 2019-08-02 22:28:02 -07:00
nvpstr
b728d42771
Merge pull request #136 from nv-kkudrynski/master
removing torchhub access through master
2019-07-31 17:13:34 +02:00
nvpstr
29005a4aba
Fixing #131 & correcting links in README
Smandava/readme edit
2019-07-31 17:13:13 +02:00
Swetha Mandava
281beab09d
consistant parameter names
consistant parameter names in run_pretraining.py and optimization.py
2019-07-30 17:24:19 -07:00
Swetha Mandava
438c422813 moving trtis section in readme and github issue 131 2019-07-30 13:35:10 -07:00
Swetha Mandava
c15d09eed6 moving trtis section in readme and github issue 131 2019-07-30 13:27:07 -07:00
Swetha Mandava
2b3b299448 moving trtis section in readme and github issue 131 2019-07-30 13:18:12 -07:00
Swetha Mandava
4b9c90dd67 Merge branch 'master' of https://github.com/swethmandava/DeepLearningExamples 2019-07-30 13:14:49 -07:00
Swetha Mandava
0053b4b888
Merge pull request #1 from NVIDIA/master
Upstream changes
2019-07-30 13:14:33 -07:00
Swetha Mandava
249c51851d Merge branch 'master' of https://github.com/swethmandava/DeepLearningExamples 2019-07-30 13:11:46 -07:00
Krzysztof Kudrynski
ff16b6c649 removing torchhub access through master 2019-07-30 19:30:40 +02:00
nvpstr
d9f925cb9c
Merge pull request #130 from NVIDIA/nvpstr/release
Adding Jasper/PyT
2019-07-30 16:18:27 +02:00
nvpstr
2c66d3525d
Merge pull request #132 from GrzegorzKarchNV/fixed-audio-sample
fixed audio
2019-07-30 16:14:57 +02:00