Swetha Mandava
8749cf3952
don't drop remainder in eval, num_shards for data
2019-10-07 19:00:55 -07:00
Swetha Mandava
aac3e7b6e9
adding missing num accum steps to readme
2019-10-07 17:30:04 -07:00
Swetha Mandava
b85439958d
fix logo display on github
2019-10-07 11:35:22 -07:00
Swetha Mandava
b13aeea8d3
readme column mismatch and extra config removed
2019-10-07 11:32:29 -07:00
Swetha Mandava
888e01448e
change folder
2019-09-25 17:39:36 -07:00
Swetha Mandava
f5d1a8ba23
adding colab notebook, Use new TF-TRT AP
2019-09-25 17:29:12 -07:00
Swetha Mandava
df118e7155
changelog
2019-09-23 13:17:16 -07:00
Swetha Mandava
fa42f0ac27
add fp16/fp32 speedup for inference
2019-09-20 13:26:32 -07:00
Swetha Mandava
71fea240de
switch ordering in readme
2019-09-19 22:30:37 -07:00
Swetha Mandava
890fc1c143
fix data dir path in eval
2019-09-19 16:54:11 -07:00
Swetha Mandava
d6c5fef145
adding notebooks and t4 results, cleanup
2019-09-19 10:02:06 -07:00
nvpstr
2112085047
Merge pull request #219 from NVIDIA/nvpstr/jaspertrt
...
[Jasper/PyT] Adding TRT support + jupyter notebooks for inference
2019-09-18 23:44:52 +02:00
Przemek Strzelczyk
f7b0a9c583
[Jasper/PyT] Small README and notebook fixes
2019-09-18 23:38:40 +02:00
Przemek Strzelczyk
2de99b5fa7
[Jasper/PyT] Adding TRT support + jupyter notebooks for inference
2019-09-18 22:05:24 +02:00
IrishCoffee
58e34b4c3d
Merge pull request #216 from NVIDIA/xiaoying_nan
...
refine assert/cmake
2019-09-18 13:03:35 +08:00
xjia
61d96c2020
refine assert/cmake
2019-09-18 03:37:27 +00:00
Szymon Migacz
3014f38a3f
[GNMT PyT] Fix for fp16 training w/o label smoothing (#210)
2019-09-16 10:09:28 +02:00
nvpstr
5c0969c38a
Merge pull request #208 from NVIDIA/nvpstr/nvidia-release-19.08_e
...
[BERT/TF] Multi-node support + LAMB + Fine-Tuning for GLUE
2019-09-13 19:48:00 +02:00
nvpstr
f83166868c
Merge pull request #207 from NVIDIA/nvpstr/release19.08_d
...
Minor fixes to BERT/PyT
2019-09-13 19:23:37 +02:00
Przemek Strzelczyk
a98df279fe
[BERT/TF] Added multi-node support
2019-09-13 19:12:50 +02:00
Przemek Strzelczyk
8b249efad6
Minor fixes to BERT/PyT
2019-09-13 15:23:39 +02:00
jconwayNV
41b55e7c8a
Update main readme to focus on Tensor Cores
2019-09-12 10:09:05 -07:00
nvpstr
d03b139d2c
Updating SSD/PyT
...
* support for DALI 0.12.0
* checkpoint loading fix
* README update
* new results published
2019-09-12 14:46:36 +02:00
Przemek Strzelczyk
1180291973
Merge branch 'gh/master' into nvpstr/release19.08_c
2019-09-12 14:42:58 +02:00
Przemek Strzelczyk
4cce4d88e6
Updating SSD/PyT
2019-09-12 14:33:49 +02:00
nvpstr
22a6f9d99e
Merge pull request #200 from NVIDIA/nvpstr/release19.08_b
...
[BERT/PyT] Support for multi-node
2019-09-11 16:09:31 +02:00
IrishCoffee
07f4eb88cb
Merge pull request #202 from NVIDIA/xiaoying_nan
...
Xiaoying nan
2019-09-11 15:38:48 +08:00
xjia
392d6bee1c
refine softmax/diff
2019-09-11 07:32:50 +00:00
xjia
fd852b56a0
fix softmax max_value
2019-09-11 07:09:53 +00:00
Przemek Strzelczyk
6fe463fe27
[BERT/PyT] Support for multi-node
2019-09-10 17:21:52 +02:00
nvpstr
b07f501eff
Merge pull request #199 from NVIDIA/nvpstr/release19.08_1
...
[Tacotron2] Added denoiser and inference stats, fixed typos
2019-09-10 17:04:58 +02:00
Przemek Strzelczyk
02b49acead
[Tacotron2] Added denoiser and inference stats, fixed typos
2019-09-10 16:22:53 +02:00
nvpstr
da8acb1288
Merge pull request #187 from GrzegorzKarchNV/notebook_19.08
...
added jupyter notebook to Tacotron2
2019-09-03 21:03:35 +02:00
gkarch
b8027d8914
added jupyter notebook
2019-09-03 18:40:29 +02:00
Chris Forster
71e2b22d4a
Update bertPrep.py (#183)
2019-08-29 21:49:02 +02:00
Chris Forster
e72ea6947b
BERT-PyT subprocess for bzip in wikidownloader (#180)
...
* Removing unnecessary subprocess.communicate calls
* Updating Bookscorpus downloader to require less memory
* Renaming variable
2019-08-29 07:21:53 +02:00
Chris Forster
3d3ff3e168
Cleanup and Readme Update (#174)
...
* update perf tables
* remove ide files
* fix tokenizer
* copyrights
* remove .communicate()
* refine training scripts
* fix more typos
2019-08-27 21:44:21 +02:00
IrishCoffee
4850598199
Merge pull request #176 from NVIDIA/xiaoying_nan
...
refine softmax
2019-08-26 17:10:49 +08:00
xjia
488cc11967
refine softmax
2019-08-26 09:08:59 +00:00
Sharath T S
3d59216cec
[BERT] [PyTorch] Data prep fix (#171)
...
* add dgx1-16g and dgx2 specific pretraining instructions
* fix typo in readme
* fix data prep and reflect changes in pretraining
* remove .ide files
* remove data files
* Point to right SQUAD location
* remove garbage [[]]
* default accumulation in fp32
* remove ide files
* fix phase2 DATADIR path
* remove readme in data folder
2019-08-22 07:52:18 +02:00
Sharath T S
b6fb9aa463
[BERT][PyTorch]: add dgx1-16g and dgx2 specific pretraining instructions (#164)
...
* add dgx1-16g and dgx2 specific pretraining instructions
* fix typo in readme
2019-08-21 09:49:32 +02:00
Szymon Migacz
22f122183d
Merge pull request #165 from yzhang123/yzhang123-patch-3
...
novograd jasper hot fix
2019-08-21 08:26:08 +02:00
yzhang123
f84446675e
novograd default parameter fix
2019-08-16 14:07:05 -07:00
yzhang123
5419463c91
fix novograd
2019-08-16 14:05:18 -07:00
yzhang123
2308d9ff62
Merge pull request #2 from NVIDIA/master
...
pull from nvidia
2019-08-16 14:03:28 -07:00
nv-kkudrynski
e25c23e14a
Merge pull request #153 from tlkh/master
...
Update formatting in AMP demo notebook
2019-08-14 13:43:04 +02:00
nv-kkudrynski
9f7616dc54
minor readme fix
2019-08-14 13:30:37 +02:00
Cliff Woolley
b7bf42d76c
Update README.md
...
Fix typo
2019-08-13 16:12:52 -07:00
Cliff Woolley
1fbd997d9f
Merge pull request #158 from NVIDIA/jwoolley/bert-cleanup
...
Minor BERT PyTorch cleanups
2019-08-13 15:47:08 -07:00
Cliff Woolley
608663f6ec
Don't omit the data/ scripts from docker build
2019-08-13 15:41:48 -07:00