Commit graph

626 commits

Author SHA1 Message Date
gkarch 2480060f2a added tcmalloc in Dockerfile 2020-05-27 15:26:19 +02:00
PrzemekS dd9f5c401c
Create README.md 2020-05-26 19:30:04 +02:00
PrzemekS 7a2f7d4a55
Merge pull request #516 from NVIDIA/nvptr/6d7283
Adding FastPitch/PyT (modified version of FastSpeech)
2020-05-26 18:55:40 +02:00
GrzegorzKarchNV 0e1622f745
Merge pull request #525 from gloriouskilka/master
Tacotron2 --cpu-run fix
2020-05-22 11:16:34 +02:00
Ilya Shutov 680ccfcf89 Tacotron2 --cpu-run fix 2020-05-21 21:05:24 +07:00
Yang Zhang cbde541082
fix trt benchmark column names
2nd time to fix trt benchmark column names, since it was overwritten by another commit
1st time washttps://github.com/NVIDIA/DeepLearningExamples/pull/267
2020-05-20 10:11:41 -07:00
Yang Zhang 726e32610c
Merge pull request #6 from NVIDIA/master
update
2020-05-20 10:01:07 -07:00
GrzegorzKarchNV 3927e1334a
Merge pull request #519 from GrzegorzKarchNV/master
[Tacotron2] updates in trtis_cpp
2020-05-19 15:07:56 +02:00
gkarch 3bddd5df99 updates MR78 2020-05-19 15:06:13 +02:00
t-kusanagi a685b398bc Fix [PyT/DLRM] bug of model.py 2020-05-19 00:15:12 +00:00
Swetha Mandava daaab9ea6a
update run_classifier.py typo 2020-05-18 12:19:01 -07:00
Swetha Mandava f3786969a3
Merge pull request #502 from swethmandava/master
BERT module update
2020-05-18 10:27:19 -07:00
Swetha Mandava 58763147ff fixing xla fragmentation in latest container 2020-05-18 10:25:13 -07:00
Swetha Mandava bd5ce85888 Updating notebooks to have relative paths instead of absolute 2020-05-18 10:24:28 -07:00
Przemek Strzelczyk 6d72839a6c Adding FastPitch/PyT (modified version of FastSpeech) 2020-05-18 18:49:00 +02:00
GrzegorzKarchNV 06e39168fb
Merge pull request #513 from GrzegorzKarchNV/trtis_cpp-update
[Tacotron2] updating trtis_cpp
2020-05-18 12:40:45 +02:00
gkarch 1cfa0ecffb small fixes 2020-05-18 12:08:59 +02:00
gkarch 53e7e4f130 updating trtis_cpp 2020-05-18 11:04:16 +02:00
PrzemekS e3a110be6b
Merge pull request #511 from NVIDIA/nvpstr/15ba456
[WideAndDeep] Improved Spark preprocessing scripts performance
2020-05-18 01:14:09 +02:00
Przemek Strzelczyk 15ba45666d [WideAndDeep] Improved Spark preprocessing scripts performance 2020-05-18 01:11:27 +02:00
Sharath T S 9df464f277
[BERT/PyT] stop and resume, single gpu and timing fixes. (#509)
* stop and resume, single gpu and timing fixes.

* Update utils.py

* accumulation features check
2020-05-17 12:46:53 -07:00
Sharath T S 3aae0204c3
1. stop and resume 2020-05-16 22:47:53 -07:00
Swetha Mandava 942b09611c change default dllog path, disable horovod for 1 gpu 2020-05-14 14:25:37 -07:00
GrzegorzKarchNV 78aacb4797
Merge pull request #504 from GrzegorzKarchNV/trtis-fixes
[Tacotron2] Triton fixes
2020-05-14 22:29:30 +02:00
gkarch bf2b7a0767 fixed trtis dockerfile 2020-05-14 22:24:54 +02:00
gkarch 8d8337196f fixed trtis 2020-05-14 19:58:45 +02:00
Swetha Mandava cfc2395057 merge conflicts 2020-05-13 10:54:53 -07:00
BO-YANG HSUEH 92ac3236ea
Merge pull request #501 from byshiue/master
[FasterTransformer] Fix bug of trt sample.
2020-05-13 17:40:59 +08:00
bhsueh 13e601c6e4 1. Fix the bugs of trt sample codes 2020-05-13 09:38:33 +00:00
GrzegorzKarchNV 6fa33f63c6
Merge pull request #500 from GrzegorzKarchNV/master
[Tacotron2] fixed load_and_setup_model in ONNX exports
2020-05-13 11:27:45 +02:00
gkarch 86dc81241c [Tacotron2] fixed load_and_setup_model in ONNX exports 2020-05-13 11:24:39 +02:00
GrzegorzKarchNV 42ba4eab16
Merge pull request #499 from GrzegorzKarchNV/master
[Tacotron2] fixed get_model in train.py
2020-05-13 11:12:11 +02:00
gkarch 1272f6fafa [Tacotron2] fixed get_model in train.py 2020-05-13 11:08:15 +02:00
Swetha Mandava 398dc781a1 amp env variable to amp api 2020-05-12 22:18:05 -07:00
Swetha Mandava 58a3ed6bab trtis to triton update 2020-05-12 22:17:33 -07:00
Swetha Mandava edef331894
Merge pull request #8 from NVIDIA/master
Update
2020-05-12 22:14:59 -07:00
jconwayNV 072f2cbb07
Updated some TRTIS references to Triton 2020-05-11 23:18:50 -07:00
PrzemekS 8fecbe7ca7
Merge pull request #420 from rajeevsrao/dev/master-trt-bert-7.0
BERT demo integration for TensorRT-7.0
2020-05-11 19:58:48 +02:00
PrzemekS b88db70dc1
Merge pull request #448 from NVIDIA/unet_industrial_fixes
Update UNet industrial
2020-05-07 11:09:46 +02:00
PrzemekS af32ac2a23
Merge pull request #456 from NVIDIA/sharathts-patch-1
Fix to load Google's checkpoint
2020-05-07 11:09:20 +02:00
PrzemekS 5175cc77eb
Merge pull request #469 from swethmandava/master
Adding DLLogger
2020-05-07 11:08:17 +02:00
PrzemekS d218a72914
Merge pull request #470 from NVIDIA/unetmed_tf2-loss_fix
Replace softmax_cross_entropy_with_logits with binary_crossentropy
2020-05-07 11:07:36 +02:00
GrzegorzKarchNV 2340f70d55
Merge pull request #486 from maggiezha/master
add Intel optimization for PyTorch
2020-05-07 08:35:19 +02:00
maggiezha 387f700c4c
add Intel Optimization for PyTorch
Intel's optimization for PyTorch on CPU are added, you need to set "export OMP_NUM_THREADS=num physical cores" based on your CPU's core number
2020-05-07 15:57:07 +10:00
maggiezha 150f877e19
adding CPU optimization
export OMP_NUM_THREADS=num physical cores
export KMP_BLOCKTIME=0
export KMP_AFFINITY=granularity=fine,compact,1,0
https://software.intel.com/content/www/us/en/develop/articles/maximize-tensorflow-performance-on-cpu-considerations-and-recommendations-for-inference.html
2020-05-07 15:51:45 +10:00
GrzegorzKarchNV 67a7d9c4eb
Merge pull request #482 from maggiezha/cpu-run
Cpu run
2020-05-06 21:27:26 +02:00
GrzegorzKarchNV c26a383288
Merge pull request #483 from GrzegorzKarchNV/trt_const_batch
constant batch in TensorRT engine build
2020-05-06 15:26:37 +02:00
maggiezha acf833f7e4
adding support for --cpu-run 2020-05-06 21:57:05 +10:00
maggiezha 3a6b667118
adding support for --cpu-run 2020-05-06 21:52:49 +10:00
maggiezha 342c4710fc
adding support for --cpu-run 2020-05-06 21:48:43 +10:00