gkarch
|
1cfa0ecffb
|
small fixes
|
2020-05-18 12:08:59 +02:00 |
|
gkarch
|
53e7e4f130
|
updating trtis_cpp
|
2020-05-18 11:04:16 +02:00 |
|
Sharath T S
|
9df464f277
|
[BERT/PyT] stop and resume, single gpu and timing fixes. (#509)
* stop and resume, single gpu and timing fixes.
* Update utils.py
* accumulation features check
|
2020-05-17 12:46:53 -07:00 |
|
Sharath T S
|
3aae0204c3
|
1. stop and resume
|
2020-05-16 22:47:53 -07:00 |
|
gkarch
|
bf2b7a0767
|
fixed trtis dockerfile
|
2020-05-14 22:24:54 +02:00 |
|
gkarch
|
8d8337196f
|
fixed trtis
|
2020-05-14 19:58:45 +02:00 |
|
gkarch
|
86dc81241c
|
[Tacotron2] fixed load_and_setup_model in ONNX exports
|
2020-05-13 11:24:39 +02:00 |
|
gkarch
|
1272f6fafa
|
[Tacotron2] fixed get_model in train.py
|
2020-05-13 11:08:15 +02:00 |
|
PrzemekS
|
af32ac2a23
|
Merge pull request #456 from NVIDIA/sharathts-patch-1
Fix to load Google's checkpoint
|
2020-05-07 11:09:20 +02:00 |
|
maggiezha
|
387f700c4c
|
add Intel Optimization for PyTorch
Intel's optimization for PyTorch on CPU are added, you need to set "export OMP_NUM_THREADS=num physical cores" based on your CPU's core number
|
2020-05-07 15:57:07 +10:00 |
|
maggiezha
|
150f877e19
|
adding CPU optimization
export OMP_NUM_THREADS=num physical cores
export KMP_BLOCKTIME=0
export KMP_AFFINITY=granularity=fine,compact,1,0
https://software.intel.com/content/www/us/en/develop/articles/maximize-tensorflow-performance-on-cpu-considerations-and-recommendations-for-inference.html
|
2020-05-07 15:51:45 +10:00 |
|
GrzegorzKarchNV
|
67a7d9c4eb
|
Merge pull request #482 from maggiezha/cpu-run
Cpu run
|
2020-05-06 21:27:26 +02:00 |
|
maggiezha
|
acf833f7e4
|
adding support for --cpu-run
|
2020-05-06 21:57:05 +10:00 |
|
maggiezha
|
3a6b667118
|
adding support for --cpu-run
|
2020-05-06 21:52:49 +10:00 |
|
maggiezha
|
342c4710fc
|
adding support for --cpu-run
|
2020-05-06 21:48:43 +10:00 |
|
maggiezha
|
0e986cc1f0
|
adding support for --cpu-run
|
2020-05-06 21:46:45 +10:00 |
|
maggiezha
|
ca43a1b1e5
|
adding support for --cpu-run
|
2020-05-06 21:42:43 +10:00 |
|
gkarch
|
d0d4df70a1
|
constant batch in TensorRT engine build
|
2020-05-06 13:01:08 +02:00 |
|
maggiezha
|
a4ce69a5a7
|
add support for --cpu-run
|
2020-05-06 20:58:45 +10:00 |
|
maggiezha
|
9268095bb1
|
add support for --cpu-run
|
2020-05-06 20:57:10 +10:00 |
|
maggiezha
|
3a11f10bfe
|
add support for --cpu-run
|
2020-05-06 20:50:59 +10:00 |
|
gkarch
|
b0ab215441
|
fixed trt inference
|
2020-04-27 16:05:33 +02:00 |
|
Sharath T S
|
4733603577
|
[BERT/PyT] Fix squad inference corner case (#462)
|
2020-04-20 21:02:18 -07:00 |
|
gkarch
|
063de87218
|
fixing trt tests
|
2020-04-20 17:01:35 +02:00 |
|
Sharath T S
|
a9c997cd57
|
Fix to load Google's checkpoint
|
2020-04-13 11:34:56 -07:00 |
|
PrzemekS
|
1cad180164
|
Merge pull request #453 from NVIDIA/nvpstr/87ec80
Nvpstr/87ec80
|
2020-04-09 07:11:48 +02:00 |
|
Przemek Strzelczyk
|
15807b36bf
|
Adding DLRM/PyT
|
2020-04-08 18:17:57 +02:00 |
|
Sharath T S
|
5626846924
|
[BERT/PyT] Revert from native gelu. Breaks ONNX export. (#447)
|
2020-04-07 10:28:45 -07:00 |
|
gkarch
|
1f7950aa9d
|
fixed alerts from https://lgtm.com/projects/g/NVIDIA/DeepLearningExamples/rev/pr-8d221f6760830499933042c03fcd83605adbd98e
|
2020-04-03 18:28:20 +02:00 |
|
GrzegorzKarchNV
|
00ca1c4bcf
|
Update README.md
|
2020-04-03 13:44:16 +02:00 |
|
Przemek Strzelczyk
|
5e3b487b89
|
[Tacotron2/PyT] custom TensorRT backend on TensorRT Inference Server; Conversional AI demo; fixed checkpoints loading; fixed FP16 export to TensorRT
|
2020-04-02 17:18:26 +02:00 |
|
Przemek Strzelczyk
|
26c2676104
|
[BERT/PyT] Triton Inference Server support
|
2020-04-02 14:39:24 +02:00 |
|
Sharath T S
|
793b92dca7
|
[BERT/PyT] fp32 and allreduce_post_accumulation compatibility (#422)
|
2020-03-15 23:03:06 -07:00 |
|
PrzemekS
|
b03375bd6c
|
Update README.md
|
2020-03-12 19:09:18 +01:00 |
|
PrzemekS
|
adeed4fcbc
|
Update README.md
|
2020-03-12 19:06:10 +01:00 |
|
PrzemekS
|
1c77a04548
|
Merge pull request #403 from yzhang123/trt_dynamic_shape_update
Jasper TRT Fix
|
2020-03-09 17:28:31 +01:00 |
|
Przemek Strzelczyk
|
96ff411ce8
|
[BERT/PyT] Typo in README
|
2020-03-05 09:54:16 +01:00 |
|
Rajeev Rao
|
4f42950e36
|
Initialize CUDA state before loading TRT engines in Tacotron sample
|
2020-03-03 09:56:54 -08:00 |
|
Przemek Strzelczyk
|
77a1bb917a
|
[Tacotron2/PyT] Updates: better perf, better trt7 support, new logging, bug fixes
|
2020-02-28 15:36:14 +01:00 |
|
Przemek Strzelczyk
|
155578a762
|
[BERT/PyT] New logging and some README updates
|
2020-02-28 13:21:20 +01:00 |
|
Yang Zhang
|
c72df2be4a
|
fix syntax error
|
2020-02-14 15:49:54 -08:00 |
|
Yang Zhang
|
b738d1d1af
|
fix trt regression bug
Signed-off-by: Yang Zhang <yangzhang@nvidia.com>
|
2020-02-14 15:44:26 -08:00 |
|
PrzemekS
|
ce73b32068
|
Merge pull request #392 from NVIDIA/nvpstr/1def26
Updating BERT/TF, Transformer-XL and NCF/PyT
|
2020-02-06 19:47:30 +01:00 |
|
Przemek Strzelczyk
|
a38deff61e
|
[Transformer-XL/PyT] Large model support; multi-node training; inference with TorchScript
|
2020-02-05 22:38:46 +01:00 |
|
Przemek Strzelczyk
|
1def26d80c
|
[NCF/PyT] Adding new logging
|
2020-02-05 21:58:04 +01:00 |
|
Sharath T S
|
ad88003e13
|
[BERT/PyT] Glue(MRPC) fine-tuning with LAMB pretrained checkpoint
* LAMB checkpoint compatibility
* LAMB checkpoint compatibility; amp training
|
2020-02-03 16:27:11 -08:00 |
|
Sharath T S
|
119838f1f6
|
Bugfix in BertAdam for fp32 finetuning (#388)
|
2020-01-30 20:11:03 -08:00 |
|
PrzemekS
|
aa061052c6
|
Merge pull request #345 from nvcforster/master
Updating BERT Readme Docs
|
2020-01-02 14:35:59 +01:00 |
|
Krzysztof Kudrynski
|
f0ef8493eb
|
ConvNets update
|
2019-12-20 14:54:58 +01:00 |
|
gkarch
|
480f1f9811
|
update perf table for taco2 trt
|
2019-12-18 11:45:17 +01:00 |
|