Commit graph

472 commits

Author SHA1 Message Date
PrzemekS 5175cc77eb
Merge pull request #469 from swethmandava/master
Adding DLLogger
2020-05-07 11:08:17 +02:00
PrzemekS d218a72914
Merge pull request #470 from NVIDIA/unetmed_tf2-loss_fix
Replace softmax_cross_entropy_with_logits with binary_crossentropy
2020-05-07 11:07:36 +02:00
GrzegorzKarchNV 2340f70d55
Merge pull request #486 from maggiezha/master
add Intel optimization for PyTorch
2020-05-07 08:35:19 +02:00
maggiezha 387f700c4c
add Intel Optimization for PyTorch
Intel's optimizations for PyTorch on CPU are added; set "export OMP_NUM_THREADS=<number of physical cores>" to match the number of physical cores on your CPU
2020-05-07 15:57:07 +10:00
maggiezha 150f877e19
adding CPU optimization
export OMP_NUM_THREADS=<number of physical cores>
export KMP_BLOCKTIME=0
export KMP_AFFINITY=granularity=fine,compact,1,0
https://software.intel.com/content/www/us/en/develop/articles/maximize-tensorflow-performance-on-cpu-considerations-and-recommendations-for-inference.html
2020-05-07 15:51:45 +10:00
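The three settings in the commit message above can also be applied from Python before the framework is imported. The following is a minimal sketch, not code from this repository: the helper names `intel_cpu_env` and `apply_env` and the core count of 8 are hypothetical, and it assumes the variables are read when the OpenMP runtime initializes, so they should be set before importing PyTorch or TensorFlow.

```python
import os


def intel_cpu_env(physical_cores: int) -> dict:
    """Build the environment settings listed in the commit message.

    physical_cores is supplied by the caller; this sketch does not
    try to detect it (os.cpu_count() reports logical CPUs, which can
    be higher than the physical core count on hyper-threaded machines).
    """
    if physical_cores < 1:
        raise ValueError("physical_cores must be >= 1")
    return {
        "OMP_NUM_THREADS": str(physical_cores),
        "KMP_BLOCKTIME": "0",
        "KMP_AFFINITY": "granularity=fine,compact,1,0",
    }


def apply_env(settings: dict) -> None:
    """Copy the settings into this process's environment."""
    os.environ.update(settings)


if __name__ == "__main__":
    # Hypothetical value: replace with your machine's physical core count.
    PHYSICAL_CORES = 8
    apply_env(intel_cpu_env(PHYSICAL_CORES))
    for name in ("OMP_NUM_THREADS", "KMP_BLOCKTIME", "KMP_AFFINITY"):
        print(f"{name}={os.environ[name]}")
```

Passing the core count explicitly keeps the sketch honest about what it knows; detecting physical cores reliably generally needs a platform-specific tool or a third-party package.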
GrzegorzKarchNV 67a7d9c4eb
Merge pull request #482 from maggiezha/cpu-run
Cpu run
2020-05-06 21:27:26 +02:00
GrzegorzKarchNV c26a383288
Merge pull request #483 from GrzegorzKarchNV/trt_const_batch
constant batch in TensorRT engine build
2020-05-06 15:26:37 +02:00
maggiezha acf833f7e4
adding support for --cpu-run 2020-05-06 21:57:05 +10:00
maggiezha 3a6b667118
adding support for --cpu-run 2020-05-06 21:52:49 +10:00
maggiezha 342c4710fc
adding support for --cpu-run 2020-05-06 21:48:43 +10:00
maggiezha 0e986cc1f0
adding support for --cpu-run 2020-05-06 21:46:45 +10:00
maggiezha ca43a1b1e5
adding support for --cpu-run 2020-05-06 21:42:43 +10:00
gkarch d0d4df70a1 constant batch in TensorRT engine build 2020-05-06 13:01:08 +02:00
maggiezha a4ce69a5a7
add support for --cpu-run 2020-05-06 20:58:45 +10:00
maggiezha 9268095bb1
add support for --cpu-run 2020-05-06 20:57:10 +10:00
maggiezha 3a11f10bfe
add support for --cpu-run 2020-05-06 20:50:59 +10:00
Swetha Mandava b7903f0f62 Inference throughput computation with outliers removed; eval only when an eval script is given 2020-04-29 14:17:22 -07:00
BO-YANG HSUEH bee3ddfa0e
[FasterTransformer] Fix the bug in the README. 2020-04-29 16:44:22 +08:00
bhsueh 5ee9b2ec03 1. [FasterTransformer] Fix the bug in the encoder TRT plugin. 2020-04-29 08:42:43 +00:00
GrzegorzKarchNV 90ce2a9923
Merge pull request #471 from GrzegorzKarchNV/trt-infer-fix
fixed trt inference
2020-04-27 16:07:24 +02:00
gkarch b0ab215441 fixed trt inference 2020-04-27 16:05:33 +02:00
Michał Marcinkiewicz 57b8a6ac3a
Update losses.py
Replace softmax_cross_entropy_with_logits with binary_crossentropy
2020-04-26 10:24:08 +02:00
Swetha Mandava c94b73f9ea Adding DLLogger and specifying v1.1 in the readme results for clarity 2020-04-24 14:01:53 -07:00
Swetha Mandava eb8e823c39
Merge pull request #7 from NVIDIA/master
Pull from remote
2020-04-24 13:50:28 -07:00
Sharath T S 4733603577
[BERT/PyT] Fix squad inference corner case (#462) 2020-04-20 21:02:18 -07:00
GrzegorzKarchNV a0ccd2c7f1
Merge pull request #461 from GrzegorzKarchNV/trt-fix
Tacotron2: fixing trt tests
2020-04-20 17:03:53 +02:00
gkarch 063de87218 fixing trt tests 2020-04-20 17:01:35 +02:00
PrzemekS 342d2e7649
Merge pull request #455 from NVIDIA/unetmed_tf2-fix_dockerfile
Update Dockerfile
2020-04-13 19:36:39 +02:00
Michał Marcinkiewicz 1f04b9fe4d
Update Dockerfile 2020-04-13 19:32:09 +02:00
PrzemekS 1cad180164
Merge pull request #453 from NVIDIA/nvpstr/87ec80
Nvpstr/87ec80
2020-04-09 07:11:48 +02:00
Przemek Strzelczyk 87ec806d7a Update README.md 2020-04-08 09:21:01 -07:00
Przemek Strzelczyk 15807b36bf Adding DLRM/PyT 2020-04-08 18:17:57 +02:00
Przemek Strzelczyk 4f0f43b9a5 Adding Transformer-XL/TF 2020-04-08 15:21:57 +02:00
Przemek Strzelczyk d996f54542 [Wide&Deep/TF] README updates; results for 50 runs and small fixes 2020-04-08 15:12:25 +02:00
Sharath T S 5626846924
[BERT/PyT] Revert from native gelu. Breaks ONNX export. (#447) 2020-04-07 10:28:45 -07:00
PrzemekS 19ead708fb
Merge pull request #446 from swethmandava/master
Readme fixes
2020-04-06 18:04:13 +02:00
jconwayNV bad12e8f4e
Added hyperlinks to TRTIS content in Feature Matrix 2020-04-05 21:27:37 -07:00
Swetha Mandava c57e4d2b08 readme fixes 2020-04-03 15:02:45 -07:00
Swetha Mandava e5428044f7
Merge pull request #6 from NVIDIA/master
update
2020-04-03 14:53:57 -07:00
GrzegorzKarchNV c742ed42dc
Merge pull request #444 from GrzegorzKarchNV/fixing-alerts-20.03
fixed alerts from https://lgtm.com/projects/g/NVIDIA/DeepLearningExam
2020-04-03 19:05:31 +02:00
gkarch 1f7950aa9d fixed alerts from https://lgtm.com/projects/g/NVIDIA/DeepLearningExamples/rev/pr-8d221f6760830499933042c03fcd83605adbd98e 2020-04-03 18:28:20 +02:00
GrzegorzKarchNV 00ca1c4bcf
Update README.md 2020-04-03 13:44:16 +02:00
bhsueh f0809df478 1. Update the README. 2020-04-03 09:07:20 +00:00
GrzegorzKarchNV 63696a55ce
Merge pull request #442 from NVIDIA/nvpstr/5e3b487b8
[Tacotron2/PyT] custom TensorRT backend + fixes + demo
2020-04-02 17:59:14 +02:00
Przemek Strzelczyk 5e3b487b89 [Tacotron2/PyT] custom TensorRT backend on TensorRT Inference Server; Conversational AI demo; fixed checkpoint loading; fixed FP16 export to TensorRT 2020-04-02 17:18:26 +02:00
PrzemekS 157a3acaa9
Merge pull request #441 from NVIDIA/nvpstr/26c267610
[BERT/PyT] Triton Inference Server support
2020-04-02 15:31:55 +02:00
Przemek Strzelczyk 26c2676104 [BERT/PyT] Triton Inference Server support 2020-04-02 14:39:24 +02:00
PrzemekS c007e8e6dc
Merge pull request #439 from NVIDIA/UNet_Med_fix_pillow
Fix requirements for UNet Medical
2020-04-02 12:34:45 +02:00
Pablo Ribalta Lorenzo c11a89aa76 Fix Unet tf2
Signed-off-by: Pablo Ribalta Lorenzo <pribalta@nvidia.com>
2020-04-02 07:20:58 +00:00
Pablo Ribalta Lorenzo e9fc30acb8 Fix requirements
Signed-off-by: Pablo Ribalta Lorenzo <pribalta@nvidia.com>
2020-04-02 07:16:47 +00:00