Commit graph

187 commits

Author SHA1 Message Date
Swetha Mandava c2051b49eb pulling v1 tritonserver 2020-07-20 13:23:04 -07:00
Swetha Mandava 26e47a1083 update triton for amp 2020-07-20 13:20:09 -07:00
Krzysztof Kudrynski 878b004e6b [NCF/TF, WideAndDeep/TF] Updating for Ampere, [BERT/TF] MRPC support 2020-07-14 12:34:25 +02:00
Krzysztof Kudrynski 40c3be6e9b [Transformer-XL/TF] Updated perf table 2020-07-09 11:09:15 +02:00
nv-kkudrynski 37672df8f7 Merge pull request #584 from peri044/rn50_qat_v2: Add quantization aware training (QAT) support for Resnet 50 2020-07-08 13:00:47 +02:00
nv-kkudrynski 2729732c31 Update README.md 2020-07-08 12:59:12 +02:00
Krzysztof Kudrynski 33bdf65b18 readme fixes 2020-07-08 12:55:56 +02:00
Krzysztof Kudrynski 1e35179e96 [SSD/TF] Updating for Ampere 2020-07-08 00:10:36 +02:00
Dheeraj Peri 4cb58b61c7 Add licenses to new files 2020-07-06 12:01:17 -07:00
Przemek Strzelczyk 79d4ced0be Adding 3DUnet/TF 2020-07-04 03:28:33 +02:00
Przemek Strzelczyk 6e12b5ab8a [Transformer-XL/TF] Updating for Ampere 2020-07-04 02:41:39 +02:00
Przemek Strzelczyk 9f4678de1b [UNet industrial/TF] Updating for Ampere 2020-07-04 02:17:58 +02:00
Przemek Strzelczyk b27abeba07 [UNet_medical/TF1&2] Updating for Ampere 2020-07-04 01:42:09 +02:00
Przemek Strzelczyk 76a056cd33 [VNet/TF] Updating for 20.06 container 2020-07-04 01:37:11 +02:00
Przemek Strzelczyk 96138d5087 [BERT/TF] Updating for Ampere 2020-07-04 01:00:48 +02:00
Dheeraj Peri 57d9cc0444 Update instructions 2020-07-01 13:19:26 -07:00
Dheeraj Peri 00bde50081 Remove folder for QAT 2020-07-01 13:16:58 -07:00
Dheeraj Peri ed7831daa0 Update instructions and output node name 2020-07-01 13:09:39 -07:00
Dheeraj Peri da2e33a67d Merge branch 'master' into rn50_qat_v2 2020-07-01 11:50:10 -07:00
Dheeraj Peri 6d2357a9b8 code Refactor 2020-07-01 11:42:19 -07:00
Dheeraj Peri e0f399def4 Update frozen graph script and instructions 2020-06-30 11:49:00 -07:00
Przemek Strzelczyk 4eaa4434de [ConvNets/TF] Adding support for Ampere 2020-06-27 09:24:41 +02:00
Dheeraj Peri c4f90be499 Add QAT instructions for RN50 2020-06-21 16:41:23 -07:00
Krzysztof Kudrynski f11884b38a minor readme updates 2020-06-12 13:50:44 +02:00
Przemek Strzelczyk fed7ba99cd [ConvNets/TF] Updating RN50, Adding ResNext and SE-ResNext 2020-06-12 12:38:25 +02:00
Przemek Strzelczyk 23cc1cd5bb [FastPitch/PyT] Small README fixes 2020-06-12 12:13:05 +02:00
Swetha Mandava 57d1a91274 Merge pull request #532 from swethmandava/master: Bert TF Patch - correcting triton image names to match README.md 2020-05-27 14:59:39 -07:00
Swetha Mandava bf1b5eedb1 correcting image names, changing throughput computations 2020-05-27 14:57:37 -07:00
Swetha Mandava daaab9ea6a update run_classifier.py typo 2020-05-18 12:19:01 -07:00
Swetha Mandava f3786969a3 Merge pull request #502 from swethmandava/master: BERT module update 2020-05-18 10:27:19 -07:00
Swetha Mandava 58763147ff fixing xla fragmentation in latest container 2020-05-18 10:25:13 -07:00
Swetha Mandava bd5ce85888 Updating notebooks to have relative paths instead of absolute 2020-05-18 10:24:28 -07:00
Przemek Strzelczyk 15ba45666d [WideAndDeep] Improved Spark preprocessing scripts performance 2020-05-18 01:11:27 +02:00
Swetha Mandava 942b09611c change default dllog path, disable horovod for 1 gpu 2020-05-14 14:25:37 -07:00
Swetha Mandava cfc2395057 merge conflicts 2020-05-13 10:54:53 -07:00
Swetha Mandava 398dc781a1 amp env variable to amp api 2020-05-12 22:18:05 -07:00
Swetha Mandava 58a3ed6bab trtis to triton update 2020-05-12 22:17:33 -07:00
PrzemekS 8fecbe7ca7 Merge pull request #420 from rajeevsrao/dev/master-trt-bert-7.0: BERT demo integration for TensorRT-7.0 2020-05-11 19:58:48 +02:00
PrzemekS b88db70dc1 Merge pull request #448 from NVIDIA/unet_industrial_fixes: Update UNet industrial 2020-05-07 11:09:46 +02:00
Swetha Mandava b7903f0f62 Inference throughput computation removing outliers, eval when eval script is given only 2020-04-29 14:17:22 -07:00
Swetha Mandava c94b73f9ea Adding DLLogger and specificying v1.1 in the readme results for clarity 2020-04-24 14:01:53 -07:00
Rajeev Rao 33112129c5 Fix docker build script 2020-04-23 15:40:50 -07:00
Rajeev Rao c608b656ea Python notebook fixes 2020-04-20 17:35:20 -07:00
Rajeev Rao d0466c06ce Update performance data for BERT-7.0 2020-04-20 17:35:20 -07:00
Rajeev Rao ba2840dcf9 Update TensorRT Dockerfile 2020-04-20 17:35:20 -07:00
Rajeev Rao af69862cfb Remove deadcode in TensorRT builder (Signed-off-by: Rajeev Rao <rajeevrao@nvidia.com>) 2020-04-20 17:35:20 -07:00
Rajeev Rao 899c9988f7 Fix BERT/TRT-7.0 regressions: Revert network to use fcPlugin and geluPlugin (Signed-off-by: Rajeev Rao <rajeevrao@nvidia.com>) 2020-04-20 17:35:20 -07:00
Rajeev Rao 5cac8bcee4 Address review comments (Signed-off-by: Rajeev Rao <rajeevrao@nvidia.com>) 2020-04-20 17:35:20 -07:00
Rajeev Rao 56a8c20b6b Fix TRT base container version in README (Signed-off-by: Rajeev Rao <rajeevrao@nvidia.com>) 2020-04-20 17:35:20 -07:00
Rajeev Rao 0c42d181b7 BERT demo integration for TensorRT-7.0 (Signed-off-by: Rajeev Rao <rajeevrao@nvidia.com>) 2020-04-20 17:35:20 -07:00