maxmustermann/DeepLearningExamples

Author	SHA1	Message	Date
Przemek Strzelczyk	15ba45666d	[WideAndDeep] Improved Spark preprocessing scripts performance	2020-05-18 01:11:27 +02:00
Sharath T S	9df464f277	[BERT/PyT] stop and resume, single gpu and timing fixes. (#509 ) * stop and resume, single gpu and timing fixes. * Update utils.py * accumulation features check	2020-05-17 12:46:53 -07:00
Sharath T S	3aae0204c3	1. stop and resume	2020-05-16 22:47:53 -07:00
GrzegorzKarchNV	78aacb4797	Merge pull request #504 from GrzegorzKarchNV/trtis-fixes [Tacotron2] Triton fixes	2020-05-14 22:29:30 +02:00
gkarch	bf2b7a0767	fixed trtis dockerfile	2020-05-14 22:24:54 +02:00
gkarch	8d8337196f	fixed trtis	2020-05-14 19:58:45 +02:00
BO-YANG HSUEH	92ac3236ea	Merge pull request #501 from byshiue/master [FasterTransformer] Fix bug of trt sample.	2020-05-13 17:40:59 +08:00
bhsueh	13e601c6e4	1. Fix the bugs of trt sample codes	2020-05-13 09:38:33 +00:00
GrzegorzKarchNV	6fa33f63c6	Merge pull request #500 from GrzegorzKarchNV/master [Tacotron2] fixed load_and_setup_model in ONNX exports	2020-05-13 11:27:45 +02:00
gkarch	86dc81241c	[Tacotron2] fixed load_and_setup_model in ONNX exports	2020-05-13 11:24:39 +02:00
GrzegorzKarchNV	42ba4eab16	Merge pull request #499 from GrzegorzKarchNV/master [Tacotron2] fixed get_model in train.py	2020-05-13 11:12:11 +02:00
gkarch	1272f6fafa	[Tacotron2] fixed get_model in train.py	2020-05-13 11:08:15 +02:00
jconwayNV	072f2cbb07	Updated some TRTIS references to Triton	2020-05-11 23:18:50 -07:00
PrzemekS	8fecbe7ca7	Merge pull request #420 from rajeevsrao/dev/master-trt-bert-7.0 BERT demo integration for TensorRT-7.0	2020-05-11 19:58:48 +02:00
PrzemekS	b88db70dc1	Merge pull request #448 from NVIDIA/unet_industrial_fixes Update UNet industrial	2020-05-07 11:09:46 +02:00
PrzemekS	af32ac2a23	Merge pull request #456 from NVIDIA/sharathts-patch-1 Fix to load Google's checkpoint	2020-05-07 11:09:20 +02:00
PrzemekS	5175cc77eb	Merge pull request #469 from swethmandava/master Adding DLLogger	2020-05-07 11:08:17 +02:00
PrzemekS	d218a72914	Merge pull request #470 from NVIDIA/unetmed_tf2-loss_fix Replace softmax_cross_entropy_with_logits with binary_crossentropy	2020-05-07 11:07:36 +02:00
GrzegorzKarchNV	2340f70d55	Merge pull request #486 from maggiezha/master add Intel optimization for PyTorch	2020-05-07 08:35:19 +02:00
maggiezha	387f700c4c	add Intel Optimization for PyTorch Intel's optimization for PyTorch on CPU are added, you need to set "export OMP_NUM_THREADS=num physical cores" based on your CPU's core number	2020-05-07 15:57:07 +10:00
maggiezha	150f877e19	adding CPU optimization export OMP_NUM_THREADS=num physical cores export KMP_BLOCKTIME=0 export KMP_AFFINITY=granularity=fine,compact,1,0 https://software.intel.com/content/www/us/en/develop/articles/maximize-tensorflow-performance-on-cpu-considerations-and-recommendations-for-inference.html	2020-05-07 15:51:45 +10:00
GrzegorzKarchNV	67a7d9c4eb	Merge pull request #482 from maggiezha/cpu-run Cpu run	2020-05-06 21:27:26 +02:00
GrzegorzKarchNV	c26a383288	Merge pull request #483 from GrzegorzKarchNV/trt_const_batch constant batch in TensorRT engine build	2020-05-06 15:26:37 +02:00
maggiezha	acf833f7e4	adding support for --cpu-run	2020-05-06 21:57:05 +10:00
maggiezha	3a6b667118	adding support for --cpu-run	2020-05-06 21:52:49 +10:00
maggiezha	342c4710fc	adding support for --cpu-run	2020-05-06 21:48:43 +10:00
maggiezha	0e986cc1f0	adding support for --cpu-run	2020-05-06 21:46:45 +10:00
maggiezha	ca43a1b1e5	adding support for --cpu-run	2020-05-06 21:42:43 +10:00
gkarch	d0d4df70a1	constant batch in TensorRT engine build	2020-05-06 13:01:08 +02:00
maggiezha	a4ce69a5a7	add support for --cpu-run	2020-05-06 20:58:45 +10:00
maggiezha	9268095bb1	add support for --cpu-run	2020-05-06 20:57:10 +10:00
maggiezha	3a11f10bfe	add support for --cpu-run	2020-05-06 20:50:59 +10:00
Swetha Mandava	b7903f0f62	Inference throughput computation removing outliers, eval when eval script is given only	2020-04-29 14:17:22 -07:00
BO-YANG HSUEH	bee3ddfa0e	[FasterTransformer] Fix the bug of Readme.	2020-04-29 16:44:22 +08:00
bhsueh	5ee9b2ec03	1. [FasterTransformer] Fix the bug of encoder trt plugin.	2020-04-29 08:42:43 +00:00
GrzegorzKarchNV	90ce2a9923	Merge pull request #471 from GrzegorzKarchNV/trt-infer-fix fixed trt inference	2020-04-27 16:07:24 +02:00
gkarch	b0ab215441	fixed trt inference	2020-04-27 16:05:33 +02:00
Michał Marcinkiewicz	57b8a6ac3a	Update losses.py Replace softmax_cross_entropy_with_logits with binary_crossentropy	2020-04-26 10:24:08 +02:00
Swetha Mandava	c94b73f9ea	Adding DLLogger and specificying v1.1 in the readme results for clarity	2020-04-24 14:01:53 -07:00
Swetha Mandava	eb8e823c39	Merge pull request #7 from NVIDIA/master Pull from remote	2020-04-24 13:50:28 -07:00
Rajeev Rao	33112129c5	Fix docker build script	2020-04-23 15:40:50 -07:00
Sharath T S	4733603577	[BERT/PyT] Fix squad inference corner case (#462 )	2020-04-20 21:02:18 -07:00
Rajeev Rao	c608b656ea	Python notebook fixes	2020-04-20 17:35:20 -07:00
Rajeev Rao	d0466c06ce	Update performance data for BERT-7.0	2020-04-20 17:35:20 -07:00
Rajeev Rao	ba2840dcf9	Update TensorRT Dockerfile	2020-04-20 17:35:20 -07:00
Rajeev Rao	af69862cfb	Remove deadcode in TensorRT builder Signed-off-by: Rajeev Rao <rajeevrao@nvidia.com>	2020-04-20 17:35:20 -07:00
Rajeev Rao	899c9988f7	Fix BERT/TRT-7.0 regressions - Revert network to use fcPlugin and geluPlugin Signed-off-by: Rajeev Rao <rajeevrao@nvidia.com>	2020-04-20 17:35:20 -07:00
Rajeev Rao	5cac8bcee4	Address review comments Signed-off-by: Rajeev Rao <rajeevrao@nvidia.com>	2020-04-20 17:35:20 -07:00
Rajeev Rao	56a8c20b6b	Fix TRT base container version in README Signed-off-by: Rajeev Rao <rajeevrao@nvidia.com>	2020-04-20 17:35:20 -07:00
Rajeev Rao	0c42d181b7	BERT demo integration for TensorRT-7.0 Signed-off-by: Rajeev Rao <rajeevrao@nvidia.com>	2020-04-20 17:35:20 -07:00

1 2 3 4 5 ...

499 commits