maxmustermann/DeepLearningExamples

Author	SHA1	Message	Date
PrzemekS	af32ac2a23	Merge pull request #456 from NVIDIA/sharathts-patch-1 Fix to load Google's checkpoint	2020-05-07 11:09:20 +02:00
maggiezha	387f700c4c	add Intel Optimization for PyTorch Intel's optimization for PyTorch on CPU are added, you need to set "export OMP_NUM_THREADS=num physical cores" based on your CPU's core number	2020-05-07 15:57:07 +10:00
maggiezha	150f877e19	adding CPU optimization export OMP_NUM_THREADS=num physical cores export KMP_BLOCKTIME=0 export KMP_AFFINITY=granularity=fine,compact,1,0 https://software.intel.com/content/www/us/en/develop/articles/maximize-tensorflow-performance-on-cpu-considerations-and-recommendations-for-inference.html	2020-05-07 15:51:45 +10:00
GrzegorzKarchNV	67a7d9c4eb	Merge pull request #482 from maggiezha/cpu-run Cpu run	2020-05-06 21:27:26 +02:00
maggiezha	acf833f7e4	adding support for --cpu-run	2020-05-06 21:57:05 +10:00
maggiezha	3a6b667118	adding support for --cpu-run	2020-05-06 21:52:49 +10:00
maggiezha	342c4710fc	adding support for --cpu-run	2020-05-06 21:48:43 +10:00
maggiezha	0e986cc1f0	adding support for --cpu-run	2020-05-06 21:46:45 +10:00
maggiezha	ca43a1b1e5	adding support for --cpu-run	2020-05-06 21:42:43 +10:00
gkarch	d0d4df70a1	constant batch in TensorRT engine build	2020-05-06 13:01:08 +02:00
maggiezha	a4ce69a5a7	add support for --cpu-run	2020-05-06 20:58:45 +10:00
maggiezha	9268095bb1	add support for --cpu-run	2020-05-06 20:57:10 +10:00
maggiezha	3a11f10bfe	add support for --cpu-run	2020-05-06 20:50:59 +10:00
gkarch	b0ab215441	fixed trt inference	2020-04-27 16:05:33 +02:00
Sharath T S	4733603577	[BERT/PyT] Fix squad inference corner case (#462 )	2020-04-20 21:02:18 -07:00
gkarch	063de87218	fixing trt tests	2020-04-20 17:01:35 +02:00
Sharath T S	a9c997cd57	Fix to load Google's checkpoint	2020-04-13 11:34:56 -07:00
PrzemekS	1cad180164	Merge pull request #453 from NVIDIA/nvpstr/87ec80 Nvpstr/87ec80	2020-04-09 07:11:48 +02:00
Przemek Strzelczyk	15807b36bf	Adding DLRM/PyT	2020-04-08 18:17:57 +02:00
Sharath T S	5626846924	[BERT/PyT] Revert from native gelu. Breaks ONNX export. (#447 )	2020-04-07 10:28:45 -07:00
gkarch	1f7950aa9d	fixed alerts from https://lgtm.com/projects/g/NVIDIA/DeepLearningExamples/rev/pr-8d221f6760830499933042c03fcd83605adbd98e	2020-04-03 18:28:20 +02:00
GrzegorzKarchNV	00ca1c4bcf	Update README.md	2020-04-03 13:44:16 +02:00
Przemek Strzelczyk	5e3b487b89	[Tacotron2/PyT] custom TensorRT backend on TensorRT Inference Server; Conversional AI demo; fixed checkpoints loading; fixed FP16 export to TensorRT	2020-04-02 17:18:26 +02:00
Przemek Strzelczyk	26c2676104	[BERT/PyT] Triton Inference Server support	2020-04-02 14:39:24 +02:00
Sharath T S	793b92dca7	[BERT/PyT] fp32 and allreduce_post_accumulation compatibility (#422 )	2020-03-15 23:03:06 -07:00
PrzemekS	b03375bd6c	Update README.md	2020-03-12 19:09:18 +01:00
PrzemekS	adeed4fcbc	Update README.md	2020-03-12 19:06:10 +01:00
PrzemekS	1c77a04548	Merge pull request #403 from yzhang123/trt_dynamic_shape_update Jasper TRT Fix	2020-03-09 17:28:31 +01:00
Przemek Strzelczyk	96ff411ce8	[BERT/PyT] Typo in README	2020-03-05 09:54:16 +01:00
Rajeev Rao	4f42950e36	Initialize CUDA state before loading TRT engines in Tacotron sample	2020-03-03 09:56:54 -08:00
Przemek Strzelczyk	77a1bb917a	[Tacotron2/PyT] Updates: better perf, better trt7 support, new logging, bug fixes	2020-02-28 15:36:14 +01:00
Przemek Strzelczyk	155578a762	[BERT/PyT] New logging and some README updates	2020-02-28 13:21:20 +01:00
Yang Zhang	c72df2be4a	fix syntax error	2020-02-14 15:49:54 -08:00
Yang Zhang	b738d1d1af	fix trt regression bug Signed-off-by: Yang Zhang <yangzhang@nvidia.com>	2020-02-14 15:44:26 -08:00
PrzemekS	ce73b32068	Merge pull request #392 from NVIDIA/nvpstr/1def26 Updating BERT/TF, Transformer-XL and NCF/PyT	2020-02-06 19:47:30 +01:00
Przemek Strzelczyk	a38deff61e	[Transformer-XL/PyT] Large model support; multi-node training; inference with TorchScript	2020-02-05 22:38:46 +01:00
Przemek Strzelczyk	1def26d80c	[NCF/PyT] Adding new logging	2020-02-05 21:58:04 +01:00
Sharath T S	ad88003e13	[BERT/PyT] Glue(MRPC) fine-tuning with LAMB pretrained checkpoint * LAMB checkpoint compatibility * LAMB checkpoint compatibility; amp training	2020-02-03 16:27:11 -08:00
Sharath T S	119838f1f6	Bugfix in BertAdam for fp32 finetuning (#388 )	2020-01-30 20:11:03 -08:00
PrzemekS	aa061052c6	Merge pull request #345 from nvcforster/master Updating BERT Readme Docs	2020-01-02 14:35:59 +01:00
Krzysztof Kudrynski	f0ef8493eb	ConvNets update	2019-12-20 14:54:58 +01:00
gkarch	480f1f9811	update perf table for taco2 trt	2019-12-18 11:45:17 +01:00
Rajeev Rao	42555b734b	Minor README cleanup for TRT Tacotron2 example Signed-off-by: Rajeev Rao <rajeevrao@nvidia.com>	2019-12-17 14:33:59 -08:00
gkarch	e5bd631f77	fixed trt table	2019-12-17 16:54:41 +01:00
gkarch	684678086a	fixed speedup factor	2019-12-17 16:51:22 +01:00
gkarch	82518bad2d	fixed speedup in trt readme	2019-12-17 16:49:44 +01:00
gkarch	5dbb6c5b91	updated perf scripts	2019-12-17 16:48:08 +01:00
gkarch	3634315318	fixed tabs	2019-12-16 18:31:29 +01:00
gkarch	2992264d3a	updated Tacotron2: included trt, new dllogger	2019-12-16 18:29:08 +01:00
nv-kkudrynski	e0b365ce6e	Merge pull request #347 from NVIDIA/nvpstr/63cf4f Nvpstr/63cf4f	2019-12-16 13:24:07 +01:00

1 2 3 4

169 commits