NeMo/examples/asr at main - maxmustermann/NeMo

History

Samuel Kriman b7a175b7b9 Self-supervised pre-training for speech models (#3139 ) * self-supervised training Signed-off-by: sam1373 <samuelkriman@gmail.com> * test Signed-off-by: sam1373 <samuelkriman@gmail.com> * remove imports Signed-off-by: sam1373 <samuelkriman@gmail.com> * fix Signed-off-by: sam1373 <samuelkriman@gmail.com> * sort imports Signed-off-by: sam1373 <samuelkriman@gmail.com> * fix audio_to_text Signed-off-by: sam1373 <samuelkriman@gmail.com> * manifest handle no text Signed-off-by: sam1373 <samuelkriman@gmail.com> * loss init Signed-off-by: sam1373 <samuelkriman@gmail.com> * style Signed-off-by: sam1373 <samuelkriman@gmail.com> * remove tokenizer from config Signed-off-by: sam1373 <samuelkriman@gmail.com> * config changes Signed-off-by: sam1373 <samuelkriman@gmail.com> * remove hydra import Signed-off-by: sam1373 <samuelkriman@gmail.com> * always spec augment Signed-off-by: sam1373 <samuelkriman@gmail.com> * fixes Signed-off-by: sam1373 <samuelkriman@gmail.com> * copyright Signed-off-by: sam1373 <samuelkriman@gmail.com> * fix cosine sim Signed-off-by: sam1373 <samuelkriman@gmail.com> * fix cosine sim Signed-off-by: sam1373 <samuelkriman@gmail.com> * fix cosine sim Signed-off-by: sam1373 <samuelkriman@gmail.com> * changes based on comments Signed-off-by: sam1373 <samuelkriman@gmail.com> * changes based on comments Signed-off-by: sam1373 <samuelkriman@gmail.com> * configs Signed-off-by: sam1373 <samuelkriman@gmail.com> * name fix Signed-off-by: sam1373 <samuelkriman@gmail.com> * ci config changes Signed-off-by: sam1373 <samuelkriman@gmail.com> * renamed to num_negatives Signed-off-by: sam1373 <samuelkriman@gmail.com> * minor changes Signed-off-by: sam1373 <samuelkriman@gmail.com> * name changes, type annotations Signed-off-by: sam1373 <samuelkriman@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>		2021-11-10 15:33:11 -08:00
..
conf	Self-supervised pre-training for speech models (#3139 )	2021-11-10 15:33:11 -08:00
experimental	[BigNLP] Merge Megatron GPT to main (#2975 )	2021-10-20 21:06:37 -06:00
export/transducer	Enable RNNT ONNX Export (#2510 )	2021-07-20 20:01:18 -07:00
quantization	patch quantization (#2314 )	2021-06-07 13:19:08 -07:00
speech_pre_training.py	Self-supervised pre-training for speech models (#3139 )	2021-11-10 15:33:11 -08:00
speech_to_label.py	Merge tag 'v1.0.0' into main	2021-06-03 15:49:50 -07:00
speech_to_text.py	ASR Refactoring (#2240 )	2021-05-26 15:07:02 -07:00
speech_to_text_bpe.py	Merge tag 'v1.0.0' into main	2021-06-03 15:49:50 -07:00
speech_to_text_buffered_infer.py	Script for ASR inference on long files (#2373 )	2021-07-14 15:51:28 -07:00
speech_to_text_infer.py	Update speech_to_text_infer.py (#2027 )	2021-04-07 10:28:03 -04:00
speech_to_text_rnnt.py	ASR Refactoring (#2240 )	2021-05-26 15:07:02 -07:00
speech_to_text_rnnt_bpe.py	ASR Refactoring (#2240 )	2021-05-26 15:07:02 -07:00
transcribe_speech.py	Enable RNNT ONNX Export (#2510 )	2021-07-20 20:01:18 -07:00
transcribe_speech_parallel.py	Adding parallel transcribe for ASR models - suppports multi-gpu/multi-node (#3017 )	2021-11-10 00:37:19 -08:00
vad_infer.py	Update vad_infer.py (#2702 )	2021-08-20 10:39:18 -07:00