NeMo/examples/asr
Samuel Kriman b7a175b7b9
Self-supervised pre-training for speech models (#3139)
* self-supervised training

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* test

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* remove imports

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* fix

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* sort imports

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* fix audio_to_text

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* manifest handle no text

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* loss init

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* style

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* remove tokenizer from config

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* config changes

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* remove hydra import

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* always spec augment

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* fixes

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* copyright

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* fix cosine sim

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* fix cosine sim

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* fix cosine sim

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* changes based on comments

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* changes based on comments

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* configs

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* name fix

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* ci config changes

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* renamed to num_negatives

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* minor changes

Signed-off-by: sam1373 <samuelkriman@gmail.com>

* name changes, type annotations

Signed-off-by: sam1373 <samuelkriman@gmail.com>

Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>
2021-11-10 15:33:11 -08:00
..
conf Self-supervised pre-training for speech models (#3139) 2021-11-10 15:33:11 -08:00
experimental [BigNLP] Merge Megatron GPT to main (#2975) 2021-10-20 21:06:37 -06:00
export/transducer Enable RNNT ONNX Export (#2510) 2021-07-20 20:01:18 -07:00
quantization patch quantization (#2314) 2021-06-07 13:19:08 -07:00
speech_pre_training.py Self-supervised pre-training for speech models (#3139) 2021-11-10 15:33:11 -08:00
speech_to_label.py Merge tag 'v1.0.0' into main 2021-06-03 15:49:50 -07:00
speech_to_text.py ASR Refactoring (#2240) 2021-05-26 15:07:02 -07:00
speech_to_text_bpe.py Merge tag 'v1.0.0' into main 2021-06-03 15:49:50 -07:00
speech_to_text_buffered_infer.py Script for ASR inference on long files (#2373) 2021-07-14 15:51:28 -07:00
speech_to_text_infer.py Update speech_to_text_infer.py (#2027) 2021-04-07 10:28:03 -04:00
speech_to_text_rnnt.py ASR Refactoring (#2240) 2021-05-26 15:07:02 -07:00
speech_to_text_rnnt_bpe.py ASR Refactoring (#2240) 2021-05-26 15:07:02 -07:00
transcribe_speech.py Enable RNNT ONNX Export (#2510) 2021-07-20 20:01:18 -07:00
transcribe_speech_parallel.py Adding parallel transcribe for ASR models - suppports multi-gpu/multi-node (#3017) 2021-11-10 00:37:19 -08:00
vad_infer.py Update vad_infer.py (#2702) 2021-08-20 10:39:18 -07:00