1106ff93c0
* Pushing WFST_tutorial for open draft. (Still need to review collab code. Signed-off-by: tbartley94 <tbartley@nvidia.com> * Checked tutorial code for WFST_Tutorial is properly functioning. Also included some formatting edits. Signed-off-by: tbartley94 <tbartley@nvidia.com> * Responding to editorial comments for WFST_tutorial Signed-off-by: tbartley94 <tbartley@nvidia.com> * Added images to folder and wrote README for tutorials Signed-off-by: tbartley94 <tbartley@nvidia.com> * Few more editorial changes to explain permutations in classification. Signed-off-by: tbartley94 <tbartley@nvidia.com> * Updated tutorials documentation page. Signed-off-by: tbartley94 <tbartley@nvidia.com> * Forgot links for README Signed-off-by: tbartley94 <tbartley@nvidia.com> * TOC links were dead Signed-off-by: tbartley94 <tbartley@nvidia.com> * More dead links to fix. Signed-off-by: tbartley94 <tbartley@nvidia.com> * removing collab install and appending a warning instead. Signed-off-by: tbartley94 <tbartley@nvidia.com> * Update WFST_Tutorial.ipynb Signed-off-by: tbartley94 <tbartley@nvidia.com>
145 lines
8.4 KiB
ReStructuredText
145 lines
8.4 KiB
ReStructuredText
.. _tutorials:
|
|
|
|
Tutorials
|
|
=========
|
|
|
|
The best way to get started with NeMo is to start with one of our tutorials.
|
|
|
|
Most NeMo tutorials can be run on `Google's Colab <https://colab.research.google.com/notebooks/intro.ipynb>`_.
|
|
|
|
To run a tutorial:
|
|
|
|
#. Click the **Colab** link (see table below).
|
|
#. Connect to an instance with a GPU. For example, click **Runtime** > **Change runtime type** and select **GPU** for the hardware accelerator.
|
|
|
|
.. list-table:: **Tutorials**
|
|
:widths: 15 25 25
|
|
:header-rows: 1
|
|
|
|
* - Domain
|
|
- Title
|
|
- GitHub URL
|
|
* - General
|
|
- Getting Started: Exploring Nemo Fundamentals
|
|
- `NeMo Fundamentals <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/00_NeMo_Primer.ipynb>`_
|
|
* - General
|
|
- Getting Started: Sample Conversational AI application
|
|
- `Audio translator example <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/AudioTranslationSample.ipynb>`_
|
|
* - General
|
|
- Getting Started: Voice swap application
|
|
- `Voice swap example <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/VoiceSwapSample.ipynb>`_
|
|
* - General
|
|
- Exploring NeMo Model Construction
|
|
- `NeMo Models <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/01_NeMo_Models.ipynb>`_
|
|
* - ASR
|
|
- ASR with NeMo
|
|
- `ASR with NeMo <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/asr/ASR_with_NeMo.ipynb>`_
|
|
* - ASR
|
|
- ASR with Subword Tokenization
|
|
- `ASR with Subword Tokenization <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/asr/ASR_with_Subword_Tokenization.ipynb>`_
|
|
* - ASR
|
|
- Offline ASR Inference with Beam Search and External Language Model Rescoring
|
|
- `Offline ASR <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/asr/Offline_ASR.ipynb>`_
|
|
* - ASR
|
|
- Online ASR inference with Microphone
|
|
- `Online ASR Microphone <https://github.com/NVIDIA/NeMo/blob/stable/tutorials/asr/Online_ASR_Microphone_Demo.ipynb>`_
|
|
* - ASR
|
|
- Fine-tuning CTC Models on New Languages
|
|
- `ASR CTC Language Fine-Tuning <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/asr/ASR_CTC_Language_Finetuning.ipynb>`_
|
|
* - ASR
|
|
- Intro to Transducers
|
|
- `Intro to Transducers <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/asr/Intro_to_Transducers.ipynb>`_
|
|
* - ASR
|
|
- ASR with Transducers
|
|
- `ASR with Transducers <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/asr/ASR_with_Transducers.ipynb>`_
|
|
* - ASR
|
|
- Speech Commands
|
|
- `Speech Commands <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/asr/Speech_Commands.ipynb>`_
|
|
* - ASR
|
|
- Online and Offline Speech Commands Inference
|
|
- `Online Offline Microphone Speech Commands <https://github.com/NVIDIA/NeMo/blob/stable/tutorials/asr/Online_Offline_Speech_Commands_Demo.ipynb>`_
|
|
* - ASR
|
|
- Voice Activity Detection (VAD)
|
|
- `Voice Activity Detection <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/asr/Voice_Activity_Detection.ipynb>`_
|
|
* - ASR
|
|
- Online and Offline VAD Inference
|
|
- `Online Offline Microphone VAD <https://github.com/NVIDIA/NeMo/blob/stable/tutorials/asr/Online_Offline_Microphone_VAD_Demo.ipynb>`_
|
|
* - ASR
|
|
- Speaker Recognition and Verification
|
|
- `Speaker Recognition and Verification <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/speaker_recognition/Speaker_Recognition_Verification.ipynb>`_
|
|
* - ASR
|
|
- Speaker Diarization Inference
|
|
- `Speaker Diarization Inference <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/speaker_recognition/Speaker_Diarization_Inference.ipynb>`_
|
|
* - ASR
|
|
- ASR with Speaker Diarization
|
|
- `ASR with Speaker Diarization <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/speaker_recognition/ASR_with_SpeakerDiarization.ipynb>`_
|
|
* - ASR
|
|
- Online Noise Augmentation
|
|
- `Online Noise Augmentation <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/asr/Online_Noise_Augmentation.ipynb>`_
|
|
* - ASR
|
|
- ASR for Telephony Speech
|
|
- `ASR for Telephony Speech <https://github.com/NVIDIA/NeMo/blob/stable/tutorials/asr/ASR_for_telephony_speech.ipynb>`_
|
|
* - ASR
|
|
- Streaming inference for ASR
|
|
- `Streaming inference <https://github.com/NVIDIA/NeMo/blob/stable/tutorials/asr/Streaming_ASR.ipynb>`_
|
|
* - NLP
|
|
- Using Pretrained Language Models for Downstream Tasks
|
|
- `Pretrained Language Models for Downstream Tasks <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/nlp/01_Pretrained_Language_Models_for_Downstream_Tasks.ipynb>`_
|
|
* - NLP
|
|
- Exploring NeMo NLP Tokenizers
|
|
- `NLP Tokenizers <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/nlp/02_NLP_Tokenizers.ipynb>`_
|
|
* - NLP
|
|
- Text Classification (Sentiment Analysis) with BERT
|
|
- `Text Classification (Sentiment Analysis) <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/nlp/Text_Classification_Sentiment_Analysis.ipynb>`_
|
|
* - NLP
|
|
- Question Answering with SQuAD
|
|
- `Question Answering Squad <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/nlp/Question_Answering_Squad.ipynb>`_
|
|
* - NLP
|
|
- Token Classification (Named Entity Recognition)
|
|
- `Token Classification: Named Entity Recognition <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/nlp/Token_Classification_Named_Entity_Recognition.ipynb>`_
|
|
* - NLP
|
|
- Joint Intent Classification and Slot Filling
|
|
- `Joint Intent and Slot Classification <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/nlp/Joint_Intent_and_Slot_Classification.ipynb>`_
|
|
* - NLP
|
|
- GLUE Benchmark
|
|
- `GLUE Benchmark <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/nlp/GLUE_Benchmark.ipynb>`_
|
|
* - NLP
|
|
- Punctuation and Capitalization
|
|
- `Punctuation and Capitalization <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/nlp/Punctuation_and_Capitalization.ipynb>`_
|
|
* - NLP
|
|
- Entity Linking
|
|
- `Entity Linking <https://colab.research.google.com/github/NVIDIA/NeMo/blob/v1.0.2/tutorials/nlp/Entity_Linking_Medical.ipynb>`_
|
|
* - NLP
|
|
- Named Entity Recognition - BioMegatron
|
|
- `Named Entity Recognition - BioMegatron <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/nlp/Token_Classification-BioMegatron.ipynb>`_
|
|
* - NLP
|
|
- Relation Extraction - BioMegatron
|
|
- `Relation Extraction - BioMegatron <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/nlp/Relation_Extraction-BioMegatron.ipynb>`_
|
|
* - TTS
|
|
- Speech Synthesis
|
|
- `TTS Inference <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/tts/Inference_ModelSelect.ipynb>`_
|
|
* - TTS
|
|
- Speech Synthesis
|
|
- `FastPitch Duration and Pitch Control <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/tts/Inference_DurationPitchControl.ipynb>`_
|
|
* - TTS
|
|
- Speech Synthesis
|
|
- `Tacotron2 Training <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/tts/Tacotron2_Training.ipynb>`_
|
|
* - TTS
|
|
- Speech Synthesis
|
|
- `TalkNet Training <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/tts/TalkNet_Training.ipynb>`_
|
|
* - TTS
|
|
- Speech Synthesis
|
|
- `FastPitch Fine-Tuning <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/tts/FastPitch_Finetuning.ipynb>`_
|
|
* - Tools
|
|
- CTC Segmentation
|
|
- `CTC Segmentation <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/tools/CTC_Segmentation_Tutorial.ipynb>`_
|
|
* - Text Processing
|
|
- Text Normalization for TTS
|
|
- `Text Normalization <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/text_processing/Text_Normalization.ipynb>`_
|
|
* - Text Processing
|
|
- Inverse Text Normalization for ASR
|
|
- `Inverse Text Normalization <https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/text_processing/Inverse_Text_Normalization.ipynb>`_
|
|
* - Text Processing
|
|
- Constructing Normalization Grammars with WFSTs
|
|
- `WFST Tutorial <https://github.com/NVIDIA/NeMo/blob/stable/tutorials/text_processing/WFST_Tutorial.ipynb>`_
|