NeMo/scripts/speaker_tasks/rttm_to_manifest.py
Nithin Rao 0aa5b4526a
Move speaker folders (#2777)
* initial push

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

change folder

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

readme

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

Create README.md

initial diar readme

scp_manifest

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

rebase and move folders

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

updated scp to manifest script

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

small_fix

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

Update README.md

add recogniton read me

tutorial update

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

add diarization README

Updated README.md 001

Updated README.md and committing for saving purpose

Update README.md

conf changes

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

Update README.md 002

Added examples for input and output.

Added diarization_utils.py and asr_with_diarization.py

Signed-off-by: Taejin Park <tango4j@gmail.com>

slight changes diarization

oracle null and style --fix

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

Reflected LGTM comments.

Signed-off-by: Taejin Park <tango4j@gmail.com>

reflected changes

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

remove duplicate seeds

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

Reflected PR review and removed unused variables

Signed-off-by: Taejin Park <tango4j@gmail.com>

Update README.md 003

Added a few titles and revised the descriptions.

Signed-off-by: Taejin Park <tango4j@gmail.com>

scripts and tutorial link fixes

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

LGTM fixes

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

Added more docstrings and reused get_DER

Signed-off-by: Taejin Park <tango4j@gmail.com>

style fix

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>

* update ecapa config

Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
2021-09-08 20:58:08 -07:00

# Copyright (c) 2020, NVIDIA CORPORATION. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
"""
Convert the VAD output files (RTTM-like) found in the VAD output directory
into a manifest file for speaker diarization.
Each VAD line consists of start_time, end_time, and a speech/non-speech label.
"""
import argparse

from nemo.collections.asr.parts.utils.speaker_utils import write_rttm2manifest
from nemo.utils import logging

if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--paths2rttm_files",
        help="paths to the RTTM-like VAD output files. Can be a list or a text file",
        required=True,
    )
    parser.add_argument(
        "--paths2audio_files",
        help="paths to the audio files on which VAD was computed. Can be a list or a text file",
        required=True,
    )
    parser.add_argument("--manifest_file", help="output manifest file name", type=str, required=True)
    args = parser.parse_args()

    # Argument order follows the original call: audio paths, RTTM paths, output manifest.
    write_rttm2manifest(args.paths2audio_files, args.paths2rttm_files, args.manifest_file)
    logging.info(
        "Wrote manifest file {} from the VAD output files in {}".format(args.manifest_file, args.paths2rttm_files)
    )
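
# The conversion the docstring describes can be illustrated with a small standalone sketch.
# This is not the NeMo implementation of write_rttm2manifest. It assumes each VAD line holds
# whitespace-separated start_time, end_time, and a label, as the docstring states. The label
# strings ("speech" / "non-speech"), the helper names, and the manifest field names
# (audio_filepath, offset, duration, label) are hypothetical and may differ from what NeMo writes.

```python
import json


def vad_lines_to_manifest_entries(audio_path, vad_lines):
    """Turn 'start end label' lines into manifest dicts, keeping only speech segments."""
    entries = []
    for line in vad_lines:
        fields = line.split()
        if len(fields) != 3:
            continue  # skip blank or malformed lines
        start, end, label = float(fields[0]), float(fields[1]), fields[2]
        if label != "speech":  # assumed label string for speech segments
            continue
        entries.append(
            {
                "audio_filepath": audio_path,  # hypothetical field names
                "offset": start,
                "duration": round(end - start, 3),
                "label": "UNK",
            }
        )
    return entries


def write_manifest(entries, manifest_path):
    """Write one JSON object per line."""
    with open(manifest_path, "w", encoding="utf-8") as f:
        for entry in entries:
            f.write(json.dumps(entry) + "\n")


example_lines = ["0.00 1.50 speech", "1.50 2.25 non-speech", "2.25 4.00 speech"]
example_entries = vad_lines_to_manifest_entries("session1.wav", example_lines)
```

# In this sketch, non-speech segments are dropped and each remaining segment becomes one
# JSON line, so the manifest can be read back line by line.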