
Commit fe2ecd4

Organizing the script folder (NVIDIA-NeMo#1844)
* fixed paths in tutorials.
  Signed-off-by: Vahid <vnoroozi@nvidia.com>
* Updated.
  Signed-off-by: Vahid <vnoroozi@nvidia.com>
* added nvidia copyright.
  Signed-off-by: Vahid <vnoroozi@nvidia.com>
* fixed the paths.
  Signed-off-by: Vahid <vnoroozi@nvidia.com>
* fixed the branch.
  Signed-off-by: Vahid <vnoroozi@nvidia.com>
1 parent 79bb319 commit fe2ecd4

36 files changed

Lines changed: 57 additions & 32 deletions

examples/asr/speech_to_label.py

Lines changed: 4 additions & 4 deletions
@@ -16,10 +16,10 @@
 # Task 1: Speech Command
 
 ## Preparing the dataset
-Use the `process_speech_commands_data.py` script under <NEMO_ROOT>/scripts in order to prepare the dataset.
+Use the `process_speech_commands_data.py` script under <NEMO_ROOT>/scripts/dataset_processing in order to prepare the dataset.
 
 ```sh
-python <NEMO_ROOT>/scripts/process_speech_commands_data.py \
+python <NEMO_ROOT>/scripts/dataset_processing/process_speech_commands_data.py \
 --data_root=<absolute path to where the data should be stored> \
 --data_version=<either 1 or 2, indicating version of the dataset> \
 --class_split=<either "all" or "sub", indicates whether all 30/35 classes should be used, or the 10+2 split should be used> \
@@ -47,7 +47,7 @@
 # Task 2: Voice Activity Detection
 
 ## Preparing the dataset
-Use the `process_vad_data.py` script under <NEMO_ROOT>/scripts in order to prepare the dataset.
+Use the `process_vad_data.py` script under <NEMO_ROOT>/scripts/dataset_processing in order to prepare the dataset.
 
 ```sh
 python process_vad_data.py \
@@ -82,7 +82,7 @@
 Note that it's possible that tarred datasets impacts validation scores because it drop values in order to have same amount of files per tarfile;
 Scores might be off since some data is missing.
 
-Use the `convert_to_tarred_audio_dataset.py` script under <NEMO_ROOT>/scripts in order to prepare tarred audio dataset.
+Use the `convert_to_tarred_audio_dataset.py` script under <NEMO_ROOT>/scripts/speech_recognition in order to prepare tarred audio dataset.
 For details, please see TarredAudioToClassificationLabelDataset in <NEMO_ROOT>/nemo/collections/asr/data/audio_to_label.py
 
 python speech_to_label.py \

examples/asr/speech_to_text_bpe.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -14,10 +14,10 @@
1414

1515
"""
1616
# Preparing the Tokenizer for the dataset
17-
Use the `process_asr_text_tokenizer.py` script under <NEMO_ROOT>/scripts in order to prepare the tokenizer.
17+
Use the `process_asr_text_tokenizer.py` script under <NEMO_ROOT>/scripts/tokenizers/ in order to prepare the tokenizer.
1818
1919
```sh
20-
python <NEMO_ROOT>/scripts/process_asr_text_tokenizer.py \
20+
python <NEMO_ROOT>/scripts/tokenizers/process_asr_text_tokenizer.py \
2121
--manifest=<path to train manifest files, seperated by commas>
2222
OR
2323
--data_file=<path to text data, seperated by commas> \

examples/asr/speech_to_text_rnnt.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -21,10 +21,10 @@
2121

2222
"""
2323
# Preparing the Tokenizer for the dataset
24-
Use the `process_asr_text_tokenizer.py` script under <NEMO_ROOT>/scripts in order to prepare the tokenizer.
24+
Use the `process_asr_text_tokenizer.py` script under <NEMO_ROOT>/scripts/tokenizers/ in order to prepare the tokenizer.
2525
2626
```sh
27-
python <NEMO_ROOT>/scripts/process_asr_text_tokenizer.py \
27+
python <NEMO_ROOT>/scripts/tokenizers/process_asr_text_tokenizer.py \
2828
--manifest=<path to train manifest files, seperated by commas> \
2929
--data_root="<output directory>" \
3030
--vocab_size=<number of tokens in vocabulary> \

examples/asr/vad_infer.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@
1717
1) shift the window of length time_length (e.g. 0.63s) by shift_length (e.g. 10ms) to generate the frame and use the prediction of the window to represent the label for the frame;
1818
[this script demonstrate how to do this approach]
1919
2) generate predictions with overlapping input segments. Then a smoothing filter is applied to decide the label for a frame spanned by multiple segments.
20-
[get frame level prediction by this script and use vad_overlap_posterior.py in NeMo/scripts/
20+
[get frame level prediction by this script and use vad_overlap_posterior.py in NeMo/scripts/voice_activity_detection
2121
One can also find posterior about converting frame level prediction
2222
to speech/no-speech segment in start and end times format in that script.]
2323

examples/speaker_recognition/speaker_reco.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@
4343
Note that it's possible that tarred datasets impacts validation scores because it drop values in order to have same amount of files per tarfile;
4444
Scores might be off since some data is missing.
4545
46-
Use the `convert_to_tarred_audio_dataset.py` script under <NEMO_ROOT>/scripts in order to prepare tarred audio dataset.
46+
Use the `convert_to_tarred_audio_dataset.py` script under <NEMO_ROOT>/speech_recognition/scripts in order to prepare tarred audio dataset.
4747
For details, please see TarredAudioToClassificationLabelDataset in <NEMO_ROOT>/nemo/collections/asr/data/audio_to_label.py
4848
"""
4949

scripts/rttm_to_manifest.py renamed to scripts/ speaker_recognition/rttm_to_manifest.py

File renamed without changes.

scripts/scp_to_manifest.py renamed to scripts/ speaker_recognition/scp_to_manifest.py

File renamed without changes.
File renamed without changes.

scripts/create_tarred_transformer_lm_dataset.py renamed to scripts/asr_language_modelling/create_tarred_transformer_lm_dataset.py

File renamed without changes.

scripts/install_ctc_decoders.sh renamed to scripts/asr_language_modelling/install_ctc_decoders.sh

File renamed without changes.
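
For readers updating their own tooling after this reorganization, the relocations shown in the diff can be captured in a small lookup. The following is only an illustrative sketch, not part of the commit: the `NEW_SUBFOLDER` table and the `relocate` helper are hypothetical names, and the table covers only the scripts whose new folder is stated explicitly in the hunks and rename entries of this excerpt (the commit touches 36 files, so it is incomplete).

```python
from pathlib import PurePosixPath

# Script file name -> new subfolder under scripts/, taken from the hunks and
# rename entries in this excerpt only. Hypothetical helper table; the full
# commit moves more files than are listed here.
NEW_SUBFOLDER = {
    "process_speech_commands_data.py": "dataset_processing",
    "process_vad_data.py": "dataset_processing",
    "convert_to_tarred_audio_dataset.py": "speech_recognition",
    "process_asr_text_tokenizer.py": "tokenizers",
    "vad_overlap_posterior.py": "voice_activity_detection",
    "create_tarred_transformer_lm_dataset.py": "asr_language_modelling",
    "install_ctc_decoders.sh": "asr_language_modelling",
}


def relocate(old_path: str) -> str:
    """Map a pre-commit path such as 'scripts/foo.py' to its new location.

    Paths that are not directly under a 'scripts' folder, or whose file name
    is not in the table, are returned unchanged.
    """
    path = PurePosixPath(old_path)
    subfolder = NEW_SUBFOLDER.get(path.name)
    if subfolder is None or path.parent.name != "scripts":
        return old_path
    return str(path.parent / subfolder / path.name)


if __name__ == "__main__":
    print(relocate("scripts/process_asr_text_tokenizer.py"))
    print(relocate("<NEMO_ROOT>/scripts/install_ctc_decoders.sh"))
```

Because already-relocated paths are returned unchanged, the helper can be applied repeatedly without nesting the subfolder twice.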
