speechbrain
diff --git a/.github/ISSUE_TEMPLATE/bug_report.yaml b/.github/ISSUE_TEMPLATE/bug_report.yaml
Lines changed: 25 additions & 12 deletions
diff --git a/.github/pull_request_template.md b/.github/pull_request_template.md
Lines changed: 38 additions & 21 deletions
diff --git a/.github/workflows/pre-commit.yml b/.github/workflows/pre-commit.yml
Lines changed: 1 addition & 1 deletion
diff --git a/.github/workflows/pythonapp.yml b/.github/workflows/pythonapp.yml
Lines changed: 13 additions & 6 deletions
diff --git a/.github/workflows/release.yml b/.github/workflows/release.yml
Lines changed: 1 addition & 1 deletion
diff --git a/.github/workflows/verify-docs-gen.yml b/.github/workflows/verify-docs-gen.yml
Lines changed: 5 additions & 2 deletions
diff --git a/.gitignore b/.gitignore
Lines changed: 3 additions & 6 deletions
diff --git a/.readthedocs.yaml b/.readthedocs.yaml
Lines changed: 7 additions & 5 deletions
diff --git a/CITATION.cff b/CITATION.cff
Lines changed: 116 additions & 0 deletions
@@ -1,26 +1,29 @@
 name: 🪲 Bug Report
 description: Something went wrong? Let us know! 🐣
-title: "[Bug]: "
 labels: ["bug"]
 body:
   - type: markdown
     attributes:
       value: |
-        Before submitting a bug, please make sure the issue hasn't been already addressed by searching through the existing and past issues.
+        **Before submitting a bug report, please read the following instructions:**
+
+        - Make sure the issue hasn't already been addressed by searching through existing and past issues.
+        - Use a clear and concise title for your bug report.
+        - Fill out all relevant sections below to help us understand and reproduce the issue.
 
   - type: textarea
     id: describe-the-bug
     attributes:
       label: Describe the bug
-      description: Short and clear description of what the bug is.
+      description: Provide a clear and concise description of the bug.
     validations:
       required: True
 
   - type: textarea
     id: expected-behaviour
     attributes:
       label: Expected behaviour
-      description: A description of what you expected to happen.
+      description: Describe what you expected to happen.
     validations:
       required: True
 
@@ -29,35 +32,45 @@ body:
     attributes:
       label: To Reproduce
       description: |
-        If relevant, add a minimal example so that we can reproduce the error by running the code. It is very important for the snippet to be as minimal as possible. We will copy-paste your code, and we expect to get the same result as you did: avoid any external data, and include the relevant imports.
+        If relevant, add a minimal example or detailed steps to reproduce the error. You can share code directly using Google Colab:
+        1. Visit [Google Colab](https://colab.research.google.com/).
+        2. Create a new notebook.
+        3. Paste your code into the notebook.
+        4. Share the notebook by clicking on "Share" in the top-right corner.
+        5. Share the notebook's link here.
+
+        In the worst case, provide detailed steps to reproduce the behavior.
+
       placeholder: "```python #your code ``` \n ```yaml #your yaml code ```"
     validations:
       required: False
 
   - type: textarea
     id: versions
     attributes:
-      label: Versions
-      description: "Please tell us more about your current SpeechBrain version and/or git hash (if installed via cloning+editable install). You can also add other setup information that might be relevant."
+      label: Environment Details
+      description: Provide information about your SpeechBrain version, setup, and any other relevant environment details.
     validations:
       required: False
 
   - type: textarea
     id: logs
     attributes:
-      label: Relevant log output
-      description: Please copy and paste any relevant log output.
+      label: Relevant Log Output
+      description: Copy and paste any relevant log output here.
       render: shell
+    validations:
+      required: False
 
   - type: textarea
     id: add-context
     attributes:
-      label: Additional context
-      description: "Add any other context about the problem here."
+      label: Additional Context
+      description: Share any other context about the problem or your environment that may help in troubleshooting.
     validations:
       required: False
 
   - type: markdown
     attributes:
       value: |
-        Thanks for contributing to SpeechBrain!
+        **Thank you for contributing to SpeechBrain!** Your bug report helps us improve the project's reliability.
@@ -1,28 +1,45 @@
-# Contribution in a nutshell
-Hey, this could help our community 🌱
+## What does this PR do?
 
-# Scope
-* [ ] I want to get done ...
-* [ ] ... and hope to also achieve ...
+<!--
+Please include a summary of the change and which issue is fixed.
+Please also include relevant motivation and context.
+List any dependencies that are required for this change.
 
-# Notes for reviewing (optional)
-This change has these implication which might need attention over here; —how should we tackle this?
+-->
 
-# Pre-review
-* [ ] (if applicable) add an `extra_requirements.txt` file
-* [ ] (if applicable) add database preparation scripts & use symlinks for nested folders (to the level of task READMEs)
-* [ ] (if applicable) add a recipe test entry in the depending CSV file under: tests/recipes
-* [ ] create a fresh testing environment (install SpeechBrain from cloned repo branch of this PR)
-* [ ] (if applicable) run a recipe test for each yaml/your recipe dataset
-* [ ] check function comments: are there docstrings w/ arguments & returns? If you're not the verbose type, put a comment every three lines of code (better: every line)
-* [ ] use CI locally: `pre-commit run -a` to check linters; run `pytest tests/consistency`
-* [ ] (optional) run `tests/.run-doctests.sh` & `tests/.run-unittests.sh`
-* [ ] exhausted patience before clicking « Ready for review » in the merge box 🍄
+Fixes #<issue_number>
 
----
+<!-- Does your PR introduce any breaking changes? If yes, please list them. -->
 
-Note: when merged, we desire to include your PR title in our contributions list, check out one of our past version releases
-—https://github.com/speechbrain/speechbrain/releases/tag/v0.5.14
+<details>
+  <summary><b>Before submitting</b></summary>
 
-Tip: below, on the « Create Pull Request » use the drop-down to select: « Create Draft Pull Request » – your PR will be in draft mode until you declare it « Ready for review »
+- [ ] Did you read the [contributor guideline](https://speechbrain.readthedocs.io/en/latest/contributing.html)?
+- [ ] Did you make sure your **PR does only one thing**, instead of bundling different changes together?
+- [ ] Did you make sure to **update the documentation** with your changes? (if necessary)
+- [ ] Did you write any **new necessary tests**? (not for typos and docs)
+- [ ] Did you verify new and **existing [tests](https://github.com/speechbrain/speechbrain/tree/develop/tests) pass** locally with your changes?
+- [ ] Did you list all the **breaking changes** introduced by this pull request?
+- [ ] Does your code adhere to project-specific code style and conventions?
 
+</details>
+
+## PR review
+
+<details>
+  <summary>Reviewer checklist</summary>
+
+- [ ] Is this pull request ready for review? (if not, please submit in draft mode)
+- [ ] Check that all items from **Before submitting** are resolved
+- [ ] Make sure the title is self-explanatory and the description concisely explains the PR
+- [ ] Add labels and milestones (and optionally projects) to the PR so it can be classified
+- [ ] Confirm that the changes adhere to compatibility requirements (e.g., Python version, platform)
+- [ ] Review the self-review checklist to ensure the code is ready for review
+
+</details>
+
+<!--
+
+🎩 Magic happens when you code. Keep the spells flowing!
+
+-->
@@ -12,5 +12,5 @@ jobs:
       - uses: actions/checkout@v2
       - uses: actions/setup-python@v2
         with:
-          python-version: '3.8'
+          python-version: '3.9'
       - uses: pre-commit/action@v2.0.3
@@ -21,22 +21,29 @@ jobs:
               uses: actions/setup-python@v1
               with:
                   python-version: ${{ matrix.python-version }}
-            - name: Install libsndfile
+            - name: Install sox
               run: |
                   sudo apt-get update
-                  sudo apt-get install -y libsndfile1
-            - name: Install ffmpeg
-              run: |
-                  sudo apt-get update
-                  sudo apt-get install -y ffmpeg
+                  sudo apt install sox libsox-dev
+            # Installing only SoX for now due to FFmpeg issues on the CI server with Torchaudio 2.1.
+            # FFmpeg works fine on all other machines. We'll switch back when the CI server is fixed.
+            #- name: Install ffmpeg
+            #  run: |
+            #      sudo apt-get update
+            #      sudo apt-get install -y ffmpeg
             - name: Display Python version
               run: python -c "import sys; print(sys.version)"
             - name: Full dependencies
               run: |
                   sudo apt-get update
+                  # up to k2 compatible torch version
+                  pip install torch==2.1.2 torchaudio==2.1.2
                   pip install -r requirements.txt
                   pip install --editable .
                   pip install ctc-segmentation
+                  pip install k2==1.24.4.dev20231220+cpu.torch2.1.2 -f https://k2-fsa.github.io/k2/cpu.html
+                  pip install protobuf
+                  pip install kaldilm==1.15
             - name: Consistency tests with pytest
               run: |
                   pytest tests/consistency
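
Review note: the "Full dependencies" step pins `torch`, `torchaudio`, and `k2` separately, and the k2 wheel's local version tag (`+cpu.torch2.1.2`) appears to name the torch build it targets. The sketch below is one way to check that the three pins agree. It is not part of this change: the helper names are hypothetical, and the rule that `torchaudio` must carry the same version as `torch` is an assumption based on the pins shown in the hunk.

```python
import re
from typing import Optional


def torch_version_in_k2_pin(k2_pin: str) -> Optional[str]:
    """Return the torch version embedded in a k2 pin's local version tag, if any."""
    # Only inspect the part after '+', e.g. "cpu.torch2.1.2".
    _, _, local_tag = k2_pin.partition("+")
    match = re.search(r"torch(\d+(?:\.\d+)*)", local_tag)
    return match.group(1) if match else None


def pinned_version(requirement: str) -> str:
    """Return the version from an exact 'name==version' requirement string."""
    _, sep, version = requirement.partition("==")
    if not sep:
        raise ValueError(f"not an exact pin: {requirement!r}")
    return version.strip()


def pins_agree(torch_pin: str, torchaudio_pin: str, k2_pin: str) -> bool:
    """Check that torch and torchaudio match the torch version named by the k2 pin."""
    expected = torch_version_in_k2_pin(k2_pin)
    if expected is None:
        return False
    return pinned_version(torch_pin) == expected == pinned_version(torchaudio_pin)


# Pins copied from the workflow step above.
ok = pins_agree(
    "torch==2.1.2",
    "torchaudio==2.1.2",
    "k2==1.24.4.dev20231220+cpu.torch2.1.2",
)
```

A check like this could run before the install commands, so that bumping one pin without the others fails early rather than at import time.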
 
@@ -17,7 +17,7 @@ jobs:
           ref: main
       - uses: actions/setup-python@v2
         with:
-          python-version: 3.8
+          python-version: 3.9
       - name: Install pypa/build
         run: python -m pip install build --user
       - name: Build binary wheel and source tarball
 
@@ -11,15 +11,18 @@ jobs:
         runs-on: ubuntu-latest
         steps:
             - uses: actions/checkout@v2
-            - name: Setup Python 3.8
+            - name: Setup Python 3.9
               uses: actions/setup-python@v2
               with:
-                  python-version: '3.8'
+                  python-version: '3.9'
             - name: Full dependencies
               run: |
+                  # up to k2 compatible torch version
+                  pip install torch==2.1.2 torchaudio==2.1.2
                   pip install -r requirements.txt
                   pip install --editable .
                   pip install -r docs/docs-requirements.txt
+                  pip install k2==1.24.4.dev20231220+cpu.torch2.1.2 -f https://k2-fsa.github.io/k2/cpu.html
             - name: Generate docs
               run: |
                   cd docs
 
@@ -72,11 +72,8 @@ instance/
 .scrapy
 
 # Sphinx documentation
-docs/_build/
-docs/source/*.rst
-!docs/source/index.rst
-!docs/source/_templates
-!docs/source/_static
+docs/build/
+docs/API/*.rst
 
 # PyBuilder
 target/
@@ -158,4 +155,4 @@ dmypy.json
 **/log/
 
 # Mac OS
-.DS_Store
+.DS_Store
@@ -1,13 +1,15 @@
 # .readthedocs.yaml
 
+version: 2
+
 build:
-  image: latest
+  os: ubuntu-20.04
+  tools:
+    python: "3.9"
 
 python:
-  version: 3.8
-  pip_install: True
+  install:
+    - requirements: docs/readthedocs-requirements.txt
 
 # Don't build any extra formats
 formats: []
-
-requirements_file: docs/docs-requirements.txt
 
@@ -0,0 +1,116 @@
+# This CITATION.cff file was generated with cffinit.
+# Visit https://bit.ly/cffinit to generate yours today!
+
+cff-version: 1.2.0
+title: SpeechBrain
+message: A PyTorch-based Speech Toolkit
+type: software
+authors:
+  - given-names: Mirco
+    family-names: Ravanelli
+    affiliation: 'Mila - Quebec AI Institute, Université de Montréal'
+  - given-names: Titouan
+    family-names: Parcollet
+    affiliation: >-
+      LIA - Avignon Université, CaMLSys - University of
+      Cambridge
+  - given-names: Peter
+    family-names: Plantinga
+    affiliation: Ohio State University
+  - given-names: Aku
+    family-names: Rouhe
+    affiliation: Aalto University
+  - given-names: Samuele
+    family-names: Cornell
+    affiliation: Università Politecnica delle Marche
+  - given-names: Loren
+    family-names: Lugosch
+    affiliation: 'Mila - Quebec AI Institute, McGill University'
+  - given-names: Cem
+    family-names: Subakan
+    affiliation: Mila - Quebec AI Institute
+  - given-names: Nauman
+    family-names: Dawalatabad
+    affiliation: Indian Institute of Technology Madras
+  - given-names: Abdelwahab
+    family-names: Heba
+    affiliation: IRIT - Université Paul Sabatier
+  - given-names: Jianyuan
+    family-names: Zhong
+    affiliation: Mila - Quebec AI Institute
+  - given-names: Ju-Chieh
+    family-names: Chou
+    affiliation: Toyota Technological Institute at Chicago
+  - given-names: Sung-Lin
+    family-names: Yeh
+    affiliation: University of Edinburgh
+  - given-names: Szu-Wei
+    family-names: Fu
+    affiliation: 'Academia Sinica, Taiwan'
+  - given-names: Chien-Feng
+    family-names: Liao
+    affiliation: 'Academia Sinica, Taiwan'
+  - given-names: Elena
+    family-names: Rastorgueva
+    affiliation: NVIDIA
+  - given-names: François
+    family-names: Grondin
+    affiliation: Université de Sherbrooke
+  - given-names: William
+    family-names: Aris
+    affiliation: Université de Sherbrooke
+  - given-names: Hwidong
+    family-names: Na
+    affiliation: Samsung-SAIT
+  - given-names: Yan
+    family-names: Gao
+    affiliation: CaMLSys - University of Cambridge
+  - given-names: Renato
+    name-particle: De
+    family-names: Mori
+    affiliation: 'LIA - Avignon Université, McGill University'
+  - given-names: Yoshua
+    family-names: Bengio
+    affiliation: 'Mila - Quebec AI Institute, Université de Montréal'
+identifiers:
+  - type: doi
+    value: 10.48550/arXiv.2106.04624
+    description: 'SpeechBrain: A General-Purpose Speech Toolkit'
+repository-code: 'https://github.com/speechbrain/speechbrain/'
+url: 'https://speechbrain.github.io/'
+abstract: >-
+  SpeechBrain is an open-source and all-in-one speech
+  toolkit. It is designed to facilitate the research and
+  development of neural speech processing technologies by
+  being simple, flexible, user-friendly, and
+  well-documented. This paper describes the core
+  architecture designed to support several tasks of common
+  interest, allowing users to naturally conceive, compare
+  and share novel speech processing pipelines. SpeechBrain
+  achieves competitive or state-of-the-art performance in a
+  wide range of speech benchmarks. It also provides training
+  recipes, pretrained models, and inference scripts for
+  popular speech datasets, as well as tutorials which allow
+  anyone with basic Python proficiency to familiarize
+  themselves with speech technologies.
+keywords:
+  - speech toolkit
+  - audio
+  - deep learning
+  - PyTorch
+  - transformers
+  - voice recognition
+  - speech recognition
+  - speech-to-text
+  - language model
+  - speaker recognition
+  - speaker verification
+  - speech processing
+  - audio processing
+  - ASR
+  - speaker diarization
+  - speech separation
+  - speech enhancement
+  - spoken language understanding
+  - HuggingFace
+license: Apache-2.0
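
Review note: one entry in the `authors` list uses `name-particle` (`De`) in addition to `given-names` and `family-names`. The sketch below shows how a consumer of this file might assemble display names from those keys. It is a minimal illustration only: `display_name` is a hypothetical helper, the two entries are copied by hand from the file rather than parsed from YAML, and placing the particle between the given and family names is an assumed display convention.

```python
from typing import Dict, List


def display_name(author: Dict[str, str]) -> str:
    """Join the CFF name keys that are present, in given / particle / family order."""
    ordered_keys = ("given-names", "name-particle", "family-names")
    return " ".join(author[key] for key in ordered_keys if author.get(key))


# Two entries copied from the authors list above (affiliations omitted).
authors: List[Dict[str, str]] = [
    {"given-names": "Mirco", "family-names": "Ravanelli"},
    {"given-names": "Renato", "name-particle": "De", "family-names": "Mori"},
]

names = [display_name(author) for author in authors]
```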