Voice analysis functions by pplantinga · Pull Request #2689 · speechbrain/speechbrain

pplantinga · 2024-09-17T21:13:05Z

This PR introduces functions for helping with voice analysis, such as dysarthric speech detection.

The planned functions are as follows:

This provides a start, more may get added later. Tutorial included.

…mula

TParcollet · 2024-10-12T06:58:27Z

@pplantinga let me know if you are satisfied with this PR, it looks good to me. Maybe we may want to provide a tutorial or a short example?

pplantinga · 2024-10-14T19:52:09Z

Let's wait on this, I'm still tweaking it and a tutorial would be nice too

…te and 100x faster

pplantinga · 2024-10-29T20:23:47Z

Tutorial is added, ready for review.

pplantinga · 2025-01-07T17:21:11Z

Alright, I think this is finally ready for review again @TParcollet @ycemsubakan @mravanelli . It now includes spectral features, and matches PRAAT and OpenSMILE. Later perhaps a recipe can be added for some open dataset.

bcordel

Couple of comments for the jupyter notebook:

Title section:
This notebook goes through a simple voice analysis of a few speech samples. If you are new to speech processing, we recommend reading through this introduction before going through the notebook. First we download a public Parkinson's dataset and cut to just the sustained phonation.

Compute autocorrelation and related features (code box 1):
line 8: perhaps a comment/link about how to estimate best_lags
line 24: step_samples is hardcoded as 441

Could put the same comment from vocal_features.py into .ipynb for GNE

Maybe a "Here are some additional speech processing resources" box before the Speechbrain citation with (for ex):
https://tahull.github.io/blog/2020/08/acf-animated
https://github.com/chautruonglong/Fundamental-Frequency
https://www.fon.hum.uva.nl/praat/
https://www.audeering.com/opensmile/

No comments for the .pys, looks good to me !

pplantinga · 2025-02-27T15:01:05Z

Hi @mravanelli , @bcordel has completed his review and I was able to address the comments. I guess the last thing is your review, let me know if there's anything I can do to help.

… tutorial

ycemsubakan

I think the additional explanations help a lot!

pplantinga and others added 4 commits September 11, 2024 22:17

Add initial draft of f0 estimation

670784c

Finish initial draft of f0 algorithm

473f5d5

Finally got a failed test, hooray!

6a84c90

Finally passed the test, hooray!

30e6b6b

pplantinga added the enhancement New feature or request label Sep 17, 2024

pplantinga self-assigned this Sep 17, 2024

pplantinga and others added 8 commits September 18, 2024 17:04

First draft of jitter and shimmer

489d700

Fix boundaries and average lag calculation

efcf3c6

Add harmonicity-to-noise ratio and use it for voice detection

6fdec38

Update harmonicity-to-noise ratio using formula, change detection for…

18359f4

…mula

Update comment to match latest code

6e1b387

Match format of HNR (dB instead of ratio)

f52beff

Match PRAAT more closely

1e01709

Use jitter rather than power ratio for voice detection

722f6f7

pplantinga added 7 commits October 15, 2024 12:57

Update vocal analysis feature computation algorithm to be more accura…

ba91d6f

…te and 100x faster

Add GNE measure

7890bc6

Fix doctest on cross correlation

7483d19

Jitter/shimmer measures fix for including positive and negative peaks

a0f513b

Undo incorrect fix to jitter/shimmer

2d08998

Merge branch 'develop' into voice-analysis

594cce1

Add tutorial for vocal features

486738e

pplantinga marked this pull request as ready for review October 29, 2024 20:23

Merge branch 'develop' into voice-analysis

9d5bc44

pplantinga requested review from mravanelli and ycemsubakan October 29, 2024 20:56

pplantinga added 2 commits November 7, 2024 11:01

Ensure values remain finite for HNR and GNE

8373cdd

Make features compatible with running in batch mode

6e02b58

ycemsubakan and others added 14 commits November 18, 2024 14:27

added some more comments

675b7e3

Merge branch 'develop' into voice-analysis

1728a57

Add options to convert vocal features to log scale

03dd159

Improve accuracy without harming speed

645b2f0

Fix wrong device and size

8177326

Fix test due to behavior change (returns log score by default)

925289b

Merge branch 'develop' into voice-analysis

4c96b41

Ensure neighboring periods fit in analysis window

42c168d

Add spectral features

cb4b3b4

Move main vocal feature extraction code to lobes

b18b278

Add examples to all vocal features functions

8ff6724

Update vocal features unittests to latest code

2ca4234

Merge branch 'develop' into voice-analysis

8062f54

Merge branch 'develop' into voice-analysis

8708786

pplantinga added 4 commits January 9, 2025 16:18

Merge branch 'develop' into voice-analysis

547a8df

Merge branch 'develop' into voice-analysis

52b2133

Merge branch 'develop' into voice-analysis

a8952c7

Merge branch 'develop' into voice-analysis

35d0437

pplantinga added this to the v1.0.3 milestone Feb 14, 2025

Merge branch 'develop' into voice-analysis

3a7da12

bcordel reviewed Feb 26, 2025

View reviewed changes

pplantinga added 2 commits February 26, 2025 16:01

Documentation fixes and removing/fixing hard-coded numbers

181e8df

Merge branch 'develop' into voice-analysis

6234208

Add clearer explanations of the various measures in the vocal feature…

1536bb6

… tutorial

ycemsubakan approved these changes Mar 6, 2025

View reviewed changes

Add labels to all numbers and charts

8e7eaac

ycemsubakan merged commit 2b3e767 into speechbrain:develop Mar 6, 2025
5 checks passed

pplantinga deleted the voice-analysis branch March 6, 2025 18:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Voice analysis functions#2689

Voice analysis functions#2689
ycemsubakan merged 56 commits intospeechbrain:developfrom
pplantinga:voice-analysis

pplantinga commented Sep 17, 2024 •

edited

Loading

Uh oh!

TParcollet commented Oct 12, 2024

Uh oh!

pplantinga commented Oct 14, 2024

Uh oh!

pplantinga commented Oct 29, 2024

Uh oh!

pplantinga commented Jan 7, 2025

Uh oh!

bcordel left a comment

Uh oh!

pplantinga commented Feb 27, 2025

Uh oh!

ycemsubakan left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

pplantinga commented Sep 17, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

TParcollet commented Oct 12, 2024

Uh oh!

pplantinga commented Oct 14, 2024

Uh oh!

pplantinga commented Oct 29, 2024

Uh oh!

pplantinga commented Jan 7, 2025

Uh oh!

bcordel left a comment

Choose a reason for hiding this comment

Uh oh!

pplantinga commented Feb 27, 2025

Uh oh!

ycemsubakan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pplantinga commented Sep 17, 2024 •

edited

Loading