py-intelligibility

Python implementation of the following speech intelligibility prediction methods: weighted Spectro-Temporal Modulation Index (wSTMI) Spectro-Temporal Glimpsing Index (STGI)

Usage

The functions wstmi and stgi take three inputs:

d = pywstmi(clean_speech, degraded_speech, sampling_frequency)
d = pystgi(clean_speech, degraded_speech, sampling_frequency)

clean_speech: A numpy array containing a single-channel clean (reference) speech signal.
degraded_speech: A numpy array containing a single-channel degraded/processed speech signal.
sampling_frequency: The sampling frequency of the input signals in Hz.

Note that the clean and degraded speech signals must be time-aligned and of the same length.

References

If you use pywstmi or pystgi, please cite the references [1] and [2], respectively:

[1] A. Edraki, W.-Y. Chan, J. Jensen, & D. Fogerty, “Speech Intelligibility Prediction Using Spectro-Temporal Modulation Analysis,” IEEE/ACM Trans. Audio, Speech, & Language Processing, vol. 29, pp. 210-225, 2021.
[2] A. Edraki, W.-Y. Chan, J. Jensen, & D. Fogerty, “A Spectro-Temporal Glimpsing Index (STGI) for Speech Intelligibility Prediction," Proc. Interspeech, 5 pages, Aug 2021.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
py-intelligibility		py-intelligibility
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

py-intelligibility

py-intelligibility

LICENSE

LICENSE

README.md

README.md

Repository files navigation

py-intelligibility

Usage

References

About

Releases

Packages

Languages

License

aminEdraki/py-intelligibility

Folders and files

Latest commit

History

Repository files navigation

py-intelligibility

Usage

References

About

Topics

Resources

License

Stars

Watchers

Forks

Languages