audeering/opensmile-python

Stars: 182

Forks: 30

Pull Requests: 38

Issues: 48

Watchers: 9

Last Updated: 2023-07-21 07:29:54

#machine-learning #feature-extraction #audio

Python package for openSMILE

License: Other

Languages: Python, PHP, C++, HTML

https://audeering.github.io/opensmile-python/

openSMILE Python

Python interface for extracting openSMILE features.

$ pip install opensmile

Note

Only 64-bit Python is supported.

Feature sets

Currently, three standard sets are supported. ComParE 2016 is the largest, with more than 6,000 features. The smaller sets GeMAPS and eGeMAPS come in variants v01a, v01b, and v02 (eGeMAPS only). We suggest using the latest version unless backward compatibility with the original papers is required.

Each feature set can be extracted on two levels:

Low-level descriptors (LLD)
Functionals

For ComParE 2016 a third level is available:

LLD deltas

Note

Before v2.0.0, some LLDs of the GeMAPS family were incorrectly output as deltas. This was corrected in v2.0.0, and these features are now returned as LLDs. As a consequence, deltas are no longer available for the GeMAPS family as of v2.0.0.
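The levels above correspond to members of opensmile.FeatureLevel. The following is a minimal sketch of a helper that selects a set and a level by name and rejects the GeMAPS-family delta combination described in the note. The helper name make_extractor and the VALID_LEVELS tuple are illustrative and not part of the package; the member names LowLevelDescriptors and LowLevelDescriptors_Deltas are assumed from the package documentation rather than taken from this README.

```python
VALID_LEVELS = (
    "LowLevelDescriptors",
    "LowLevelDescriptors_Deltas",
    "Functionals",
)


def make_extractor(feature_set: str, level: str):
    """Return an opensmile.Smile object for a set and level given by name."""
    if level not in VALID_LEVELS:
        raise ValueError(f"Unknown level {level!r}, expected one of {VALID_LEVELS}")
    # With v2.0.0 the GeMAPS family (GeMAPS and eGeMAPS) no longer offers deltas.
    if level == "LowLevelDescriptors_Deltas" and "GeMAPS" in feature_set:
        raise ValueError(f"{feature_set} has no LLD deltas with v2.0.0")

    # Imported lazily so that the name checks above run without loading the library.
    import opensmile

    return opensmile.Smile(
        feature_set=getattr(opensmile.FeatureSet, feature_set),
        feature_level=getattr(opensmile.FeatureLevel, level),
    )
```

For example, make_extractor("ComParE_2016", "LowLevelDescriptors_Deltas") would request the third level, which among the three standard sets is only listed for ComParE 2016.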

The following table lists the number of features for each set and level.

With v2.0.0

Name	#features (LLDs / LLD deltas / Functionals)
ComParE_2016	65 / 65 / 6373
GeMAPSv01a	18 / - / 62
GeMAPSv01b	18 / - / 62
eGeMAPSv01a	23 / - / 88
eGeMAPSv01b	23 / - / 88
eGeMAPSv02	25 / - / 88

Note

Additional feature sets have been added by the community. For a full list please see the documentation of opensmile.FeatureSet.

Pre v2.0.0

Name	#features (LLDs / LLD deltas / Functionals)
ComParE_2016	65 / 65 / 6373
GeMAPSv01a	5 / 13 / 62
GeMAPSv01b	5 / 13 / 62
eGeMAPSv01a	10 / 13 / 88
eGeMAPSv01b	10 / 13 / 88
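The two tables can be cross-checked with a few lines of Python. The sketch below copies the counts from the tables into plain dictionaries (the names WITH_V2, PRE_V2, and moved_to_lld are illustrative) and tests whether the pre-v2.0.0 LLD and delta counts add up to the v2.0.0 LLD count, which is what the note about LLDs having been output as deltas suggests.

```python
# Feature counts per set as (LLDs, LLD deltas, functionals); None means not available.
WITH_V2 = {
    "ComParE_2016": (65, 65, 6373),
    "GeMAPSv01a": (18, None, 62),
    "GeMAPSv01b": (18, None, 62),
    "eGeMAPSv01a": (23, None, 88),
    "eGeMAPSv01b": (23, None, 88),
    "eGeMAPSv02": (25, None, 88),
}
PRE_V2 = {
    "ComParE_2016": (65, 65, 6373),
    "GeMAPSv01a": (5, 13, 62),
    "GeMAPSv01b": (5, 13, 62),
    "eGeMAPSv01a": (10, 13, 88),
    "eGeMAPSv01b": (10, 13, 88),
}


def moved_to_lld(name: str) -> bool:
    """Check whether pre-v2.0.0 LLDs plus deltas equal the v2.0.0 LLD count."""
    old_lld, old_deltas, _ = PRE_V2[name]
    new_lld, new_deltas, _ = WITH_V2[name]
    return new_deltas is None and old_lld + old_deltas == new_lld


for name in PRE_V2:
    print(name, moved_to_lld(name))
```

The check holds for the four GeMAPS and eGeMAPS variants that appear in both tables and not for ComParE 2016, whose delta level is unchanged.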

Code example

The following example extracts ComParE 2016 functionals from an audio file:

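import opensmile

# Configure the extractor for ComParE 2016 functionals
smile = opensmile.Smile(
    feature_set=opensmile.FeatureSet.ComParE_2016,
    feature_level=opensmile.FeatureLevel.Functionals,
)
# Extract the features from the file; the result is a pandas.DataFrame
y = smile.process_file('audio.wav')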
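Features can also be extracted from a signal that is already in memory. The sketch below assumes the process_signal method of opensmile.Smile and a NumPy array as input, neither of which is shown in this README; sine_tone and extract_functionals are hypothetical helper names used only for illustration.

```python
import math


def sine_tone(frequency_hz: float, duration_s: float, sampling_rate: int) -> list:
    """Generate a sine tone with amplitude 0.5 as a list of float samples."""
    num_samples = int(duration_s * sampling_rate)
    return [
        0.5 * math.sin(2 * math.pi * frequency_hz * i / sampling_rate)
        for i in range(num_samples)
    ]


def extract_functionals(samples, sampling_rate: int):
    """Extract ComParE 2016 functionals from samples held in memory."""
    # Imported lazily so that sine_tone stays usable without these packages.
    import numpy as np
    import opensmile

    smile = opensmile.Smile(
        feature_set=opensmile.FeatureSet.ComParE_2016,
        feature_level=opensmile.FeatureLevel.Functionals,
    )
    # Assumption: process_signal accepts a NumPy array and the sampling rate in Hz.
    signal = np.asarray(samples, dtype=np.float32)
    return smile.process_signal(signal, sampling_rate)


# Example call (requires opensmile and numpy to be installed):
#   features = extract_functionals(sine_tone(440.0, 1.0, 16000), 16000)
```

Since functionals summarize the whole signal, the result should contain a single row; going by the feature table above, 6373 columns would be expected.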

License

openSMILE follows a dual-licensing model. Since the main goal of the project is widespread use of the software to facilitate research in the field of machine learning from audio-visual signals, the source code and binaries are freely available for private, research, and educational use under an open-source license (see LICENSE). The open-source version of openSMILE may not be used for any sort of commercial product. Fundamental research in companies, for example, is permitted, but if a product is the result of the research, we require you to buy a commercial development license. Contact us at [email protected] (or visit us at https://www.audeering.com) for more information.

Original authors: Florian Eyben, Felix Weninger, Martin Wöllmer, Björn Schuller

Citing

If you use openSMILE in your publications, please cite the following paper:

Florian Eyben, Martin Wöllmer, Björn Schuller: "openSMILE - The Munich Versatile and Fast Open-Source Audio Feature Extractor", Proc. ACM Multimedia (MM), ACM, Florence, Italy, ISBN 978-1-60558-933-6, pp. 1459-1462, 25.-29.10.2010.

OPEN ISSUES

RuntimeError: Input signal has 2 channels, but 'num_channels' to set to 1. by @AliChaudhry8
Is it possible to extract frame-wise features with the parameter "options"? by @00001101-xt
AttributeError: module 'opensmile' has no attribute 'Smail' by @sumhncku
How to adapt a opensmile config to pyopensmile? by @zyznull
AttributeError: module 'opensmile' has no attribute 'core' by @ckchandler
flag to return only file index by @felixbur
Extraction of fixed windows for LLD by @giorgiolbt
Convert old configuration files to the new python configuration by @aviasd
sox: not found SoX could not be found! by @fmnbijbzq
State supported Linux distributions in the docs by @hagenw
About opensmile-python multithreading by @ykingliu
I want to change the window width and step width of the frame in openSMILE by @Zinc0816
opensmile-python in real time by @ThomasJanssoone
Unsupported features for 16-bit PCM WAV by @jasondraether
version `GLIBC_2.27' not found by @eli7gn
Why is the last row's frame size different from others when extracting LLD? by @Epholy
openSMILE isn't Open Source by @LourensVeen
Use emobase2010.conf on python by @ZLDA22
UnicodeEncodeError by @felixbur
OSError when importing opensmile on MacBook M1 laptop by @Omer80
Change framesize and framestep in ComParE_2016 config file by @rodaslemos
Scale of the loudness feature in the eGeMAPS set by @YangLiyli131
Add support for Apple M1 architecture by @hagenw
OSError: /path_to_file/libaudresample.so: cannot open shared object file: No such file or directory by @funnyshape
change frame size for LLD extraction by @naufalrif
How to convert LLDs into Functionals？ by @Shen-JK
About emobase2010.conf by @zhanglina94
no maxPitch and voicingCutoff constraints in eGeMAPS v01b config by @reichelu

RELEASES

Release v2.4.1 by @github-actions[bot]
Release v2.2.0 by @github-actions[bot]
Release v2.1.3 by @github-actions[bot]
Release v2.1.2 by @github-actions[bot]
Release v2.1.1 by @github-actions[bot]
Release v2.1.0 by @github-actions[bot]
Release v2.0.2 by @github-actions[bot]
Release v2.0.0 by @github-actions[bot]
Release v1.0.1 by @github-actions[bot]
opensmile-python 1.0.0 by @hagenw
Release v2.4.2 by @github-actions[bot]
