SER

Speech Emotion Recognition (SER): given a speech recording as input, predict the emotion it expresses.

What is RAVDESS?

The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS) contains 7356 files (total size: 24.8 GB). The database features 24 professional actors (12 female, 12 male) vocalizing two lexically matched statements in a neutral North American accent. Speech includes calm, happy, sad, angry, fearful, surprise, and disgust expressions, and song includes calm, happy, sad, angry, and fearful emotions. Each expression is produced at two levels of emotional intensity (normal, strong), with an additional neutral expression. This project uses only the speech recordings of the actors.

Link to download RAVDESS

File summary

In total, the RAVDESS collection includes 7356 files (2880 + 2024 + 1440 + 1012 files).

File naming convention

Each of the 7356 RAVDESS files has a unique filename. The filename consists of a 7-part numerical identifier (e.g., 02-01-06-01-02-01-12.mp4). These identifiers define the stimulus characteristics:

Filename identifiers

Modality (01 = full-AV, 02 = video-only, 03 = audio-only).

Vocal channel (01 = speech, 02 = song).

Emotion (01 = neutral, 02 = calm, 03 = happy, 04 = sad, 05 = angry, 06 = fearful, 07 = disgust, 08 = surprised).

Emotional intensity (01 = normal, 02 = strong). NOTE: There is no strong intensity for the 'neutral' emotion.

Statement (01 = "Kids are talking by the door", 02 = "Dogs are sitting by the door").

Repetition (01 = 1st repetition, 02 = 2nd repetition).

Actor (01 to 24. Odd numbered actors are male, even numbered actors are female).

Filename example: 02-01-06-01-02-01-12.mp4

Video-only (02)

Speech (01)

Fearful (06)

Normal intensity (01)

Statement "dogs" (02)

1st repetition (01)

12th actor (12), female, as the actor ID number is even.
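The naming convention above can be decoded mechanically. The following is a minimal sketch in Python, not code taken from this repository: the names `RavdessInfo`, `parse_ravdess_filename`, and `is_audio_speech` are hypothetical, and the lookup tables simply restate the identifier values listed above.

```python
from dataclasses import dataclass
from pathlib import Path

# Lookup tables copied from the filename identifier list above.
MODALITY = {"01": "full-AV", "02": "video-only", "03": "audio-only"}
VOCAL_CHANNEL = {"01": "speech", "02": "song"}
EMOTION = {
    "01": "neutral", "02": "calm", "03": "happy", "04": "sad",
    "05": "angry", "06": "fearful", "07": "disgust", "08": "surprised",
}
INTENSITY = {"01": "normal", "02": "strong"}
STATEMENT = {
    "01": "Kids are talking by the door",
    "02": "Dogs are sitting by the door",
}


@dataclass(frozen=True)
class RavdessInfo:
    """Decoded fields of one RAVDESS filename (hypothetical helper type)."""
    modality: str
    vocal_channel: str
    emotion: str
    intensity: str
    statement: str
    repetition: int
    actor: int
    actor_sex: str


def parse_ravdess_filename(filename: str) -> RavdessInfo:
    """Split a 7-part RAVDESS identifier into labelled fields."""
    parts = Path(filename).stem.split("-")
    if len(parts) != 7:
        raise ValueError(f"expected 7 identifier parts, got {len(parts)}: {filename!r}")
    modality, channel, emotion, intensity, statement, repetition, actor = parts
    actor_id = int(actor)
    return RavdessInfo(
        modality=MODALITY[modality],
        vocal_channel=VOCAL_CHANNEL[channel],
        emotion=EMOTION[emotion],
        intensity=INTENSITY[intensity],
        statement=STATEMENT[statement],
        repetition=int(repetition),
        actor=actor_id,
        # Odd-numbered actors are male, even-numbered actors are female.
        actor_sex="female" if actor_id % 2 == 0 else "male",
    )


def is_audio_speech(info: RavdessInfo) -> bool:
    """True for audio-only speech files, the subset a speech-only project would keep."""
    return info.modality == "audio-only" and info.vocal_channel == "speech"


if __name__ == "__main__":
    print(parse_ravdess_filename("02-01-06-01-02-01-12.mp4"))
```

Keeping the emotion as a decoded label makes it straightforward to use as the classification target, and a filter such as `is_audio_speech` is one way to restrict the dataset to speech recordings.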

The MLP classifier has an accuracy of 76.25%.

The CNN model has an accuracy of 82.25%.
