modif on readme
@@ -1,7 +1,7 @@
# Speech Emotion Recognition

The aim of this section is to explore speech emotion recognition techniques from an audio recording.
+4
-4
@@ -1,6 +1,6 @@
# Text-based Personality Traits Recognition

In this section you will find all resources, models and Python scripts related to text-based personality trait recognition.
@@ -47,7 +47,7 @@ Gensim : 3.4.0
## Pipeline

The text-based personality recognition pipeline has the following structure:
- Text data retrieval
@@ -66,7 +66,7 @@ The text-based personality recognition pipeline has the following structure :
## Model
We have chosen a neural network architecture based on both one-dimensional convolutional neural networks and recurrent neural networks.
The one-dimensional convolution layer plays a role comparable to feature extraction: it finds local patterns in the text data. The Long Short-Term Memory (LSTM) cell then leverages the sequential nature of natural language: unlike regular neural networks, where inputs are assumed to be independent of each other, these architectures progressively accumulate and capture information across the sequence. LSTMs can selectively remember patterns over long durations.
The one-dimensional convolution layer plays a role comparable to feature extraction: it finds local patterns in the text data. The Long Short-Term Memory (LSTM) cell then leverages the sequential nature of natural language: unlike regular neural networks, where inputs are assumed to be independent of each other, these architectures progressively accumulate and capture information across the sequence. LSTMs can selectively remember patterns over long durations.
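
To make the "feature extraction" role concrete, here is a minimal pure-Python sketch of a one-dimensional convolution followed by max pooling. It is only an illustration: the toy `signal` and `pattern` values are invented, the input is a sequence of scalars rather than the embedded tokens a real text model would see, and no padding is assumed.

```python
def conv1d(sequence, kernel):
    """Slide `kernel` over `sequence` (no padding) and return the dot products."""
    k = len(kernel)
    return [
        sum(sequence[i + j] * kernel[j] for j in range(k))
        for i in range(len(sequence) - k + 1)
    ]


def max_pool1d(sequence, pool_size):
    """Non-overlapping max pooling; a trailing partial window is dropped."""
    return [
        max(sequence[i:i + pool_size])
        for i in range(0, len(sequence) - pool_size + 1, pool_size)
    ]


# Hypothetical toy data: the shape "1, 2, 1" occurs twice in the signal.
signal = [0, 0, 1, 2, 1, 0, 0, 1, 2, 1]
pattern = [1, 2, 1]

feature_map = conv1d(signal, pattern)  # peaks where the pattern matches
pooled = max_pool1d(feature_map, 2)    # halves the length, keeps the peaks
```

The feature map peaks at the two positions where the pattern occurs, and pooling keeps those peaks while shortening the sequence.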
Our final model starts with 3 consecutive blocks, each made of the following four layers in order: one-dimensional convolution, max pooling, spatial dropout and batch normalization. The blocks use 128, 256 and 512 convolution filters respectively; the kernel size is 8, the max pooling size is 2 and the dropout rate is 0.3.
Following the three blocks, we chose to stack 3 LSTM cells with 180 outputs each. Finally, a fully connected layer of 128 nodes is added before the last classification layer.
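
The architecture described above can be traced layer by layer to see how the sequence shrinks before it reaches the LSTMs. The sketch below is pure Python and only computes output shapes. The filter counts, kernel size, pooling size, LSTM width and dense width come from the text; the input length, embedding size, number of classes, padding mode, and the choice that only the last LSTM returns a single vector are assumptions made for illustration.

```python
def trace_shapes(seq_len, embed_dim, n_classes, padding="valid"):
    """Return (layer_name, output_shape) pairs for the described model.

    seq_len, embed_dim, n_classes and padding are hypothetical inputs;
    the README does not state them.
    """
    filters = (128, 256, 512)  # from the README
    kernel_size = 8            # from the README
    pool_size = 2              # from the README

    shapes = [("input", (seq_len, embed_dim))]
    length = seq_len
    for n, n_filters in enumerate(filters, start=1):
        if padding == "valid":
            length = length - kernel_size + 1  # no padding: lose kernel_size - 1 steps
        elif padding != "same":
            raise ValueError("padding must be 'valid' or 'same'")
        if length < pool_size:
            raise ValueError("input sequence too short for this architecture")
        shapes.append((f"block{n}_conv1d", (length, n_filters)))
        length //= pool_size
        shapes.append((f"block{n}_max_pool", (length, n_filters)))
        # Spatial dropout (rate 0.3) and batch normalization keep the shape.

    # Assumed: the first two LSTMs return full sequences, the last one
    # returns only its final output.
    shapes.append(("lstm_1", (length, 180)))
    shapes.append(("lstm_2", (length, 180)))
    shapes.append(("lstm_3", (180,)))
    shapes.append(("dense", (128,)))
    shapes.append(("classifier", (n_classes,)))
    return shapes


# Hypothetical sizes: 300 tokens, 300-dimensional embeddings, 5 classes.
for name, shape in trace_shapes(seq_len=300, embed_dim=300, n_classes=5):
    print(f"{name:18s} {shape}")
```

With unpadded convolutions, each block removes 7 time steps and then halves the length, so short inputs can run out of time steps before the third block; `padding="same"` avoids the first effect.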
@@ -74,4 +74,4 @@ Following the three blocks, we chose to stack 3 LSTM cells with 180 outputs each
We tried different baseline models in order to assess the performance of our final architecture. Here are the accuracies of the different models.

+3
-3
@@ -6,7 +6,7 @@ Don't hesitate to ⭐ the repo if you enjoy our work !
## In a nutshell
We developed a multimodal emotion recognition platform to analyze the emotions of job candidates, in partnership with the French Employment Agency.
We developed a multimodal emotion recognition platform to analyze the emotions of job candidates, in partnership with the French Employment Agency.
We analyze facial, vocal and textual emotions, using mostly deep-learning-based approaches. We deployed a web app using Flask:
@@ -117,7 +117,7 @@ To limit overfitting, we tuned the model with :
- And kept the best model
<p align="center">
<img src="/Presentation/Images/Accuracy_Speech.png" width="400" height="400" />
<img src="/00-Presentation/Images/Accuracy_Speech.png" width="400" height="400" />
</p>
### c. [Video Analysis](https://github.com/maelfabien/Multimodal-Emotion-Recognition/tree/master/Video)
@@ -199,5 +199,5 @@ https://www.overleaf.com/read/xvtrrfpvzwhf
<td align="center"><a href="https://github.com/STF-R"><img src="https://avatars0.githubusercontent.com/u/43505879?v=4" width="100px;" alt="mfix22"/>
<br /><sub><b>STF-R</b></sub>
</a><br /><a href="https://github.com/maelfabien/Multimodal-Emotion-Recognition/commits?author=STF-R" title="Code">💻</a></td>
</tr></table>