modif on readme

Esse commit está contido em:
Anatoli-deBRADKE
2019-06-17 09:00:45 +02:00
commit a652055fe7
4 arquivos alterados com 8 adições e 8 exclusões

Antes

Largura:  |  Altura:  |  Tamanho: 853 KiB

Depois

Largura:  |  Altura:  |  Tamanho: 853 KiB

+1 -1
Ver Arquivo
@@ -1,7 +1,7 @@
# Speech Emotion Recognition
![image](audio_app.png)
![image](Images/audio_app.png)
The aim of this section is to explore speech emotion recognition techniques from an audio recording.
+4 -4
Ver Arquivo
@@ -1,6 +1,6 @@
# Text-based Personality Traits Recognition
![image](text_app.png)
![image](/00-Presentation/text_app.png)
In this section you will find all resources, models and Python scripts relative to text-based personality traits recognition.
@@ -47,7 +47,7 @@ Gensim : 3.4.0
## Pipeline
![image](/Presentation/Images/text_pipeline.png)
![image](/00-Presentation/Images/text_pipeline.png)
The text-based personality recognition pipeline has the following structure :
- Text data retrieving
@@ -66,7 +66,7 @@ The text-based personality recognition pipeline has the following structure :
## Model
We have chosen a neural network architecture based on both one-dimensional convolutional neural networks and recurrent neural networks.
The one-dimensional convolution layer plays a role comparable to feature extraction : it allows finding patterns in text data. The Long-Short Term Memory cell is then used in order to leverage on the sequential nature of natural language : unlike regular neural network where inputs are assumed to be independent of each other, these architectures progressively accumulate and capture information through the sequences. LSTMs have the property of selectively remembering patterns for long durations of time.
The one-dimensional convolution layer plays a role comparable to feature extraction : it allows finding patterns in text data. The Long-Short Term Memory cell is then used in order to leverage on the sequential nature of natural language : unlike regular neural network where inputs are assumed to be independent of each other, these architectures progressively accumulate and capture information through the sequences. LSTMs have the property of selectively remembering patterns for long durations of time.
Our final model first includes 3 consecutive blocks consisting of the following four layers : one-dimensional convolution layer - max pooling - spatial dropout - batch normalization. The numbers of convolution filters are respectively 128, 256 and 512 for each block, kernel size is 8, max pooling size is 2 and dropout rate is 0.3.
Following the three blocks, we chose to stack 3 LSTM cells with 180 outputs each. Finally, a fully connected layer of 128 nodes is added before the last classification layer.
@@ -74,4 +74,4 @@ Following the three blocks, we chose to stack 3 LSTM cells with 180 outputs each
We tried different baseline models in order to assess the performance of our final architecture. Here are the accuracies of the different models.
![image](/Presentation/Images/perf_text_final.png)
![image](/00-Presentation/Images/perf_text_final.png)
+3 -3
Ver Arquivo
@@ -6,7 +6,7 @@ Don't hesitate to ⭐ the repo if you enjoy our work !
## In a nutshell
We developped a multimodal emotion recognition platform to analyze the emotions of job candidates, in partnership with the French Employment Agency.
We developped a multimodal emotion recognition platform to analyze the emotions of job candidates, in partnership with the French Employment Agency.
We analye facial, vocal and textual emotions, using mostly deep learning based approaches. We deployed a web app using Flask :
@@ -117,7 +117,7 @@ To limit overfitting, we tuned the model with :
- And kept the best model
<p align="center">
<img src="/Presentation/Images/Accuracy_Speech.png" width="400" height="400" />
<img src="/00-Presentation/Images/Accuracy_Speech.png" width="400" height="400" />
</p>
### c. [Video Analysis](https://github.com/maelfabien/Multimodal-Emotion-Recognition/tree/master/Video)
@@ -199,5 +199,5 @@ https://www.overleaf.com/read/xvtrrfpvzwhf
<td align="center"><a href="https://github.com/STF-R"><img src="https://avatars0.githubusercontent.com/u/43505879?v=4" width="100px;" alt="mfix22"/>
<br /><sub><b>STF-R</b></sub>
</a><br /><a href="https://github.com/maelfabien/Multimodal-Emotion-Recognition/commits?author=STF-R" title="Code">💻</a></td>
</tr></table>