Amirreza Vishteh

I graduated with a Bachelor's degree from Iran University of Science and Technology (IUST). Currently, I am pursuing a Master's degree in Computer Engineering at Sharif University of Technology.

Wav2Vec2 Sentiment Analysis Using Shemo Dataset

Model

Overview

In this project, we fine-tuned the Wav2Vec2 model to perform sentiment analysis based on both voice features and text transcripts from the Shemo dataset. This hybrid approach allows robust emotion recognition using both audio and textual data for classification.

Team Members

Amirreza Vishteh
sentiment analysis in speech
Iran University of Science and Technology
6/09/2024

1. Data Loading

We used the Shemo dataset from Sharif University, which includes .wav audio files paired with corresponding transcripts and emotion labels stored in a JSON file. Data Loading

2. Data Preprocessing

The loaded data was converted into a pandas DataFrame, and paths were verified to ensure file existence. Missing paths were dropped from the dataset. The dataset was split into training (80%) and validation (20%) sets using stratified sampling based on emotion labels. Data Preprocessing

3.Model Configuration and Preprocessing

We loaded a pre-trained Wav2Vec2 model for Persian speech emotion recognition. Configuration was customized to set up the pooling mode and label mappings. Model Configuration and Preprocessing Model Definition

4.Model Definition

We defined a custom Wav2Vec2 model for speech emotion classification, which included a feature extractor and a classification head. Trainer Setup

In the forward() method, hidden states from Wav2Vec2 were pooled, and the resulting tensor was classified into the target emotion label.

6. Trainer Setup

We used Hugging Face’s Trainer class to fine-tune the model. A data collator was implemented for dynamic padding, and evaluation metrics (accuracy, F1-score) were set up.

7. Results

After training the model, we evaluated its performance using the following metrics: Results

The final accuracy was 94%, demonstrating the effectiveness of using both voice features and text transcripts for sentiment analysis.

2024 4
2023 2
2021 2
2020 3

2024

Wav2Vec2 Sentiment Analysis Using Shemo Dataset

1 minute read

Overview In this project, we fine-tuned the Wav2Vec2 model to perform sentiment analysis based on both voice features and text transcripts from the Shemo da...

Psychological Health Chatbot:

1 minute read

Osmium Project Tasks

1 minute read

Overview In this project, we developed an Android application to estimate the location of cellular network cells using Received Signal Strength Indicator (RS...

Multimodal Sentiment Analysis Project(Persian-3classes)

less than 1 minute read

Overview In this project, we explore multimodal sentiment analysis, which involves analyzing both text and image data together. Our goal is to predict sentim...

2023

Project Iridium

1 minute read

Project Iridium

My Internship at the NLP Lab

less than 1 minute read

My Internship at the NLP Lab :

2021

My Heroku project with blazore

less than 1 minute read

Its My Heroku project :

Sonic pi

less than 1 minute read

My sonic pi project : . این پست من مربوط به پوروژه سونیک پای بنده است

Amirreza Vishteh

Wav2Vec2 Sentiment Analysis Using Shemo Dataset

Overview

Team Members

Table of Contents

1. Data Loading

2. Data Preprocessing

3.Model Configuration and Preprocessing

4.Model Definition

6. Trainer Setup

7. Results

2024

Wav2Vec2 Sentiment Analysis Using Shemo Dataset

Psychological Health Chatbot:

Osmium Project Tasks

Multimodal Sentiment Analysis Project(Persian-3classes)

2023

Project Iridium

My Internship at the NLP Lab

2021

My Heroku project with blazore

Sonic pi

2020

مصاحبه

My works and wishes

My web site in lab exam(hacketan)