Wav2Vec2 Sentiment Analysis Using Shemo Dataset
Overview In this project, we fine-tuned the Wav2Vec2 model to perform sentiment analysis based on both voice features and text transcripts from the Shemo da...
In this project, we fine-tuned the Wav2Vec2 model to perform sentiment analysis based on both voice features and text transcripts from the Shemo dataset. This hybrid approach allows robust emotion recognition using both audio and textual data for classification.
We used the Shemo dataset from Sharif University, which includes .wav
audio files paired with corresponding transcripts and emotion labels stored in a JSON file.
The loaded data was converted into a pandas DataFrame, and paths were verified to ensure file existence. Missing paths were dropped from the dataset. The dataset was split into training (80%) and validation (20%) sets using stratified sampling based on emotion labels.
We loaded a pre-trained Wav2Vec2 model for Persian speech emotion recognition. Configuration was customized to set up the pooling mode and label mappings.
We defined a custom Wav2Vec2 model for speech emotion classification, which included a feature extractor and a classification head.
In the forward() method, hidden states from Wav2Vec2 were pooled, and the resulting tensor was classified into the target emotion label.
We used Hugging Face’s Trainer class to fine-tune the model. A data collator was implemented for dynamic padding, and evaluation metrics (accuracy, F1-score) were set up.
After training the model, we evaluated its performance using the following metrics:
The final accuracy was 94%, demonstrating the effectiveness of using both voice features and text transcripts for sentiment analysis.
Overview In this project, we fine-tuned the Wav2Vec2 model to perform sentiment analysis based on both voice features and text transcripts from the Shemo da...
Overview In this project, we developed an Android application to estimate the location of cellular network cells using Received Signal Strength Indicator (RS...
Overview In this project, we explore multimodal sentiment analysis, which involves analyzing both text and image data together. Our goal is to predict sentim...
Project Iridium
My Internship at the NLP Lab :
Its My Heroku project :
My sonic pi project : . این پست من مربوط به پوروژه سونیک پای بنده است
My works and wishes Success secret مصاحبه با جناب اقا پارسا: ایشون بسیار ادم سخت کوشی بودن و گفتن که حتی در دوران دانشجویی شون درس هم میدادن 1 به موار...
Its my favorite university This is oxford: In England The best unniversity. this university is first inthe world. this university have best descover...
Its My Hackathon project :