Skip to content

Tobibiggest/Real-Time-Audio-Transcription-and-Sentiment-Analysis-Using-AI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Real-Time Audio Transcription and Sentiment Analysis Using AI

This project combines the power of FasterWhisper for live transcription and Hugging Face's Sentiment Analysis pipeline to provide real-time feedback on both transcription accuracy and the emotional tone of the spoken content. The entire project is integrated into a user-friendly Gradio interface for seamless interaction.


📜 Overview

This application listens to audio input, transcribes it into text, and analyzes the emotional sentiment of each sentence in real-time. The model is designed to provide instant feedback, making it ideal for a wide range of use cases across industries like customer service, healthcare, education, and media.


🚀 Features

  • Real-Time Transcription: Using FasterWhisper for accurate and fast transcription.
  • Sentiment Analysis: Hugging Face's pipeline to detect positive, negative, or neutral sentiment.
  • Live Gradio Interface: An easy-to-use interface for real-time interaction and output display.
  • Customizable Outputs: Modify the application to suit specific use cases in customer service, healthcare, media, etc.

🖼 Project Demo

Screenshots

Screenshot (64)

Screenshot (63)

Sneak Peek Video

GIThub.Trancription.mp4

💻 Installation

  1. Clone the repository:

    git clone https://github.com/tobibiggest/real-time-audio-transcription-sentiment.git
  2. Navigate to the project folder:

    cd real-time-audio-transcription-sentiment
  3. Install dependencies:

    pip install -r requirements.txt
  4. Run the application:

    python app.py

🛠 Technologies Used

  • Python
  • FasterWhisper
  • Hugging Face Transformers
  • Gradio

✨ Future Development

  • Multilingual transcription support
  • Advanced emotion detection
  • Video integration for analyzing visual and audio data
  • Custom sentiment analysis for specific industries

🤝 Contributing

Feel free to submit issues or pull requests to improve the project! Contributions are always welcome.


📜 License

This project is licensed under the MIT License. See the LICENSE file for more information.


🙌 Acknowledgments

Special thanks to the Hugging Face and OpenAI teams for providing the tools to build this project.


Feel free to customize the image and video paths after uploading the corresponding files to your repository.

About

Real-Time Audio Transcription and Sentiment Analysis Using FasterWhisper, Hugging Face Transformers and Gradio

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published