jethrothelion/SimpleSpeachToText
🗣️ SimpleSpeechToText

Python Version License: MIT made-with-python

A simple, lightweight Python tool that transcribes audio to a text file and prints the transcript on the command line, using the nvidia/parakeet-tdt-0.6b-v2 model. It is light on battery and hardware.

Very handy for meetings and lectures on a laptop or a Raspberry Pi.


✨ Features

  • Offline Conversion: Works entirely offline; an internet connection is needed only to download the model.
  • Playback Control: Pause and resume the transcription at any time.
  • Cross-Platform: Tested on Linux and Windows 10.
  • Lightweight: Minimal dependencies, a small footprint, and easy to run.
  • User Input: Add your own text alongside the transcription; it gets a different line header so the two are easy to tell apart.
  • Append or New File: Asks whether to create a new file or append to an existing one.
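
To make the "Playback Control" and "User Input" features concrete, here is a minimal sketch of how pausing and distinct line headers could fit together. It is not the project's actual code: the class `TranscriptLog`, the headers `[speech]` and `[note]`, and the choice to keep accepting typed notes while paused are all assumptions made for illustration.

```python
import io
import threading

# Hypothetical line headers; the real script may use different markers.
TRANSCRIPT_HEADER = "[speech]"
NOTE_HEADER = "[note]"


class TranscriptLog:
    """Writes header-tagged lines to a text stream and supports pause/resume."""

    def __init__(self, stream):
        self._stream = stream
        self._running = threading.Event()
        self._running.set()  # start in the "running" state

    def pause(self):
        self._running.clear()

    def resume(self):
        self._running.set()

    @property
    def paused(self):
        return not self._running.is_set()

    def add_transcription(self, text):
        """Record transcribed speech. Dropped while paused; returns True if written."""
        if self.paused:
            return False
        self._write(TRANSCRIPT_HEADER, text)
        return True

    def add_note(self, text):
        """Record text typed by the user (assumption: written even while paused)."""
        self._write(NOTE_HEADER, text)

    def _write(self, header, text):
        line = f"{header} {text}"
        print(line)                      # echo on the command line
        self._stream.write(line + "\n")  # and persist to the text stream
        self._stream.flush()


def demo():
    """Run a short session against an in-memory buffer and return its contents."""
    buffer = io.StringIO()
    log = TranscriptLog(buffer)
    log.add_transcription("welcome to the lecture")
    log.pause()
    log.add_transcription("this is skipped")
    log.add_note("ask about the deadline")
    log.resume()
    log.add_transcription("any questions")
    return buffer.getvalue()
```

A `threading.Event` is used here because transcription typically runs on a separate thread from keyboard input, so the pause flag needs to be safe to flip from either side.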

🚀 Getting Started

Follow these instructions to get a copy of the project up and running on your local machine.

Prerequisites

I use Python 3.12, but other reasonably recent versions should work.

Installation

  1. Clone the repository:

    git clone https://github.com/jethrothelion/SimpleTextToSpeech.git
    cd SimpleTextToSpeech

  2. Install the required packages: The project relies on NeMo, NumPy, and other libraries. Install them using pip:

    pip install -r requirements.txt
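
After installing, a quick check like the following can confirm that the main dependencies are importable. This is only a sketch: the import names `nemo` and `numpy` are assumed from the libraries mentioned above, and `requirements.txt` remains the authoritative list.

```python
import importlib.util
import sys

# Assumed top-level import names; check requirements.txt for the full list.
REQUIRED_MODULES = ("nemo", "numpy")


def missing_modules(names):
    """Return the top-level module names that cannot be found on the import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def report(names=REQUIRED_MODULES):
    """Build a one-line, human-readable summary of the environment."""
    version = f"{sys.version_info.major}.{sys.version_info.minor}"
    missing = missing_modules(names)
    if missing:
        return f"Python {version}: missing {', '.join(missing)}"
    return f"Python {version}: all required modules found"


if __name__ == "__main__":
    print(report())
```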

💻 Usage

Using the SimpleSpeechToText script is straightforward. Just run the Python file and it will prompt you for everything it needs.

Feel free to build on this very simple file.
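
One of the things the script asks is whether to start a new file or add to an existing one. A minimal sketch of how that choice could map onto Python's file modes is shown below; the function names and the accepted answers (`a` / `append`) are hypothetical and may not match the real prompts.

```python
import tempfile
from pathlib import Path


def choose_mode(answer):
    """Map a user's answer to a file mode: 'a' appends, 'w' starts a new file."""
    return "a" if answer.strip().lower() in ("a", "append") else "w"


def open_transcript(path, answer):
    """Open the transcript file for writing according to the user's answer."""
    return open(path, choose_mode(answer), encoding="utf-8")


def demo():
    """Write one session to a new file, append a second, and return the result."""
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "transcript.txt"
        with open_transcript(path, "new") as f:
            f.write("first session\n")
        with open_transcript(path, "append") as f:
            f.write("second session\n")
        return path.read_text(encoding="utf-8")
```

In an interactive script the `answer` argument would come from something like `input("Append to an existing file or create a new one? ")`.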
