The all-in-one AI library for Persian, supporting a wide variety of tasks and modalities!
A large-scale validated database for Persian speech emotion detection.
Persian spoken digit recognition.
A dataset of informal Persian audio and text chunks, with a fully open processing pipeline, suitable for ASR and TTS tasks. Built from content crawled from virgool.io.
A keyword spotting (KWS) system for conversational Persian (Farsi) speech, built on a fine-tuned version of wav2vec2-xlsr-large.