An offline video & audio transcription tool powered by OpenAI Whisper. Convert your tutorials, lectures, and podcasts into accurate text transcripts and use AI to generate summaries, notes, and mind ...
An advanced study tool that transforms raw audio recordings and PDF slides into structured, professional LaTeX university notes. Powered by fast local transcription (Whisper) and Google Gemini AI for ...
Abstract: Despite recent advancements in speech processing, zero-resource speech translation (ST) and automatic speech recognition (ASR) remain challenging problems. In this work, we propose to ...
Abstract: Speech Emotion Recognition (SER) technology analyzes speech characteristics in human-computer interactions to understand user intent and improve interaction experience. It is widely used in ...