GitHub Trending

FireRedTeam/FireRedASR2S — FireRedASR2S is a SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID

40 sec read14 views0 listens

217 stars | 8 forks | Python

FireRedASR2S is a SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and singing lyrics recognition. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects.

What it does

FireRedASR2S is a cutting-edge ASR system that integrates speech recognition, voice activity detection, language identification, and punctuation prediction. Its high accuracy across multiple languages and dialects makes it a valuable tool for developers and researchers in the speech technology domain.

Why it matters: Unlock the power of speech technology with FireRedASR2S - the all-in-one solution for accurate and efficient speech processing!

View on GitHub

Want to create content about this repo? Use Nemati AI tools to generate articles, tutorials, and social posts.

QwenLM/Qwen3-ASR — Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at A

1,529 stars | 125 forks | Python Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction. What it does Qwe...

Ali Nemati

GitHub Trending4 hours ago29 sec read

openai/whisper — Robust Speech Recognition via Large-Scale Weak Supervision

101,763 stars | 12,435 forks | Python Robust Speech Recognition via Large-Scale Weak Supervision What it does Whisper is a versatile speech recognition model developed by OpenAI, capable of multilingual transcription, translation, and language identi...

Ali Nemati

GitHub TrendingFeb 2138 sec read

TrevorS/voxtral-mini-realtime-rs — Streaming speech recognition running natively and in the browser. A pure Rust im

605 stars | 24 forks | Rust Streaming speech recognition running natively and in the browser. A pure Rust implementation of Mistral's Voxtral Mini 4B Realtime model using the Burn ML framework. What it does Voxtral Mini 4B Realtime is a Rust-based st...

Ali Nemati

GitHub Trending1 day ago39 sec read

Panniantong/Agent-Reach — Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddi

21,418 stars | 1,852 forks | Python Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees. What it does Agent Reach is a Python tool that allows AI agent...

Ali Nemati

GitHub Trending1 day ago29 sec read

MemPalace/mempalace — The best-benchmarked open-source AI memory system. And it's free.

53,741 stars | 7,073 forks | Python The best-benchmarked open-source AI memory system. And it's free. What it does MemPalace is an open-source AI memory system that stores conversation history verbatim and retrieves it with semantic search, offering ...

Ali Nemati

FireRedTeam/FireRedASR2S — FireRedASR2S is a SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID

What it does

Related Articles

QwenLM/Qwen3-ASR — Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at A

openai/whisper — Robust Speech Recognition via Large-Scale Weak Supervision

TrevorS/voxtral-mini-realtime-rs — Streaming speech recognition running natively and in the browser. A pure Rust im

Panniantong/Agent-Reach — Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddi

MemPalace/mempalace — The best-benchmarked open-source AI memory system. And it's free.