Page cover

Speech Recognition 语音识别输入

Uses Sherpa-ncnn:

Supported functions

Real-time Speech recognition
Voice activity detection

✔️

✔️

Supported platforms

Architecture
Android
iOS
Windows
macOS
linux

x64

✔️

✔️

✔️

✔️

x86

✔️

✔️

arm64

✔️

✔️

✔️

✔️

✔️

arm32

✔️

✔️

riscv64

✔️

Supported programming languages

1. C++
2. C
3. Python
4. JavaScript

✔️

✔️

✔️

✔️

5. Go
6. C#
7. Kotlin
8. Swift

✔️

✔️

✔️

✔️

It also supports WebAssembly.

Introduction

This repository supports running the following functions locally

  • Streaming speech-to-text (i.e., real-time speech recognition)

  • VAD (e.g., silero-vad)

on the following platforms and operating systems:

with the following APIs

  • C++, C, Python, Go, C#

  • Kotlin

  • JavaScript

  • Swift

We support all platforms that ncnn supports.

Everything can be compiled from source with static link. The generated executable depends only on system libraries.

HINT: It does not depend on PyTorch or any other inference frameworks other than ncnn.

Please see the documentation https://k2-fsa.github.io/sherpa/ncnn/index.html for installation and usages, e.g.,

  • How to build an Android app

  • How to download and use pre-trained models

We provide a few YouTube videos for demonstration about real-time speech recognition with sherpa-ncnn using a microphone:

Description
URL

Streaming speech recognition

https://github.com/k2-fsa/sherpa-ncnn/releases/tag/models

How to reach us

Please see https://k2-fsa.github.io/sherpa/social-groups.html for 新一代 Kaldi 微信交流群 and QQ 交流群.

Last updated