ASR Client (Python)

Table of contents

Overview
Setup
- Requirements
- Manual submodule update
Install
- Using the provided script
- Manual installation
Usage
- ASR Client

ASR Client (Python)

The gRPC Python client for Techmo ASR Service.

Overview

For project details, its structure, and functionality, head to the documentation.

Setup

The project can be used as-is and does not require any additional setup.

For basic development use, consider convenient ./setup.sh.

Requirements

Python >=3.8
uv (install: curl -LsSf https://astral.sh/uv/install.sh | sh)
PortAudio 19.6.0 (required by the PyAudio Python package)

Manual submodule update

It is the duty of the build configuration to clone all the necessary submodules. However, it sometimes fails, for example, when building a Docker image from an uninitialized repository. In that case, the solution is to download the missing dependencies manually.

Example:

git submodule update --init --depth 1 submodules/asr-api-python

Do not forget about the submodules of the submodules. Eventually, use the --recursive flag.

Install

Using the provided script

./install.sh

Creates a .venv virtualenv with uv and installs the package with its dependencies.

Manual installation

uv venv .venv
source .venv/bin/activate
uv pip install .

If installation fails, the troubleshooting section of the documentation may be helpful.

Usage

ASR Client

Performs speech recognition on an ASR Service instance.

asr_client [-h, --help] [-v, --version] [OPTIONS]... [-s, ]--service-address ADDRESS [-m, ]--audio-mic --audio-stream-chunk-duration ARG
asr_client [-h, --help] [-v, --version] [OPTIONS]... [-s, ]--service-address ADDRESS [-a, ]--audio-paths PATH...

Examples:

perform speech recognition on an audio stream coming from a file

python -m asr_client -s 0.0.0.0:30384 -a ./audio.wav

perform speech recognition on an audio stream coming from a microphone in 200-milliseconds chunks

python -m asr_client -s 0.0.0.0:30384 -m --audio-stream-chunk-duration 200

perform speech recognition on an audio stream coming from a file on a zipformer model named my_zipformer_model with decoder.criterion-type set to S2S and extractor.sampling-frequency set to 16000:

python -m asr_client -s 0.0.0.0:30384 -a ./audio.wav --speech-model my_zipformer_model --decoder.criterion-type S2S --extractor.sampling-frequency 16000

prepend 150 ms of silence to the audio before sending (useful when speech starts immediately at the beginning of the file and the voice activity detector needs a moment to prime):

python -m asr_client -s 0.0.0.0:30384 -a ./audio.wav --audio-prepend-silence 150

append 300 ms of trailing silence (useful to ensure the detector finalises the last utterance):

python -m asr_client -s 0.0.0.0:30384 -a ./audio.wav --audio-append-silence 300

For some more usage scenarios, head to the documentation.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.github/workflows		.github/workflows
asr_client		asr_client
data		data
doc		doc
submodules		submodules
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
.gitmodules		.gitmodules
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
README.md		README.md
docker-entrypoint.sh		docker-entrypoint.sh
install.sh		install.sh
pyproject.toml		pyproject.toml
setup.sh		setup.sh
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ASR Client (Python)

Overview

Setup

Requirements

Manual submodule update

Install

Using the provided script

Manual installation

Usage

ASR Client

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ASR Client (Python)

Overview

Setup

Requirements

Manual submodule update

Install

Using the provided script

Manual installation

Usage

ASR Client

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages