Getting Started
Using Cobalt VoiceBio
- A typical VoiceBio release, provided as a compressed archive, contains a Linux binary (voicebio-server) for the required native CPU architecture, an appropriate Dockerfile, and models.
- Cobalt VoiceBio runs either locally on Linux or using Docker.
- Cobalt VoiceBio serves the VoiceBio gRPC API on port 2727.
- To quickly try out VoiceBio, first start the server as shown below, then use the SDK in your preferred language to call VoiceBio from the command line or from within your application.
Info
The cobalt.license.key file is provided separately and must be copied into
the directory that results from decompressing the archive. Please do this before
running the steps below.
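The unpack-and-copy step described in the note above could be scripted roughly as follows. This is a minimal sketch, not part of the release: the helper name `unpack_release` is hypothetical, and it assumes the archive's files sit at the top level of the archive, so the extraction directory is also where the license key belongs.

```python
import shutil
import tarfile
from pathlib import Path


def unpack_release(archive: Path, license_key: Path, dest: Path) -> Path:
    """Extract a .tar.bz2 release into dest and copy the license key into it.

    Assumption: the archive's files sit at its top level, so dest is the
    directory that results from decompressing the archive. If your archive
    wraps everything in a folder, copy the key into that folder instead.
    """
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, mode="r:bz2") as tar:
        tar.extractall(dest)
    # Keep the file name used in the release layout.
    target = dest / "cobalt.license.key"
    shutil.copyfile(license_key, target)
    return target
```

For example, `unpack_release(Path("release.tar.bz2"), Path("cobalt.license.key"), Path("voicebio"))` would leave the extracted release and the key together in `voicebio/`.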
Running VoiceBio Server Locally on Linux
./voicebio-server
- By default, the binary expects a configuration file named voicebio-server.cfg.toml in the same directory. A different config file may be specified using the --config argument.
Running VoiceBio Server as a Docker Container
To build and run the Docker image for VoiceBio, run:
docker build -t cobalt-voicebio .
docker run -p 2727:2727 -p 8080:8080 cobalt-voicebio
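Once the server has been started by either method, a quick way to confirm that something is listening on port 2727 is a plain TCP connection attempt. The sketch below uses only the Python standard library; the function name `port_is_open` is hypothetical, and a successful connection only shows that the port is reachable, not that the gRPC API itself is healthy.

```python
import socket


def port_is_open(host: str = "localhost", port: int = 2727, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolved host names.
        return False
```

Calling `port_is_open()` with the defaults checks localhost:2727, which matches the port mapping used in the `docker run` command above.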
How to Get a Copy of the VoiceBio Server and Models
Contact us to get a release best suited to your requirements.
The release you receive is a compressed archive (tar.bz2) and is generally structured as follows:
release.tar.bz2
├── COPYING
├── README.md
├── voicebio-server
├── voicebio-server.cfg.toml
├── Dockerfile
├── models
│   └── en_US-16khz
│
└── cobalt.license.key [ provided separately, needs to be copied over ]
- The README.md file contains information about this release and instructions for how to start the server on your system.
- The voicebio-server binary is the server program, which is configured using the voicebio-server.cfg.toml file.
- The Dockerfile can be used to create a container that lets you run the VoiceBio server on non-Linux systems such as macOS and Windows.
- The models directory contains the speaker ID models. The contents of this directory will depend on the models you are provided.
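Before starting the server, it can help to check that the unpacked directory contains the entries listed above, including the separately provided license key. The following is a minimal sketch: the names `EXPECTED_ENTRIES` and `missing_entries` are hypothetical, and the expected list is taken from the typical layout shown here, so an actual release may differ.

```python
from pathlib import Path

# Entries taken from the typical release layout; your release may differ.
EXPECTED_ENTRIES = [
    "voicebio-server",
    "voicebio-server.cfg.toml",
    "Dockerfile",
    "models",
    "cobalt.license.key",
]


def missing_entries(release_dir: Path) -> list[str]:
    """Return the expected files or directories that are absent from release_dir."""
    return [name for name in EXPECTED_ENTRIES if not (release_dir / name).exists()]
```

An empty result means every expected entry is present. A result of `["cobalt.license.key"]` would indicate that the license key has not been copied over yet.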
System Requirements
Cobalt VoiceBio runs on Linux, where you can run it directly as a native application.
You can evaluate the product on Windows or Linux using Docker Desktop, but we do not recommend this setup for production environments.
A Cobalt VoiceBio release typically includes a single VoiceBio model together with binaries and config files. The general-purpose VoiceBio models take up to 100 MB of disk space and need a minimum of 2 GB of RAM when evaluating locally. For production workloads, we recommend running containerized instances, each allocated 4 CPUs and 4 GB of RAM.
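When trying out these limits with plain Docker, the `--cpus` and `--memory` options of `docker run` can cap a container's resources. The sketch below only builds the command line. The function name `docker_run_command` is hypothetical, the image name and port mappings are the ones used earlier on this page, and a production deployment would more likely set the equivalent limits in its orchestrator.

```python
def docker_run_command(image: str = "cobalt-voicebio", cpus: int = 4, memory_gb: int = 4) -> list[str]:
    """Build a `docker run` argument list that caps the container's CPU and memory."""
    return [
        "docker", "run",
        "--cpus", str(cpus),          # number of CPUs the container may use
        "--memory", f"{memory_gb}g",  # hard memory limit in gigabytes
        "-p", "2727:2727",
        "-p", "8080:8080",
        image,
    ]
```

The resulting list can be passed to `subprocess.run(docker_run_command(), check=True)` on a machine where the image has already been built.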
Cobalt VoiceBio runs on x86_64 CPUs. We also support Arm64 CPUs, including processors such as AWS Graviton (AWS C7g EC2 instances). VoiceBio is significantly more cost-effective to run on C7g instances than on similarly sized instances with Intel or AMD processors, and we can provide an Arm64 release on request.
To integrate Cobalt VoiceBio into your application, please follow the next steps to install or generate the SDK in a language of your choice.