Speech recognition models
Choose the model to use. We provide the default model sommers
and a fine-tuned whisper
for OpenAPI.
When using whisper
, you must set the language
parameter.
Supported languages are listed in the official whisper repo. If you want language detection, see Language.
caution
Default concurrency is 2 and can be adjusted upon request. For Whisper-related inquiries, contact us via chat below.
Sommers model
- Our in-house ASR model with fast latency and strong accuracy.
- Supported languages: Korean (
ko
) and Japanese (ja
). language
is optional. If omitted, it defaults toko
(Korean).
Whisper model
- Community model fine-tuned for our API use cases.
language
is required. For detection/multi-lingual usage, see Language.
Usage
In the POST Request body, set model_name
to sommers
or whisper
(whisper requires language
).
POST example: Example
Unified example (presets): Example