Speech recognition models
Choose which model transcribes your audio. Two are available: sommers, the default, and whisper, a Whisper model fine-tuned for OpenAPI.
When using whisper, you must set the language parameter.
Supported languages are listed in the official whisper repo. If you want language detection, see Language.
caution
Default concurrency is 2 and can be adjusted upon request. For Whisper-related inquiries, contact us via chat below.
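If you submit many files at once, it can help to cap the number of in-flight requests on the client side so you stay within the limit above. The following is a minimal sketch, assuming that "concurrency" means simultaneous requests per account. `run_limited` and the `worker` callable are hypothetical names for illustration and are not part of the API.

```python
from concurrent.futures import ThreadPoolExecutor

# Default limit stated in the caution above. Raise it only after your
# limit has been adjusted. (Assumption: the limit counts in-flight requests.)
MAX_CONCURRENCY = 2


def run_limited(jobs, worker, max_concurrency=MAX_CONCURRENCY):
    """Call worker(job) for every job, with at most max_concurrency running at once.

    `worker` is a placeholder for whatever function sends one request.
    Results are returned in the same order as `jobs`.
    """
    with ThreadPoolExecutor(max_workers=max_concurrency) as pool:
        return list(pool.map(worker, jobs))
```

A thread pool is used here because a transcription request is I/O-bound; any other bounded queue or semaphore would serve the same purpose.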
Sommers model
- Our in-house ASR model with fast latency and strong accuracy.
- Supported languages: Korean (ko) and Japanese (ja).
- language is optional. If omitted, it defaults to ko (Korean).
Whisper model
- Community model fine-tuned for our API use cases.
- language is required. For detection or multilingual usage, see Language.
Usage
In the POST request body, set model_name to sommers or whisper. When model_name is whisper, language must also be set.
POST example: Example
Unified example (presets): Example
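The model rules on this page can be checked on the client before a request is sent. The following is a minimal sketch of that check. The field names model_name and language come from this page, but `build_model_fields` is a hypothetical helper, and where these fields sit inside the full request body is not shown here (see the linked POST example for the actual layout).

```python
import json

# Languages this page lists for the sommers model.
SOMMERS_LANGUAGES = {"ko", "ja"}


def build_model_fields(model_name="sommers", language=None):
    """Return the model-related request fields as a dict (hypothetical helper).

    Mirrors the rules described on this page:
    - sommers: language is optional and falls back to "ko"
    - whisper: language is required
    """
    if model_name == "sommers":
        language = language or "ko"  # make the documented default explicit
        if language not in SOMMERS_LANGUAGES:
            raise ValueError(f"sommers supports only ko and ja, got {language!r}")
    elif model_name == "whisper":
        if not language:
            raise ValueError("whisper requires the language parameter")
    else:
        raise ValueError(f"unknown model_name: {model_name!r}")
    return {"model_name": model_name, "language": language}


if __name__ == "__main__":
    # Serialize the fields for inclusion in the request body.
    print(json.dumps(build_model_fields("whisper", "en")))
```

The whisper branch accepts any non-empty language value on purpose, so that values described on the Language page can pass through unchanged.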