LMNT is an AI voice platform for text-to-speech synthesis and for creating custom voices from uploaded audio samples.
Creates a custom voice in LMNT by training on uploaded audio samples. The voice can then be used for text-to-speech synthesis. Returns the voice ID and metadata upon successful creation. The voice may be in 'training' state initially before becoming 'ready'.
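Because a new voice may start in the 'training' state, a caller usually has to wait before synthesizing with it. The sketch below shows one way to poll until the state becomes 'ready'. It makes no HTTP calls itself: `fetch_voice` is a hypothetical callable you supply (for example, a wrapper around the voice lookup action), and the `state` key is assumed to match the field described above. Any state other than 'ready' is treated as "not ready yet", so a voice that fails training would simply run into the timeout.

```python
import time
from typing import Callable, Mapping


def wait_until_ready(
    fetch_voice: Callable[[str], Mapping[str, object]],  # hypothetical lookup function
    voice_id: str,
    timeout_s: float = 300.0,
    interval_s: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Mapping[str, object]:
    """Poll a voice until its state is 'ready' or the timeout expires."""
    deadline = time.monotonic() + timeout_s
    while True:
        voice = fetch_voice(voice_id)
        # Assumption: the voice metadata exposes its state under the key "state".
        if voice.get("state") == "ready":
            return voice
        if time.monotonic() >= deadline:
            raise TimeoutError(f"voice {voice_id} was not ready after {timeout_s} s")
        sleep(interval_s)
```

The `sleep` parameter exists only so the wait can be replaced in tests; in normal use the default `time.sleep` is fine.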
Deletes a voice from your LMNT account. This operation permanently removes the voice and cancels any pending operations on it. This action cannot be undone. Only voices owned by you (owner='me') can be deleted; system voices cannot be deleted. Use case: keep your voice library manageable by removing custom voices you no longer need.
Generates speech from text and returns JSON with base64-encoded audio and optional word-level timing metadata. Use when you need the synthesis seed or word timestamps for subtitle synchronization. For lower latency without metadata, use the Synthesize Speech action instead.
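To illustrate how such a response might be consumed, the sketch below decodes the base64 audio and turns the word-level timings into (word, start, end) tuples suitable for subtitles. The field names `audio`, `durations`, `text`, `start`, and `duration` are assumptions made for this example and are not confirmed by the description above; adjust them to the actual response schema.

```python
import base64
import json
from typing import List, Tuple

WordTiming = Tuple[str, float, float]  # (word, start seconds, end seconds)


def parse_speech_response(body: str) -> Tuple[bytes, List[WordTiming]]:
    """Decode audio bytes and word timings from a JSON response body."""
    payload = json.loads(body)
    # Assumption: the base64-encoded audio is stored under "audio".
    audio = base64.b64decode(payload["audio"])
    words: List[WordTiming] = []
    # Assumption: timings are optional and stored under "durations" as
    # objects with "text", "start", and "duration" (seconds).
    for item in payload.get("durations") or []:
        start = float(item["start"])
        end = start + float(item["duration"])
        words.append((item["text"], start, end))
    return audio, words
```

The returned bytes can be written straight to a file, and each tuple maps onto one subtitle cue. When the timing metadata is absent, the list is simply empty.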
Retrieves account information including subscription plan details and current usage statistics.
Retrieves details for a single voice in LMNT, identified by its voice ID.
Retrieves a list of available voices from LMNT. Returns both system-provided preset voices and any custom voices you have created. Use filters to narrow results by ownership (system vs custom) or starred status. Each voice includes details like ID, name, description, gender, state, and preview URL.
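The ownership and starred filters can also be mirrored on the client side once a voice list has been fetched. The sketch below is a minimal example that works on plain dictionaries. The keys `owner` and `starred` and the owner value "system" are assumptions for illustration; only owner='me' appears in the descriptions above.

```python
from typing import Iterable, List, Mapping, Optional

Voice = Mapping[str, object]


def filter_voices(
    voices: Iterable[Voice],
    owner: Optional[str] = None,     # e.g. "me"; "system" is an assumed value
    starred: Optional[bool] = None,  # None means "do not filter on starred"
) -> List[Voice]:
    """Return the voices that match every filter that was provided."""
    result: List[Voice] = []
    for voice in voices:
        # Assumption: ownership is exposed under the key "owner".
        if owner is not None and voice.get("owner") != owner:
            continue
        # Assumption: starred status is exposed under the key "starred".
        if starred is not None and bool(voice.get("starred")) != starred:
            continue
        result.append(voice)
    return result
```

For example, `filter_voices(voices, owner="me")` yields the custom voices, which are the only ones eligible for deletion.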