Note This documentation is for v2.x of the API. For documentation about v1.x of the API, please see here. Diatheke is Cobalt’s dialog management engine. It uses a combination of speech technologies and artificial intelligence to allow users to interact with computers and mobile devices through audio and text based dialogs. Audio Based Dialogs Audio based dialog management uses multiple speech technologies, including Automatic Speech Recognition (ASR), Natural Language Understanding (NLU), and Text To Speech (TTS).
Diatheke Model Overview Diatheke is Cobalt’s dialogue management engine. How the dialogues are managed by the engine is determined by a Diatheke model, which is loaded when the engine is initialized. A Diatheke model describes the desired flow of a conversation between a human user and a computer. It lists different ways a human may say something and specifies how the system should respond. In technical terms, it maps an utterance to an intent and entities and then specifies the actions the system performs in response to those inputs.
Luna API Overview Luna is Cobalt’s text to speech (TTS) engine. It can be deployed on-prem and accessed over the network or on your local machine via an API. We currently support Go, Python, and C++ and are adding support for more languages. From the API’s perspective, the process of generating audio data is fairly straightforward. After establishing a connection with the Luna server, a client submits text to Luna, and Luna returns synthesized audio data that mimics human speech.