| 1 Function | 2 Reference Model | 3 Input/Output Data |
| 4 SubAIMs | 5 JSON Metadata | 6 Profiles |
| 7 Reference Software | 8 Conformance Testing | 9 Performance Assessment |
1 Functions
Speech Translation with Descriptors (MMC-STD):
| Receives | Speech Object |
| | Language Selector |
| Produces | Synthesised Translated Speech Object having the Descriptors of the input Speech Object. |
2 Reference Model
Figure 1 depicts the Reference Model of the Speech Translation with Descriptors (MMC-STD) AIM.

Figure 1 – The Speech Translation with Descriptors (MMC-STD) AIM Reference Model
3 Input/Output Data
Table 1 specifies the Input and Output Data of the Speech Translation with Descriptors (MMC-STD) AIM.
Table 1 – I/O Data of the Speech Translation with Descriptors (MMC-STD) AIM
| Input | Description |
| Speech Object | Input Speech. |
| Language Selector | Provides codes of the input and output languages. |
| Output | Description |
| Speech Object | Output Speech of the Text-To-Speech AIM. |
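The I/O Data of Table 1 can be pictured as simple data types. The following is a minimal sketch under stated assumptions, not a normative definition: the class names, field names, and the `translate_stub` function are hypothetical, and the stub only illustrates that the output carries the output language code and the Descriptors of the input Speech Object.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass(frozen=True)
class LanguageSelector:
    """Codes of the input and output languages (field names are assumptions)."""
    input_language: str
    output_language: str


@dataclass
class SpeechObject:
    """Speech data plus Descriptors (this layout is an assumption)."""
    samples: bytes
    language: str
    descriptors: Dict[str, str] = field(default_factory=dict)


def translate_stub(speech: SpeechObject, selector: LanguageSelector) -> SpeechObject:
    """Placeholder for the AIM: copies the input Descriptors to the output.

    A real implementation would synthesise new audio in the output language;
    here the samples are passed through unchanged.
    """
    return SpeechObject(
        samples=speech.samples,
        language=selector.output_language,
        descriptors=dict(speech.descriptors),
    )


source = SpeechObject(samples=b"\x00\x01", language="en", descriptors={"pitch": "high"})
output = translate_stub(source, LanguageSelector("en", "it"))
```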
4 SubAIMs
Speech Translation with Descriptors may also be implemented as a Composite AIM as specified in Figure 2.
Figure 2 – Speech Translation with Descriptors (MMC-STD) Composite AIM
Table 2 specifies the AIMs of the Speech Translation with Descriptors (MMC-STD) Composite AIM.
Table 2 – AIMs of Speech Translation with Descriptors (MMC-STD) Composite AIM
| AIM | Name | JSON |
| MMC-STD | Speech Translation with Descriptors | X |
| MMC-ASR | Automatic Speech Recognition | X |
| MMC-ESD | Entity Speech Description | X |
| MMC-SDT | Descriptors Text Translation | X |
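A Composite AIM of this kind can be thought of as a chain of stages. The sketch below is illustrative only: the wiring between stages is an assumption (the normative data flow is the one in Figure 2), and `build_composite`, the stage signatures, and the toy stand-ins are hypothetical names, not part of the specification.

```python
from typing import Callable, Dict

# Hypothetical stage signatures; the specification does not define a Python API.
Recognise = Callable[[bytes, str], str]                   # speech, input language -> text
Describe = Callable[[bytes], Dict[str, str]]              # speech -> descriptors
Translate = Callable[[str, str, str], str]                # text, input lang, output lang -> text
Synthesise = Callable[[str, Dict[str, str], str], bytes]  # text, descriptors, output lang -> speech


def build_composite(
    recognise: Recognise,
    describe: Describe,
    translate: Translate,
    synthesise: Synthesise,
) -> Callable[[bytes, str, str], bytes]:
    """Chain the stages into one callable (assumed wiring)."""

    def run(speech: bytes, input_language: str, output_language: str) -> bytes:
        text = recognise(speech, input_language)       # speech recognition
        descriptors = describe(speech)                  # speech description
        translated = translate(text, input_language, output_language)
        # Synthesis receives the Descriptors extracted from the input speech.
        return synthesise(translated, descriptors, output_language)

    return run


# Toy stand-ins so the sketch runs end to end; they do no real processing.
pipeline = build_composite(
    recognise=lambda speech, lang: speech.decode("utf-8"),
    describe=lambda speech: {"length": str(len(speech))},
    translate=lambda text, src, dst: f"[{src}->{dst}] {text}",
    synthesise=lambda text, desc, lang: f"{text} {desc}".encode("utf-8"),
)
result = pipeline(b"hello", "en", "it")
```

Passing the stages in as callables keeps each one replaceable, which mirrors the idea that each AIM in Table 2 is specified separately.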
5 JSON Metadata
https://schemas.mpai.community/MMC/V2.2/AIMs/SpeechTranslationWithDescriptors.json
6 Profiles
No Profiles.
7 Reference Software
8 Conformance Testing
MPAI-MMC V2.2 specifies the Conformance Testing of the Composite AIM of Figure 2.
| Input Data | Data Type | Input Conformance Testing Data |
| Language Selector | Selector | All Language Selectors to be drawn from Language Codes. |
| Speech Object | Speech | All input Speech files to be drawn from Speech files. |
| Output Data | Data Type | Output Conformance Testing Data |
| Translated Speech | Speech | All Speech files produced shall conform with Speech files. |
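The conformance rows above can be read as a loop over test inputs followed by a check on each output. The sketch below is one possible harness, not the MPAI conformance procedure: `run_conformance`, `is_wav`, and `make_wav` are hypothetical helpers, and treating "Speech files" as WAV is an assumption made only so that the check is concrete.

```python
import io
import wave
from itertools import product
from typing import Callable, List, Sequence, Tuple

Selector = Tuple[str, str]  # (input language code, output language code), assumed shape


def is_wav(data: bytes) -> bool:
    """Return True if the bytes parse as a WAV file (assumed speech file format)."""
    try:
        with wave.open(io.BytesIO(data), "rb") as reader:
            return reader.getnframes() >= 0
    except (wave.Error, EOFError):
        return False


def make_wav(frames: int = 160, rate: int = 16000) -> bytes:
    """Build a short silent mono 16-bit WAV to use as test data."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as writer:
        writer.setnchannels(1)
        writer.setsampwidth(2)
        writer.setframerate(rate)
        writer.writeframes(b"\x00\x00" * frames)
    return buffer.getvalue()


def run_conformance(
    aim: Callable[[bytes, Selector], bytes],
    selectors: Sequence[Selector],
    speech_inputs: Sequence[bytes],
    is_valid_speech: Callable[[bytes], bool] = is_wav,
) -> List[Tuple[Selector, int]]:
    """Run the AIM on every (selector, input) pair and collect failing pairs."""
    failures = []
    for selector, (index, speech) in product(selectors, enumerate(speech_inputs)):
        if not is_valid_speech(aim(speech, selector)):
            failures.append((selector, index))
    return failures


selectors = [("en", "it"), ("it", "en")]
inputs = [make_wav(), make_wav(frames=320)]

# Stand-in implementations: one echoes valid speech, one emits invalid bytes.
passing = run_conformance(lambda speech, sel: speech, selectors, inputs)
failing = run_conformance(lambda speech, sel: b"not audio", selectors, inputs)
```

An empty list means every produced file passed the chosen validity check; each entry otherwise identifies the selector and input index that failed.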