1     Functions 2     Reference Model 3     Input/Output Data
4     SubAIMs 5     JSON Metadata 6     Profiles
7     Reference Software 8     Conformance Testing 9     Performance Assessment

1     Functions

Speech Translation with Descriptors (MMC-STD):

Receives a Speech Object and a Language Selector.
Produces a Synthesised Translated Speech Object having the Descriptors of the input Speech Object.

2     Reference Model

Figure 1 depicts the Reference Model of the Speech Translation with Descriptors (MMC-STD) AIM.

Figure 1 – The Speech Translation with Descriptors (MMC-STD) AIM Reference Model

3     Input/Output Data

Table 1 specifies the Input and Output Data of the Speech Translation with Descriptors (MMC-STD) AIM.

Table 1 – I/O Data of the Speech Translation with Descriptors (MMC-STD) AIM

| Input | Description |
|---|---|
| Speech Object | Input Speech. |
| Language Selector | Provides codes of the input and output languages. |
| Output | Description |
| Speech Object | Output Speech of the Text-To-Speech AIM. |
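
The I/O contract of Table 1 can be pictured as a small typed interface. The following is a minimal sketch only: the class names, field names, and the toy `echo_translator` are hypothetical and are not defined by MPAI-MMC, and the `audio` payload is a placeholder rather than the actual Speech Object format.

```python
from dataclasses import dataclass
from typing import Protocol


@dataclass(frozen=True)
class SpeechObject:
    """Hypothetical stand-in for a Speech Object."""
    audio: bytes    # placeholder payload, not the MPAI Speech Object format
    language: str   # assumed field: language code of the spoken content


@dataclass(frozen=True)
class LanguageSelector:
    """Codes of the input and output languages (field names are assumptions)."""
    input_language: str
    output_language: str


class SpeechTranslationWithDescriptors(Protocol):
    """Shape of the AIM as seen from outside: two inputs, one Speech Object out."""

    def __call__(self, speech: SpeechObject, selector: LanguageSelector) -> SpeechObject:
        ...


def echo_translator(speech: SpeechObject, selector: LanguageSelector) -> SpeechObject:
    """Toy implementation: relabels the language and leaves the audio untouched."""
    if speech.language != selector.input_language:
        raise ValueError("Speech language does not match the selector's input language")
    return SpeechObject(audio=speech.audio, language=selector.output_language)


# Any callable with the same signature satisfies the interface.
translator: SpeechTranslationWithDescriptors = echo_translator
```

A real implementation would replace `echo_translator` with actual recognition, translation, and synthesis; the sketch only fixes the shape of the inputs and the output.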

4     SubAIMs

Speech Translation with Descriptors may also be implemented as a Composite AIM as specified in Figure 2.

Figure 2 – Speech Translation with Descriptors (MMC-STD) Composite AIM

Table 2 specifies the AIMs of the Speech Translation with Descriptors (MMC-STD) Composite AIM.

Table 2 – AIMs of Speech Translation with Descriptors (MMC-STD) Composite AIM

| AIM | Name | JSON |
|---|---|---|
| MMC-STD | Speech Translation with Descriptors | X |
| MMC-ASR | Automatic Speech Recognition | X |
| MMC-ESD | Entity Speech Description | X |
| MMC-SDT | Descriptors Text Translation | X |
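
As an illustration of how a Composite AIM chains its SubAIMs, the sketch below wires four stages into one callable. The data flow is an assumption, since the wiring of Figure 2 is not reproduced in this text: speech is recognised into text, Descriptors are extracted from the same speech, the text is translated, and the result is synthesised with the extracted Descriptors. All function names, signatures, and the dictionary-based `Speech` type are hypothetical.

```python
from typing import Any, Callable, Dict

Speech = Dict[str, Any]  # hypothetical stand-in for a Speech Object


def make_composite(
    recognise: Callable[[Speech, str], str],
    describe: Callable[[Speech], Dict[str, Any]],
    translate: Callable[[str, str, str], str],
    synthesise: Callable[[str, Dict[str, Any], str], Speech],
) -> Callable[[Speech, str, str], Speech]:
    """Chain four stages into one callable (assumed wiring, not taken from Figure 2)."""

    def run(speech: Speech, input_language: str, output_language: str) -> Speech:
        text = recognise(speech, input_language)      # role played by MMC-ASR
        descriptors = describe(speech)                # role played by MMC-ESD
        translated = translate(text, input_language, output_language)
        # Synthesis reuses the Descriptors of the input speech.
        return synthesise(translated, descriptors, output_language)

    return run


# Toy stand-ins so the sketch runs without any speech models.
def toy_recognise(speech: Speech, language: str) -> str:
    return speech["transcript"]


def toy_describe(speech: Speech) -> Dict[str, Any]:
    return {"pitch": speech["pitch"]}


def toy_translate(text: str, source: str, target: str) -> str:
    return f"[{source}->{target}] {text}"


def toy_synthesise(text: str, descriptors: Dict[str, Any], language: str) -> Speech:
    return {"transcript": text, "pitch": descriptors["pitch"], "language": language}


pipeline = make_composite(toy_recognise, toy_describe, toy_translate, toy_synthesise)
```

Passing the stages as arguments keeps each SubAIM replaceable, which mirrors the idea that a Composite AIM is assembled from independently specified AIMs.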

5     JSON Metadata

https://schemas.mpai.community/MMC/V2.2/AIMs/SpeechTranslationWithDescriptors.json

6     Profiles

No Profiles.

7     Reference Software

8     Conformance Testing

MPAI-MMC V2.2 specifies the Conformance Testing of the Composite AIM of Figure 1.

| Input Data | Data Type | Input Conformance Testing Data |
|---|---|---|
| Language Selector | Selector | All Language Selectors to be drawn from Language Codes. |
| Speech Object | Speech | All input Speech files to be drawn from Speech files. |
| Output Data | Data Type | Conformance Test Results |
| Translated Speech | Speech | All Speech files produced shall conform with Speech files. |
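
The two kinds of check in the table above can be sketched as follows. This is an illustration under stated assumptions, not the MPAI conformance procedure: the set of allowed language codes is an arbitrary subset chosen for the example, and WAV is used as a stand-in for the Speech file format, which this section does not specify. The helper names are hypothetical.

```python
import io
import wave

# Illustrative subset only; the normative list of Language Codes is defined elsewhere.
ALLOWED_LANGUAGE_CODES = frozenset({"en", "it", "ja"})


def selector_is_conformant(input_code: str, output_code: str,
                           allowed: frozenset = ALLOWED_LANGUAGE_CODES) -> bool:
    """Check that both codes of a Language Selector are drawn from the allowed set."""
    return input_code in allowed and output_code in allowed


def looks_like_wav(data: bytes) -> bool:
    """Check that the bytes parse as a WAV file (assumed stand-in for a Speech file)."""
    try:
        with wave.open(io.BytesIO(data), "rb"):
            return True
    except (wave.Error, EOFError):
        return False


def make_silent_wav(frames: int = 160, rate: int = 16000) -> bytes:
    """Build a short mono 16-bit silent WAV in memory, for exercising the check."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * frames)
    return buffer.getvalue()
```

A test harness would run `selector_is_conformant` on every Language Selector fed to the AIM and `looks_like_wav` (or the equivalent check for the actual Speech file format) on every Translated Speech output.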

9     Performance Assessment