| 1 Function | 2 Reference Model | 3 Input/Output Data |
| 4 SubAIMs | 5 JSON Metadata | 6 Profiles |
| 7 Reference Software | 8 Conformance Texting | 9 Performance Assessment |
1 Functions
The Text and Speech Translation Composite (MMC-TST) AIM :
| Receives | Selector | To chhose between: |
| – The AIM output should be Text or Speech. | ||
| – The output Speech should retain the input Speech Features. | ||
| Language Preferences | as requested input and output language. | |
| Personal Status. | Use of Personal Status | |
| Text. | Use of Text | |
| Speech. | Use of Speech | |
| Performs | A subset of) the following: | |
| Conversion of input Speech | Into Text. | |
| Translation of Text | To the target language. | |
| Extraction of Features | From Speech. | |
| Conversion of Text | Into Speech adding the Input Speech’s Features. | |
| Produces | Translated Text. | Depends of Selector. |
| . | Translated Speech | Depends of Selector. |
2 Reference Model
Figure 1 depicts the Reference Model of the Text-and-Speech Translation Composite (MMC-TST) AIM.

Figure 1 – Text-and-Speech Translation (MMC-TST) AIM Reference Model
3 Input/Output Data
Table 1 specifies the Input and Output Data of the Text-to-Text Translation (MMC-TST) AIM.
Table 1 – I/O Data of the Text-and-Speech Translation (MMC-TST) AIM
| Input | Semantics |
| Selector | Signals: 1. Whether the input is Text or Speech 2. Whether the input Speech features are preserved in the output Speech. 3. The Input and output languages. |
| Speech Object | Speech produced in input language by a human desiring translation into output language |
| TextObject | Alternative textual source information to be translated into and pronounced in output language depending on the value of Input Selection. |
| Output | Description |
| Translated SpeechObject | Speech in input language translated into output language preserving the Input Speech features in the Output Speech, depending on Selector. |
| Translated TextObject | Text of Input Speech or Input Text translated into output language, depending on Selector. |
4 SubAIMs
Text and Speech Translation is a Composite AIM whose Reference Model is depicted in Figure 2.

Figure 2 – Text-and-Speech Translation Composite (MMC-TST) AIM
Table 2 – AIMs of Text-and-Speech Translation Composite (MMC-TST) AIM
| AIW | AIMs | AIM Names | JSON |
| MMC-TST | Text-and-Speech Translation | X | |
| MMC-ASR | Automatic Speech Recognition | X | |
| MMC-TTT | Text-to-Text Translation | X | |
| MMC-ISD | Entity Speech Description | X | |
| MMC-DTS | Descriptors Text-to-Speech | X |
5 JSON Metadata
https://schemas.mpai.community/MMC/V2.3/AIMs/TextAndSpeechTranslation.json
6 Profiles
The Profiles of Text and Speech Translation are specified.
7. Reference Software
8. Conformance Testing
Table 3 provides the Conformance Testing Method for MMC-TST AIM as a Basic AIM. Conformance Testing of the individual AIMs of the MMC-TST Composite AIM are given by the individual AIM Specification.
If a schema contains references to other schemas, conformance of data for the primary schema implies that any data referencing a secondary schema shall also validate against the relevant schema, if present and conform with the Qualifier, if present.
Table 3 – Conformance Testing Method for MMC-TST AIM
| Input | Selector | Shall validate against Selector schema. |
| Text Object | Shall validate against Text Object schema. Text Data shall conform with Text Qualifier. |
|
| Speech Object | Shall validate against Speech Object schema. Speech Data shall conform with Speech Qualifier. |
|
| Output | Translated Text Object | Shall validate against Text Object. Text Data shall conform with Text Qualifier. |
| Translated Speech Object | Shall validate against Speech Object. Speech Data shall conform with Speech Qualifier. |
Important note. This Conformance Testing Specification does not provide methods and datasets to Test the Conformance of the individual Speech Feature Extraction and Text-To-Speech Basic AIMs, only of their Descriptors Speech Translation Composite AIMs.
Table 4 provides an example of MMC-TSTAIM conformance testing.
Table 4 – An example MMC-TST AIM conformance testing
| Input Data | Data Type | Input Conformance Testing Data |
| Input Selector | Selector | All Input Selectors to conform with Selector. |
| Requested Language | Selector | All Language Selectors to be drawn from Language Codes. |
| Input Text | Unicode | All input Text files shall be drawn from Text files. |
| Input Speech | .wav | All input Text files shall be drawn from Speech files. |
| Output Data | Data Type | Conformance Test |
| Machine Text | Unicode | All Text files produced shall conform with Text files. |
| Machine Speech | .wav | All Speech files produced shall conform with Speech files. |