Go To MPAI-MMC AI Modules

1     Function 2     Reference Model 3     Input/Output Data
4     SubAIMs 5     JSON Metadata 6     Profiles
7     Reference Software 8     Conformance Texting 9     Performance Assessment

1     Functions

The Text and Speech Translation Composite (MMC-TST) AIM :

Receives Selector To chhose between:
– The AIM output should be Text or Speech.
– The output Speech should retain the input Speech Features.
Language Preferences as requested input and output language.
Personal Status. Use of Personal Status
Text. Use of Text
Speech. Use of Speech
Performs A subset of) the following:
Conversion of input Speech Into Text using Personal Status.
Translation of Text To the target language.
Extraction of Features From Speech.
Conversion of Text Into Speech adding the Input Speech’s Features.
Produces Translated Text. Depends of Selector.
. Translated Speech Depends of Selector.

2     Reference Model

Figure 1 depicts the Reference Model of the Text-and-Speech Translation Composite (MMC-TST) AIM.

Figure 1 – Text-and-Speech Translation (MMC-TST) AIM Reference Model

3    Input/Output Data

Table 1 specifies the Input and Output Data of the Text-to-Text Translation (MMC-TST) AIM.

Table 1 – I/O Data of the Text-and-Speech Translation (MMC-TST) AIM

Input Semantics
Selector Signals:
1.     Whether the input is Text or Speech
2.    Whether the input Speech features are preserved in the output Speech.
3.     The Input and output languages.
Speech Object Speech produced in input language by a human desiring translation into output language
TextObject Alternative textual source information to be translated into and pron­ounced in output language depending on the value of Input Selection.
Output Description
Translated  SpeechObject Speech in input language translated into output language preserving the Input Speech features in the Output Speech, depending on Selec­tor.
Translated TextObject Text of Input Speech or Input Text translated into output language, depending on Selector.

4     SubAIMs

Text and Speech Translation is a Composite AIM whose Reference Model is depicted in Figure 2.

Figure 2 – Text-and-Speech Translation Composite (MMC-TST) AIM

Table 2 – AIMs  of Text-and-Speech Translation Composite (MMC-TST) AIM

AIW AIMs  AIM Names JSON
MMC-TST Text-and-Speech Translation X
MMC-ASR Automatic Speech Recognition X
MMC-TTT Text-to-Text Translation X
MMC-ISD Entity Speech Description X
MMC-DTS Descriptors Text-to-Speech X

5     JSON Metadata

https://schemas.mpai.community/MMC/V2.3/AIMs/TextAndSpeechTranslation.json

6     Profiles

The Profiles of Text and Speech Translation are specified.

7. Reference Software

8. Conformance Testing

Table 3 provides the Conformance Testing Method for MMC-TST AIM as a Basic AIM. Conformance Testing of the individual AIMs of the MMC-TST Composite AIM are given by the individual AIM Specification.

If a schema contains references to other schemas, conformance of data for the primary schema implies that any data referencing a secondary schema shall also validate against the relevant schema, if present and conform with the Qualifier, if present.

Table 3 – Conformance Testing Method for MMC-TST AIM

Input Selector Shall validate against Selector schema.
Text Object Shall validate against Text Object schema.
Text Data shall conform with Text Qualifier.
Speech Object Shall validate against Speech Object schema.
Speech Data shall conform with Speech Qualifier.
Output Translated Text Object Shall validate against Text Object.
Text Data shall conform with Text Qualifier.
Translated Speech Object Shall validate against Speech Object.
Speech Data shall conform with Speech Qualifier.

Important note. This Conformance Testing Specification does not provide methods and datasets to Test the Conformance of the individual Speech Feature Extraction and Text-To-Speech Basic AIMs, only of their Descriptors Speech Translation Composite AIMs.

Table 4 provides an example of MMC-TSTAIM conformance testing.

Table 4 – An example MMC-TST AIM conformance testing

Input Data Data Type Input Conformance Testing Data
Input Selector Selector All Input Selectors to conform with Selector.
Requested Language Selector All Language Selectors to be drawn from Language Codes.
Input Text Unicode All input Text files shall be drawn from Text files.
Input Speech .wav All input Text files shall be drawn from Speech files.
Output Data Data Type Conformance Test
Machine Text Unicode All Text files produced shall conform with Text files.
Machine Speech .wav All Speech files produced shall conform with Speech files.

9. Performance Assessment

Go To MPAI-MMC AI Modules