1 Function | 2 Reference Model | 3 Input/Output Data |
4 SubAIMs | 5 JSON Metadata | 6 Profiles |
7 Reference Software | 8 Conformance Testing | 9 Performance Assessment |
1 Functions
The Text and Speech Translation Composite (MMC-TST) AIM:
Receives | Selector | To choose between: |
– The AIM output should be Text or Speech. | ||
– The output Speech should retain the input Speech Features. | ||
Language Preferences | as requested input and output language. | |
Personal Status. | Use of Personal Status | |
Text. | Use of Text | |
Speech. | Use of Speech | |
Performs | (A subset of) the following: | |
Conversion of input Speech | Into Text using Personal Status. | |
Translation of Text | To the target language. | |
Extraction of Features | From Speech. | |
Conversion of Text | Into Speech adding the Input Speech’s Features. | |
Produces | Translated Text. | Depends on Selector. |
Translated Speech. | Depends on Selector. | |
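The functions listed above can be read as a small dispatch over the Selector. The following is a minimal sketch under stated assumptions, not the normative behaviour of the AIM: the `Selector` fields, the `SubAIMs` container, and the function `translate_text_and_speech` are hypothetical names introduced for illustration, Personal Status is left out for brevity, and returning the Translated Text alongside the Translated Speech is a choice of this sketch.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass(frozen=True)
class Selector:
    """Hypothetical model of the choices the Selector signals."""
    input_is_speech: bool        # input is Speech (True) or Text (False)
    output_is_speech: bool       # output is Speech (True) or Text (False)
    keep_speech_features: bool   # retain the Input Speech features in the output
    input_language: str
    output_language: str


@dataclass
class SubAIMs:
    """Hypothetical stand-ins for the sub-AIMs of the composite."""
    recognise: Callable[[bytes, str], str]                    # speech -> text
    translate: Callable[[str, str, str], str]                 # text -> translated text
    describe: Callable[[bytes], dict]                         # speech -> features
    synthesise: Callable[[str, str, Optional[dict]], bytes]   # text (+ features) -> speech


def translate_text_and_speech(
    selector: Selector,
    aims: SubAIMs,
    text: Optional[str] = None,
    speech: Optional[bytes] = None,
) -> Tuple[str, Optional[bytes]]:
    # Step 1: obtain the source text, converting Speech into Text if needed.
    if selector.input_is_speech:
        if speech is None:
            raise ValueError("Selector requests Speech input but none was given")
        source_text = aims.recognise(speech, selector.input_language)
    else:
        if text is None:
            raise ValueError("Selector requests Text input but none was given")
        source_text = text

    # Step 2: translate the text into the target language.
    translated = aims.translate(
        source_text, selector.input_language, selector.output_language
    )
    if not selector.output_is_speech:
        return translated, None

    # Step 3: extract features from the Input Speech only when requested.
    features = None
    if selector.keep_speech_features and speech is not None:
        features = aims.describe(speech)

    # Step 4: convert the translated text into Speech, adding the features.
    return translated, aims.synthesise(translated, selector.output_language, features)


# Toy sub-AIMs, purely for demonstration.
demo = SubAIMs(
    recognise=lambda speech, lang: speech.decode("utf-8"),
    translate=lambda text, src, dst: f"[{src}->{dst}] {text}",
    describe=lambda speech: {"length": len(speech)},
    synthesise=lambda text, lang, features: text.encode("utf-8"),
)
text_out, speech_out = translate_text_and_speech(
    Selector(True, False, False, "it", "en"), demo, speech=b"ciao"
)
print(text_out, speech_out)
```

Only the steps the Selector calls for are executed, which matches the "(a subset of) the following" wording: a Text-to-Text request never touches speech recognition, feature extraction, or synthesis.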
2 Reference Model
Figure 1 depicts the Reference Model of the Text-and-Speech Translation Composite (MMC-TST) AIM.
Figure 1 – Text-and-Speech Translation (MMC-TST) AIM Reference Model
3 Input/Output Data
Table 1 specifies the Input and Output Data of the Text-and-Speech Translation (MMC-TST) AIM.
Table 1 – I/O Data of the Text-and-Speech Translation (MMC-TST) AIM
Input | Semantics |
Selector | Signals: 1. Whether the input is Text or Speech. 2. Whether the Input Speech features are preserved in the Output Speech. 3. The input and output languages. |
Speech Object | Speech produced in the input language by a human desiring translation into the output language. |
Text Object | Alternative textual source information to be translated into, and pronounced in, the output language, depending on the value of Selector. |
Output | Description |
Translated Speech Object | Input Speech translated into the output language, preserving the Input Speech features in the Output Speech, depending on Selector. |
Translated Text Object | Text of Input Speech or Input Text translated into the output language, depending on Selector. |
4 SubAIMs
Text and Speech Translation is a Composite AIM whose Reference Model is depicted in Figure 2.
Figure 2 – Text-and-Speech Translation Composite (MMC-TST) AIM
Table 2 – AIMs of Text-and-Speech Translation Composite (MMC-TST) AIM
AIW | AIMs | AIM Names | JSON |
MMC-TST | Text-and-Speech Translation | X | |
MMC-ASR | Automatic Speech Recognition | X | |
MMC-TTT | Text-to-Text Translation | X | |
MMC-ISD | Entity Speech Description | X | |
MMC-DTS | Descriptors Text-to-Speech | X | |
5 JSON Metadata
https://schemas.mpai.community/MMC/V2.3/AIMs/TextAndSpeechTranslation.json
6 Profiles
The Profiles of Text and Speech Translation are specified.
7 Reference Software
8 Conformance Testing
Table 3 provides the Conformance Testing Method for the MMC-TST AIM as a Basic AIM. Conformance Testing of the individual AIMs of the MMC-TST Composite AIM is given by the individual AIM Specifications.
If a schema contains references to other schemas, conformance of data to the primary schema implies that any data referencing a secondary schema shall also validate against that schema, if present, and conform with the Qualifier, if present.
Table 3 – Conformance Testing Method for MMC-TST AIM
Input | Selector | Shall validate against Selector schema. |
Text Object | Shall validate against Text Object schema. Text Data shall conform with Text Qualifier. |
Speech Object | Shall validate against Speech Object schema. Speech Data shall conform with Speech Qualifier. |
Output | Translated Text Object | Shall validate against Text Object schema. Text Data shall conform with Text Qualifier. |
Translated Speech Object | Shall validate against Speech Object schema. Speech Data shall conform with Speech Qualifier. |
Important note. This Conformance Testing Specification does not provide methods and datasets to Test the Conformance of the individual Speech Feature Extraction and Text-To-Speech Basic AIMs, only of their Descriptors Speech Translation Composite AIMs.
Table 4 provides an example of MMC-TST AIM conformance testing.
Table 4 – An example of MMC-TST AIM conformance testing
Input Data | Data Type | Input Conformance Testing Data |
Input Selector | Selector | All Input Selectors to conform with Selector. |
Requested Language | Selector | All Language Selectors to be drawn from Language Codes. |
Input Text | Unicode | All input Text files shall be drawn from Text files. |
Input Speech | .wav | All input Speech files shall be drawn from Speech files. |
Output Data | Data Type | Conformance Test |
Machine Text | Unicode | All Text files produced shall conform with Text files. |
Machine Speech | .wav | All Speech files produced shall conform with Speech files. |
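As an illustration of the data-type column in Table 4, the sketch below checks that a payload is decodable Unicode text or a readable .wav file, using only the Python standard library. It is a rough pre-check under stated assumptions, not the conformance test itself: the function names are hypothetical, UTF-8 is assumed as the Unicode encoding, and validation against the Text Object and Speech Object schemas or their Qualifiers is not covered.

```python
import io
import wave


def is_unicode_text(data: bytes, encoding: str = "utf-8") -> bool:
    """Return True if the bytes decode cleanly (UTF-8 is an assumption)."""
    try:
        data.decode(encoding)
    except UnicodeDecodeError:
        return False
    return True


def is_wav(data: bytes) -> bool:
    """Return True if the bytes parse as a RIFF/WAVE file with a valid rate."""
    try:
        with wave.open(io.BytesIO(data), "rb") as wav:
            return wav.getframerate() > 0
    except (wave.Error, EOFError):
        # wave.Error: not a RIFF/WAVE file; EOFError: truncated header.
        return False


def make_silence_wav(frames: int = 800, rate: int = 8000) -> bytes:
    """Build a short mono 16-bit silent .wav in memory, for demonstration."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * frames)
    return buffer.getvalue()


print(is_wav(make_silence_wav()), is_wav(b"not a wav file"))
print(is_unicode_text("traducción".encode("utf-8")), is_unicode_text(b"\xff\xfe\xfa"))
```

A check like this would typically run on both the input test files and the Machine Text and Machine Speech outputs before any schema or Qualifier validation.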