<- JSON Syntax and Semantics   Go t o ToC   Annex1->

1       Entity Context Understanding (HMC-ECU)

2       Entity Dialogue Processing (MMC-EDP)

3       Natural Language Understanding (MMC-NLU)

4       Personal Status Extraction (MMC-PSE)

5       Text and Speech Translation (MMC-TST)

6       Audio-Visual Scene Rendering (PAF-AVR)

7       Personal Status Display (PAF-PSD)

8       Text-to-Speech (MMC-TTS)

This Chapter specifies the eight AI Modules for which Profiles can be signaled.

1        Entity Context Understanding (HMC-ECU)

1.1        Specification

The Entity Context Understanding Composite AIM is specified

1.2        Attributes

HMC-ECU Profiles are determined by the use of one or more of the following attributes by the AIM:

Attribute Code Function of HMC-ECU
Body Descriptors BDD Receives Body Descriptors
Face Descriptors FCD Receives Face Descriptors
Speech Object SPO Receives Speech Object
Text Object TXO Receives Text Object
Visual Object VIO Receives Visual Object
Audio Object AUO Receives Audio Object
Audio-Visual Scene Descriptors AVS Receives Audio-Visual Scene Descriptors
Translation TRN Translates Text Object

2        Entity Dialogue Processing (MMC-EDP)

2.1        Specification

Entity Dialogue Processing is specified.

2.2        Attributes

MMC-EDP Profiles are determined by the use of one or more of the following attributes by the AIM:

Attribute Code Function of MMC-EDP
Text Object TXO Receives Text (directly from human or through NLU).
Object Instance ID OII Receives the ID of an A/V/AV Instance referenced in the dialogue.
Input Personal Status EPS Receives Personal Status.
Text Descriptors TXD Receives Meaning.
AV Scene Descriptors AVS Receives AV Scene Descriptors to enable it to locate the Object.
Speaker ID SPI Receives Speaker ID.
Face ID FCI Receives Face ID.
Memory MEM Takes into account prior Input Data of the dialogue session.

3        Natural Language Understanding (MMC-NLU)

3.2        Specification

The Natural Language Understanding AIM is specified.

3.3        Attributes

MMC-NLU Profiles are determined by the use of one or more of the following attributes by the AIM:

Attribute Code Function of MMC-NLU
Text Object TXO Receives Text directly from human.
Recognised Text TXR Receives text from ASR.
Object Instance ID OII Receives Object Instance ID
Audio-Visual Scene Descriptors AVS Receives Audio-Visual Descriptors.
Text Descriptors TXD Produces Text Descriptors (Meaning)

4        Personal Status Extraction (MMC-PSE)

4.1        Specification

Personal Status Extraction is specified.

4.2        Attributes

MMC-PSE Profiles are determined by the use of one or more of the following attributes by the AIM:

Attribute Code Function of MMC-PSE
Text Object TXO Receives Text
Speech Object SPO Receives Speech
Face Object FCO Receives Face
Body Object BDO Receives Gesture

When an MMC-PSE is used as a component AIM in a Composite AIM as in the case of HMC-ECU, the MMC-PSE Attributes become Sub-Attributes of the Composite AIM.

5        Text and Speech Translation (MMC-TST)

5.1        Specification

Text and Speech Translation is specified.

5.2        Attributes

MMC-TST Profiles are determined by the use of one or more of the following attributes by the AIM:

Attributes Code Functions
Language Preferences LGP MMC-TST receives information on input and output languages.
Text Object TXO MMC-TST receives Text.
Speech Object SPO MMC-TST receives Speech.
Speech Descriptors SPD MMC-TST uses Speech Descriptors.
Personal Status EPS MMC-TST receives Personal Status.

When an MMC-TST is used as a component AIM in a Composite AIM as in the case of HMC-ECU, the LGP (Language Preferences) Attribute of MMC-TST become Sub-Attributes of the Composite AIM represented as 3-letter codes of [6], Part 3.

6        Audio-Visual Scene Rendering (PAF-AVR)

6.1        Specification

Audio-Visual Scene Rendering is specified.

6.2        Attributes

PAF-AVR Profiles are determined by the use of one or more of the following attributes by the AIM:

Attribute Code Function
Point of View POV PAF-AVR is informed to provide Output Audio and/or Output Visual as perceived from a Point of View.
Portable Avatar PAV PAF-AVR receives a Portable Avatar and produces an Audio-Visual Scene from the Point of View.
Audio-Visual Scene Descriptors AVS PAF-AVR receives Audio-Visual Scene Descriptors and produces an Audio-Visual Scene from the Point of View.
Output Text TXO PAF-AVR produces Text Object.
Output Audio AUO PAF-AVR produces Audio Object.
Output Visual VIO PAF-AVR produces Visual Object.

7        Personal Status Display (PAF-PSD)

7.1        Specification

Personal Status Display is specified.

7.2        Attributes

PAF-PSD Profiles are determined by the use of one or more of the following attributes by the AIM:

Attribute Code Function
Text Object TXO PAF-PSD receives Text and produces Speech.
Personal Status EPS PAF-PSD receives Personal Status.
Speech Model SPM PAF-PSD receives Speech Model
Avatar Model AVM PAF-PSD receives Avatar Model.

8        Text-to-Speech (MMC-TTS)

8.1        Specification

Text-to-Speech is specified.

8.2        Attributes

An MMC-TTS AIM Profile is determined by whether the AIM uses one or more of the following Attributes:

Attribute Code Function
Text Object TXO MMC-TTS receives Text Object
Personal Status EPS MMC-TTS receives Personal Status
Speech Model SPM MMC-TTS receives NN Speech Model

<- JSON Syntax and Semantics   Go t o ToC   Annex1->