HMC-CEC uses six groups of capability classes to process a Communication Item:
Receives | Communication Items from a Machine or Audio-Visual Scenes from a real space. |
Extracts | Personal Status from the Modalities (Text, Speech, Face, or Gesture) in the Communication Item received. |
Understands | The Communication Item from the Modalities and the extracted Personal Status, with or without use of the spatial information embedded in the Communication Item. |
Translates | Using the set of Modalities available to the Machine. |
Generates | Response. |
Displays | The response using available Modalities. |
The Attributes and Sub-Attributes of the HMC-CEC Profiles are defined in Table 1 where the Sub-Attributes are expressed with three characters where the first two are followed by O (representing Object):
- The Audio-Visual Scene represent Text (TXO), Speech (SPO), Audio (AUO), Visual (VIO), and Portable Avatar (PAF) Sub-Attributes, respectively.
- The Personal Status, Understanding, Translation, and Display Response represent Text (TXO), Speech (SPO), Face (FCO), and Gesture (GSO), respectively.
The SPC Sub-Attribute of Understanding represents Spatial Information (SPaCe), i.e., the capability of an HMC-CEC implementation to use also Spatial Information to understand a Communication Item.
Table 1 – Attribute and Sub-Attribute Codes of HMC-CEC.
Attributes | Codes | Sub-Attribute Codes | ||||
Audio-Visual Scene | AVS | TXO | SPO | AUO | VIO | PAF |
Personal Status | EPS | TXO | SPO | FCO | GSO | |
Understanding | UND | TXO | SPO | FCO | GSO | SPC |
Translation | TRN | TXO | SPO | FCO | GSO | |
Display Response | RES | TXO | SPO | FCO | GSO |
The formal specification of MPAI-HMC Profiles is provided by the following JSON file.
https://schemas.mpai.community/HMC/V1.1/data/CECProfiles.json
}