Terms beginning with a capital letter have the meaning defined in Table 1. Terms beginning with a small letter have the meaning commonly defined for the context in which they are used. For instance, Table 1 defines Object and Scene but does not define object and scene. Words beginning with a small letter may also refer to an entity in the real world as opposed to a Virtual Environment.
A dash “-” preceding a Term in Table 1 indicates the following readings according to the font:
- Normal font: the Term in the table without a dash and preceding the one with a dash should be read before that Term. For example, “Avatar” and “- Model” will yield “Avatar Model.”
- Italic font: the Term in the table without a dash and preceding the one with a dash should be read after that Term. For example, “Avatar” and “- Portable” will yield “Portable Avatar.”
The full collection of MPAI Term definitions is available online.
Table 1 – General MPAI-HMC terms
| Term | Definition |
| Audio | Digital representation of an analogue audio signal sampled at a frequency between 8-192 kHz with a number of bits/sample between 8 and 32, and non-linear and linear quantisation. Data with characteristics of Audio may be synthetically produced. |
| Avatar | An Object rendered to represent a Human of a Machine in a virtual space. |
| – Model | An inanimate Avatar exposing animation interfaces. |
| – Portable | A Data Type including Avatar ID, Time, Audio-VisualScene Descriptors, Spatial Attitude, Avatar Model, Body Descriptors, Face Descriptors, Language Preference, Speech Coding, Speech Data, Text, and Personal Status [5]. |
| Centre Point | The point of an Object selected to have coordinates (0,0,0). |
| Context | Additional information about a communication emitted by an Entity, such as language, culture etc.. |
| Data | Information in digital form. |
| – Format | The standard digital representation of Data. |
| – Type | An instance of Data with a specific Data Format. |
| Descriptor | The Digital Representation of a feature of an Object. |
| – Body | A Data Type including the digital representation of the features of the body of a real or digital human. |
| – Face | A Data Type including the digital representation of a feature of the face of a real or digital human. |
| Digital Representation | Data corresponding to and representing a physical entity. |
| Environment | A Virtual Space that may be null or may include an Audio-Visual Scene. |
| Human | A human being in a real space. |
| – Digital | A Digitised or a Virtual Human. |
| – Digitised | An Object that has the appearance of a specific human when rendered. |
| – Virtual | An Object created by a computer that has a human appearance when rendered but is not a Digitised Human. |
| Identifier | The label uniquely associated with a human or an Object. |
| Instance | An element of a set of entities – Objects, Digital Humans etc. – belonging to some levels in a hierarchical classification (taxonomy). |
| – Audio | The instance of an Audio Object. |
| – Visual | The instance of a Visual Object. |
| Object | A data structure that can be rendered to cause an Experience. |
| – Audio | An Object described by Audio Descriptors. |
| – Audio-Visual | An Object described by Audio-Visual Descriptors. |
| – Body | A digital representation of the body of a Human or a Machine. |
| – Descriptor | The digital representation of the feature of an Object. |
| – Digital | A Digitised or a Virtual Object. |
| – Digitised | The digital representation of a real object. |
| – Face | The digital representation of the face of a Human or a Machine. |
| – Speech | An Object described by Speech Descriptors. |
| – Text | A string of Text. |
| – Virtual | An Object not representing an object in the real environment. |
| – Visual | An Object described by Visual Descriptors. |
| Orientation | The 3 Euler angles of an Object in a Virtual Space. |
| Position | The coordinates of a representative point for an object in a Virtual Space with respect to a set of coordinate axes. |
| Rendering | The process of instantiating a Virtual Space as a human-perceptible entity. |
| Scene | A composition of Objects located according to a Scene Geometry. |
| – Audio | A Scene composed of Audio Objects. |
| – Digital | A digitised scene or a Virtual Scene |
| – Audio-Visual | A Scene composed of Audio Objects, Visual Objects and co-located Audio-Visual Objects. |
| – Visual | A Scene composed of Visual Objects. |
| Scene Descriptors | The digital representation of a feature of a scene. |
| – Audio | A Data Type including the digital representation of the audio features of a digital scene. |
| – Audio-Visual | A Data Type combining the Audio or Visual Scene Descriptors. |
| – Visual | A Data Type including the digital representation of the visual features of a digital scene. |
| Scene Geometry | The digital representation of the Object arrangement of a Scene. |
| – Audio | A Data Type describing the Spatial arrangement of the Visual Objects of a Scene. |
| – Audio-Visual | A Data Type describing the Spatial arrangement of the Audio, Visual, and Audio-Visual Objects of a Scene. |
| – Visual | A Data Type describing the Spatial arrangement of the Visual Objects of a Scene. |
| Attitude | |
| – Spatial | Position and Orientation and their velocities and accelerations of a Human and Visual Object in a Virtual Environment. |
| Virtual Space | A space generated and maintained by a computing platform that can be rendered. |
| Speech | Digital representation of analogue speech sampled at a frequency between 8 kHz and 96 kHz with a number of bits/sample of 8, 16 or 24, and non-linear and linear quantisation or compressed. Data with characteristics of Speech may be synthetically produced. |