1. Definitions
A Data Type that includes
- A set of avatar-related Data: M-InstanceID, Avatar ID, Space-Time, Avatar, Face Descriptors, Language Selector, Text, Speech Object, Personal Status, and
- Descriptors of the Audio-Visual Scene where the Avatar may be embedded and its Space-Time.
2. Functional Requirements of Portable Avatar
Portable Avatar provides the following information:
- The ID of the Virtual Space
- The set of Data characterising a speaking avatar.
- The M-Instance in which the the Avatar is located.
- The Space-Time of the Avatar.
- The Language Preference of the Avatar.
- The Text the Avatar is associated with, or which will be converted into a Speech Object.
- The Speech Model used to synthesise Text.
- The Speech Object that the Avatar utters.
- The Personal Status of the Avatar.
- The Space-Time information of the Avatar embedded in the Audio-Visual Scene.
3. Syntax of Portable Avatar
https://schemas.mpai.community/PAF/V1.2/data/PortableAvatar.json
4. Semantics of Portable Avatar
Label | Size | Description |
Header | N1 Bytes | The Header of the Portable Avatar Data. |
– Standard-PortableAvatar | 9 Bytes | The characters “PAF-PAV-V” |
– Version | N2 Bytes | Major version |
– Dot-separator | 1 Byte | The character “.” |
– Subversion | N3 Byte | Minor version |
MInstanceID | N4 Bytes | The ID of the M-Instance |
PortableAvatarID | N5 Bytes | Identifier of the Portable Avatar. |
AvatarData | N6 Bytes | Set of Data related to Avatar |
– Avatar | N7 Bytes | Model of Avatar. |
– PortableAvatarSpaceTime | N8 Bytes | Space-Time of Avatar instance in AV Scene. |
– LanguageSelector | N9 Bytes | Avatar’s Language Preference. |
– TextObject | N10 Bytes | Text associated with Avatar. |
– SpeechObject | N11 Bytes | Set of Data related to Speech Object. |
– SpeechModel | N12 Bytes | Neural Network Model for Speech Synthesis. |
– PersonalStatus | N13 Bytes | Personal Status of Avatar. |
AudioVisualScene | N14 Bytes | Set of Data related to AV Scene. |
– AudioVisualSceneDescriptors | N15 Bytes | AV Scene Descriptors. |
– AudioVisualSceneSpaceTime | N16 Bytes | Space and Time info of AV Scene in M-instance. |
DescrMetadata | N17 Bytes | Descriptive Metadata |