1. Definitions

A Data Type that includes

  1. A set of avatar-related Data: M-InstanceID, Avatar ID, Space-Time,  Avatar, Face Descriptors, Language Selector, Text, Speech Object, Personal Status, and
  2. Descriptors of the Audio-Visual Scene where the Avatar may be embedded and its Space-Time.

2. Functional Requirements of Portable Avatar

Portable Avatar provides the following information:

  1. The ID of the Virtual Space
  2. The set of Data characterising a speaking avatar.
    1. The M-Instance in which the the Avatar is located.
    2. The Space-Time of the Avatar.
    3. The Language Preference of the Avatar.
    4. The Text the Avatar is associated with, or which will be converted into a Speech Object.
    5. The Speech Model used to synthesise Text.
    6. The Speech Object that the Avatar utters.
    7. The Personal Status of the Avatar.
    8. The Space-Time information of the Avatar embedded in the Audio-Visual Scene.

3. Syntax of Portable Avatar

https://schemas.mpai.community/PAF/V1.2/data/PortableAvatar.json

4. Semantics of Portable Avatar

Label Size Description
Header N1 Bytes The Header of the Portable Avatar Data.
– Standard-PortableAvatar 9 Bytes The characters “PAF-PAV-V”
– Version N2 Bytes Major version
– Dot-separator 1 Byte The character “.”
– Subversion N3 Byte Minor version
MInstanceID N4 Bytes The ID of the M-Instance
PortableAvatarID N5 Bytes Identifier of the Portable Avatar.
AvatarData N6 Bytes Set of Data related to Avatar
– Avatar N7 Bytes Model of Avatar.
– PortableAvatarSpaceTime N8 Bytes Space-Time of Avatar instance in AV Scene.
– LanguageSelector N9 Bytes Avatar’s Language Preference.
– TextObject N10 Bytes Text associated with Avatar.
– SpeechObject N11 Bytes Set of Data related to Speech Object.
– SpeechModel N12 Bytes Neural Network Model for Speech Synthesis.
– PersonalStatus N13 Bytes Personal Status of Avatar.
AudioVisualScene N14 Bytes Set of Data related to AV Scene.
– AudioVisualSceneDescriptors N15 Bytes AV Scene Descriptors.
– AudioVisualSceneSpaceTime N16 Bytes Space and Time info of AV Scene in M-instance.
DescrMetadata N17 Bytes Descriptive Metadata