1      Definition

An Object whose rendering has both Audio and Visual perceptibility attributes.

2      Functional Requirements

An Audio-Visual Object includes:

  1. The ID of a Virtual Space (M-Instance) where it is or will be located.
  2. The Audio-Visual Object’s Space-Time location.
  3. The IDs of the Speech, Audio, and Visual Objects and their Space-Time information.
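
The three requirements above can be pictured as a small in-memory structure. The following is only an illustrative sketch, not the normative syntax: the class names `SubObjectRef` and `AudioVisualObject`, the attribute names, and the choice to treat Space-Time as an opaque dictionary are all assumptions made here for illustration. The JSON schema referenced in the Syntax section remains the authoritative definition.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SubObjectRef:
    """Hypothetical entry for one Speech, Audio, or Visual Object."""
    object_id: str                     # ID of the component Object
    space_time: Optional[dict] = None  # its Space-Time (treated as opaque here)


@dataclass
class AudioVisualObject:
    """Hypothetical in-memory view of the functional requirements."""
    m_instance_id: str                     # requirement 1: ID of the M-Instance
    space_time: Optional[dict] = None      # requirement 2: Space-Time of the whole Object
    speech: Optional[SubObjectRef] = None  # requirement 3: component Objects
    audio: Optional[SubObjectRef] = None
    visual: Optional[SubObjectRef] = None

    def component_ids(self) -> List[str]:
        """Return the IDs of whichever component Objects are present."""
        components = (self.speech, self.audio, self.visual)
        return [c.object_id for c in components if c is not None]


# Example values are made up for illustration.
avo = AudioVisualObject(
    m_instance_id="m-instance-01",
    audio=SubObjectRef("audio-07"),
    visual=SubObjectRef("visual-03"),
)
print(avo.component_ids())  # ['audio-07', 'visual-03']
```

Making each component optional is a design choice of this sketch; whether a given component may be absent is determined by the schema, not by this example.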

3      Syntax

https://schemas.mpai.community/OSD/V1.1/data/AudioVisualObject.json

4      Semantics

| Label | Size | Description |
|---|---|---|
| Header | N1 Bytes | Audio-Visual Object Header |
| – Standard-AudioVisualObject | 9 Bytes | The characters “OSD-AVO-V” |
| – Version | N2 Bytes | Major version – 1 or 2 Bytes |
| – Dot-separator | 1 Byte | The character “.” |
| – Subversion | N3 Bytes | Minor version – 1 or 2 Bytes |
| MInstanceID | N4 Bytes | Identifier of M-Instance |
| AudioVisualObjectID | N5 Bytes | Identifier of Audio-Visual Object |
| AudioVisualObjectSpaceTime | N6 Bytes | Space-Time of Audio-Visual Object |
| AudioVisualQualifier | N7 Bytes | Qualifier of the Audio-Visual Object |
| SpeechObjectData | N8 Bytes | Speech Object Data |
| – SpeechObjectID and/or Speech Object | N9 Bytes | Speech Object ID and/or Object |
| – SpeechObjectSpaceTime | N10 Bytes | Space-Time of Speech Object |
| AudioObjectData | N11 Bytes | Audio Object Data |
| – AudioObjectID and/or Audio Object | N12 Bytes | Audio Object ID and/or Object |
| – AudioObjectSpaceTime | N13 Bytes | Space-Time of Audio Object |
| VisualObjectData | N14 Bytes | Visual Object Data |
| – VisualObjectID and/or Visual Object | N15 Bytes | Visual Object ID and/or Object |
| – VisualObjectSpaceTime | N16 Bytes | Space-Time of Visual Object |
| DescrMetadata | N17 Bytes | Descriptive Metadata |
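
The Header fields listed above (the nine characters “OSD-AVO-V”, a 1- or 2-byte major version, a dot, and a 1- or 2-byte minor version) can be checked mechanically. The sketch below shows one possible way to do so. It assumes that Version and Subversion are written as ASCII decimal digits, which the table does not state explicitly; the function names `parse_header` and `build_header` are hypothetical and not part of the specification.

```python
import re
from typing import Tuple

# Assumption: Version and Subversion are ASCII decimal digits, 1 or 2 each.
HEADER_RE = re.compile(r"OSD-AVO-V([0-9]{1,2})\.([0-9]{1,2})")


def parse_header(header: str) -> Tuple[int, int]:
    """Split an Audio-Visual Object header into (major, minor) integers."""
    match = HEADER_RE.fullmatch(header)
    if match is None:
        raise ValueError(f"not a valid Audio-Visual Object header: {header!r}")
    return int(match.group(1)), int(match.group(2))


def build_header(major: int, minor: int) -> str:
    """Build a header string; each number must fit in 1 or 2 digits."""
    if not (0 <= major <= 99 and 0 <= minor <= 99):
        raise ValueError("major and minor must each fit in 1 or 2 digits")
    return f"OSD-AVO-V{major}.{minor}"


print(parse_header("OSD-AVO-V1.1"))  # (1, 1)
print(build_header(1, 1))            # OSD-AVO-V1.1
```

Under this assumption the header is between 12 and 14 bytes long: 9 for the fixed prefix, 1 or 2 for the major version, 1 for the dot, and 1 or 2 for the minor version.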