1 Definition
An Object whose rendering has both Audio and Visual perceptibility attributes.
2 Functional Requirements
An Audio-Visual Object includes:
- The ID of a Virtual Space (M-Instance) where it is or will be located.
- The Space-Time location of the Audio-Visual Object.
- The IDs of the Speech, Audio, and Visual Objects and their Space-Time information.
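
The requirements above can be pictured as a small container type. The following is a minimal sketch only, not the normative format: the class names, field names, and the use of an opaque `dict` for Space-Time are assumptions made for illustration, loosely modeled on the labels in the Semantics table.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ComponentObject:
    """Hypothetical holder for one Speech, Audio, or Visual Object reference."""
    object_id: str
    # Space-Time is kept opaque here; its real structure is defined elsewhere.
    space_time: Optional[dict] = None


@dataclass
class AudioVisualObject:
    """Hypothetical in-memory view of an Audio-Visual Object."""
    m_instance_id: str               # Virtual Space (M-Instance) ID
    audio_visual_object_id: str      # ID of this Audio-Visual Object
    space_time: Optional[dict] = None
    speech: Optional[ComponentObject] = None
    audio: Optional[ComponentObject] = None
    visual: Optional[ComponentObject] = None

    def component_ids(self) -> List[str]:
        """Return the IDs of the components that are present, in fixed order."""
        parts = (self.speech, self.audio, self.visual)
        return [p.object_id for p in parts if p is not None]


avo = AudioVisualObject(
    m_instance_id="minst-01",
    audio_visual_object_id="avo-01",
    audio=ComponentObject("audio-01"),
    visual=ComponentObject("visual-01"),
)
print(avo.component_ids())
```

All three components are optional in this sketch so that an object carrying only some of them can still be represented; whether the schema permits that is not stated here and should be checked against the JSON schema.
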
3 Syntax
https://schemas.mpai.community/OSD/V1.1/data/AudioVisualObject.json
4 Semantics
| Label | Size | Description |
| --- | --- | --- |
| Header | N1 Bytes | Audio-Visual Object Header |
| – Standard-AudioVisualObject | 9 Bytes | The characters “OSD-AVO-V” |
| – Version | N2 Bytes | Major version – 1 or 2 Bytes |
| – Dot-separator | 1 Byte | The character “.” |
| – Subversion | N3 Bytes | Minor version – 1 or 2 Bytes |
| MInstanceID | N4 Bytes | Identifier of M-Instance |
| AudioVisualObjectID | N5 Bytes | Identifier of Audio-Visual Object |
| AudioVisualObjectSpaceTime | N6 Bytes | Space-Time of Audio-Visual Object |
| AudioVisualQualifier | N7 Bytes | Qualifier of the Audio-Visual Object |
| SpeechObjectData | N8 Bytes | Speech Object Data |
| – SpeechObjectID and/or Speech Object | N9 Bytes | Speech Object ID and/or Object |
| – SpeechObjectSpaceTime | N10 Bytes | Space-Time of Speech Object |
| AudioObjectData | N11 Bytes | Audio Object Data |
| – AudioObjectID and/or Audio Object | N12 Bytes | Audio Object ID and/or Object |
| – AudioObjectSpaceTime | N13 Bytes | Space-Time of Audio Object |
| VisualObjectData | N14 Bytes | Visual Object Data |
| – VisualObjectID and/or Visual Object | N15 Bytes | Visual Object ID and/or Object |
| – VisualObjectSpaceTime | N16 Bytes | Space-Time of Visual Object |
| DescrMetadata | N17 Bytes | Descriptive Metadata |
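
The Header rows above describe a string made of the characters “OSD-AVO-V”, a major version, a “.” separator, and a minor version. As an illustration, the sketch below builds and validates such a string. It assumes that each version field is written as 1 or 2 ASCII decimal digits, which the table does not state explicitly; the function names `build_avo_header` and `parse_avo_header` are hypothetical.

```python
import re
from typing import Tuple

# Assumption: major and minor versions are 1 or 2 ASCII decimal digits.
_HEADER_RE = re.compile(r"OSD-AVO-V([0-9]{1,2})\.([0-9]{1,2})")


def build_avo_header(major: int, minor: int) -> str:
    """Compose a header string such as 'OSD-AVO-V1.1'."""
    if not (0 <= major <= 99 and 0 <= minor <= 99):
        raise ValueError("major and minor must each fit in 1 or 2 digits")
    return f"OSD-AVO-V{major}.{minor}"


def parse_avo_header(header: str) -> Tuple[int, int]:
    """Return (major, minor) or raise ValueError if the header is malformed."""
    match = _HEADER_RE.fullmatch(header)
    if match is None:
        raise ValueError(f"not a valid Audio-Visual Object header: {header!r}")
    return int(match.group(1)), int(match.group(2))


header = build_avo_header(1, 1)
print(header, parse_avo_header(header))
```

`fullmatch` is used so that trailing characters or a missing separator are rejected rather than silently ignored.
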