1     Definition 2     Functional Requirements 3     Syntax
4     Semantics 5    Conformance Testing 6     Performance Assessment

1      Definition

A Data Type including the Audio-Visual Scene’s Objects and Sub-Scenes and their arrangement in the Scene.

2      Functional Requirements

Audio-Visual Scene Descriptors include Scenes in addition to Objects.

3      Syntax

https://schemas.mpai.community/OSD/V1.2/data/AudioVisualSceneDescriptors.json

4      Semantics

| Label | Size | Description |
|---|---|---|
| Header | N1 Bytes | Audio-Visual Scene Descriptors Header |
| – Standard-AVSceneDescriptors | 9 Bytes | The characters “OSD-AVS-V” |
| – Version | N2 Bytes | Major version – 1 or 2 characters |
| – Dot-separator | 1 Byte | The character “.” |
| – Subversion | N3 Bytes | Minor version – 1 or 2 characters |
| MInstanceID | N4 Bytes | Identifier of M-Instance. |
| AVBasicSceneDescriptorsID | N5 Bytes | Identifier of the AV Object. |
| ObjectCount | N6 Bytes | Number of Objects in Scene |
| AVSceneSpaceTime | N7 Bytes | Data about Space and Time |
| SpeechObjects[] | N8 Bytes | Set of Speech Objects |
| – SpeechObject | N9 Bytes | Speech Object |
| – SpeechObjectSpaceTime | N10 Bytes | Space-Time of Speech Object |
| AudioObjects[] | N11 Bytes | Set of Audio Objects |
| – AudioObject | N12 Bytes | ID of Audio Object |
| – AudioObjectSpaceTime | N13 Bytes | Space-Time of Audio Object |
| VisualObjects[] | N14 Bytes | Set of Visual Objects |
| – VisualObjectID | N15 Bytes | ID of Visual Object |
| – VisualObjectSpaceTime | N16 Bytes | Space-Time of Visual Object |
| AudioVisualObjects[] | N17 Bytes | Set of Audio-Visual Objects |
| – AudioVisualObjectID | N18 Bytes | ID of Audio-Visual Object |
| – AudioObjectSpaceTime | N19 Bytes | Space-Time of Audio-Visual Object |
| SubSceneCount | N20 Bytes | Number of Sub-Scenes in Scene |
| SpeechSubScenes[] | N21 Bytes | Set of Speech SubScenes |
| – SpeechSubScene | N22 Bytes | Speech SubScene |
| – SpeechSubSceneSpaceTime | N23 Bytes | Space-Time of Speech SubScene |
| AudioSubScenes[] | N24 Bytes | Set of Audio SubScenes |
| – AudioSubScene | N25 Bytes | ID of Audio SubScene |
| – AudioSubSceneSpaceTime | N26 Bytes | Space-Time of Audio SubScene |
| VisualSubScenes[] | N27 Bytes | Set of Visual SubScenes |
| – VisualSubSceneID | N28 Bytes | ID of Visual SubScene |
| – VisualSubSceneSpaceTime | N29 Bytes | Space-Time of Visual SubScene |
| AudioVisualSubScenes[] | N30 Bytes | Set of Audio-Visual SubScenes |
| – AudioVisualSubSceneID | N31 Bytes | ID of Audio-Visual SubScene |
| – AudioSubSceneSpaceTime | N32 Bytes | Space-Time of Audio-Visual SubScene |
| DescrMetadata | N33 Bytes | Descriptive Metadata |
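
The Header rows above describe a string made of the fixed characters “OSD-AVS-V”, a major version, a dot, and a minor version. The following is a minimal Python sketch of building and parsing such a header. The function names `build_header` and `parse_header` are hypothetical, and the sketch assumes that the version characters are decimal digits, which the table does not state explicitly.

```python
import re
from typing import Tuple

# Layout taken from the Header rows: "OSD-AVS-V" + major + "." + minor,
# where major and minor are each 1 or 2 characters.
# Assumption: those characters are decimal digits.
HEADER_PATTERN = re.compile(r"OSD-AVS-V(\d{1,2})\.(\d{1,2})")


def build_header(major: int, minor: int) -> str:
    """Return a header string such as 'OSD-AVS-V1.2' (hypothetical helper)."""
    header = f"OSD-AVS-V{major}.{minor}"
    if HEADER_PATTERN.fullmatch(header) is None:
        raise ValueError(f"version {major}.{minor} does not fit the header layout")
    return header


def parse_header(header: str) -> Tuple[int, int]:
    """Split a header string into (major, minor) or raise ValueError."""
    match = HEADER_PATTERN.fullmatch(header)
    if match is None:
        raise ValueError(f"not an OSD-AVS header: {header!r}")
    return int(match.group(1)), int(match.group(2))
```

For the version named in this document, `build_header(1, 2)` yields `OSD-AVS-V1.2`, and `parse_header` reverses it. A version component with more than two characters is rejected, following the size limits in the table.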

5     Conformance Testing

A Data instance Conforms with Audio-Visual Scene Descriptors (OSD-AVS) V1.2 if:

  1. The Data validates against the Audio-Visual Scene Descriptors’ JSON Schema.
  2. All Data in the Audio-Visual Scene Descriptors’ JSON Schema:
    1. Have the specified type.
    2. Validate against their JSON Schemas.
    3. Conform with their Data Qualifiers, if present.
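
As an illustration of condition 2.1 only, the sketch below checks that a few top-level fields of a parsed instance have an expected type. It is not a substitute for validating against the JSON Schema referenced in the Syntax section. The `EXPECTED_TYPES` mapping and the function name `type_errors` are hypothetical, and the types listed are assumptions, since the authoritative types are defined by the schema itself.

```python
from typing import Any, Dict, List

# Hypothetical expected types for a handful of fields from the Semantics table.
# Assumption: counts are integers, the header is a string, object sets are arrays.
EXPECTED_TYPES: Dict[str, type] = {
    "Header": str,
    "ObjectCount": int,
    "SubSceneCount": int,
    "SpeechObjects": list,
    "AudioObjects": list,
    "VisualObjects": list,
    "AudioVisualObjects": list,
}


def type_errors(instance: Dict[str, Any]) -> List[str]:
    """Return one message per field whose value has an unexpected type."""
    errors: List[str] = []
    for name, expected in EXPECTED_TYPES.items():
        if name not in instance:
            # Whether a field is required is left to the JSON Schema.
            continue
        value = instance[name]
        # bool is a subclass of int in Python, so reject it for integer fields.
        is_bool_as_int = expected is int and isinstance(value, bool)
        if is_bool_as_int or not isinstance(value, expected):
            errors.append(
                f"{name}: expected {expected.__name__}, got {type(value).__name__}"
            )
    return errors
```

An empty list means that no type mismatch was found among the checked fields. Conditions 1, 2.2 and 2.3 still require a JSON Schema validator and the relevant Data Qualifier checks.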

6     Performance Assessment