1     Definition 2     Functional Requirements 3     Syntax
4     Semantics 5    Conformance Testing 6     Performance Assessment

1      Definition

A Data Type including the Objects of an Audio-Visual Scene and their arrangement in the Scene.

2      Functional Requirements

Audio-Visual Basic Scene Descriptors includes:

  1. The ID of a Virtual Space where the Audio-Visual Basic Scene is or will be located.
  2. The ID of the Audio-Visual Basic Scene Descriptors.
  3. The number of
    1. Speech Objects in the Audio-Visual Basic Scene.
    2. Audio Objects in the Audio-Visual Basic Scene.
    3. Visual Objects in the Audio-Visual Basic Scene.
    4. Audio-Visual Objects in the Audio-Visual Basic Scene.
  4. The Audio-Visual Basic Scene Space-Time info.
  5. The Audio Objects including, for each Speech Object:
    1. The Speech Object Space-Time.
    2. The Speech Object.
  6. The Audio Objects including, for each Audio Object:
    1. The Audio Object Space-Time.
    2. The Audio Object.
  7. The Visual Objects including, for each Visual Object:
    1. The Visual Object Space-Time.
    2. The Visual Object.
  8. The Audio-Visual Objects including, for each Audio-Visual Object:
    1. The Audio-Visual Object Space-Time.
    2. The Audio-Visual Object.

3      Syntax

https://schemas.mpai.community/OSD/V1.2/data/AudioVisualBasicSceneDescriptors.json

4      Semantics

Label Size Description
Header N1 Bytes Audio-Visual Basic Scene Descriptors Header
– Standard-AVScene 9 Bytes The characters “OSD-BSD-V”
– Version N2 Bytes Major version – 1 or 2 characters
– Dot-separator 1 Byte The character “.”
– Subversion N3 Bytes Minor version – 1 or 2 characters
MInstanceID N4 Bytes Identifier of M-Instance.
AVBasicSceneDescriptorsID N5 Bytes Identifier of the AV Object.
AVBasicSceneSpaceTime N7 Bytes Data about AVScene’s Space and Time
AudioObjectCount N6 Bytes Number of Audio Objects in Scene
AudioObjectsData[] N8 Bytes Set of Audio Objects
– AudioObjectID and/or Object N9 Bytes Audio Object ID and/or Object
– AudioObjectSpaceTime N10 Bytes Space-Time of Audio Object
SpeechObjectCount N6 Bytes Number of Speech Objects in Scene
SpeechObjectsData[] N11 Bytes Set of Speech Objects
– SpeechObjectID  and/or Object N12 Bytes Speech Object ID and/or Object
– SpeechObjectSpaceTime N13 Bytes Space-Time of Speech Object
VisualObjectCount N6 Bytes Number of Visual Objects in Scene
VisualObjectsData[] N14 Bytes Set of Visual Objects
– VisualObjectID  and/or Object N15 Bytes Visual Object ID and/or Object
– VisualObjectSpaceTime N16 Bytes Space-Time of Visual Object
AudioVisualObjectCount N6 Bytes Number of Audio-Visual Objects in Scene
AudioVisualObjectsData[] N17 Bytes Set of Audio-Visual Objects
– AudioVisualObjectID and/or Object N18 Bytes Audio-Visual Object ID and/or Object
– AudioObjectSpaceTime N19 Bytes Space-Time of Audio-Visual Object
DescrMetadata N20 Bytes Descriptive Metadata

5     Conformance Testing

A Data instance Conforms with MPAI-OSD V1.2 Audio-Visual Basic Scene Descriptors (OSD-BSD) if:

  1. The Data validates against the Audio-Visual Basic Scene Descriptors’ JSON Schema.
  2. All Data in the  Audio-Visual Basic Scene Descriptors’ JSON Schema
    1. Have the specified type
    2. Validate against their JSON Schemas
    3. Conform with their Data Qualifiers if present.

6     Performance Assessment