1     Definition 2     Functional Requirements 3     Syntax
4     Semantics 5    Conformance Testing 6     Performance Assessment

1      Definition

A Data Type including the 3D Model, Speech, Audio, Visual and Audio-Visual Objects of an Audio-Visual Scene and their arrangement in the Scene.

2      Functional Requirements

Audio-Visual Basic Scene Descriptors include:

  1. The ID of the Virtual Space where the Audio-Visual Basic Scene is located.
  2. The ID of the Audio-Visual Basic Scene Descriptors.
  3. The Space-Time info of the Audio-Visual Basic Scene.
  4. The number of:
    1. 3D Model Objects in the Audio-Visual Basic Scene.
    2. Speech Objects in the Audio-Visual Basic Scene.
    3. Audio Objects in the Audio-Visual Basic Scene.
    4. Visual Objects in the Audio-Visual Basic Scene.
    5. Audio-Visual Objects in the Audio-Visual Basic Scene.
  5. The 3D Model Objects including, for each 3D Model Object:
    1. The 3D Model Object Space-Time.
    2. The 3D Model Object.
  6. The Speech Objects including, for each Speech Object:
    1. The Speech Object Space-Time.
    2. The Speech Object.
  7. The Audio Objects including, for each Audio Object:
    1. The Audio Object Space-Time.
    2. The Audio Object.
  8. The Visual Objects including, for each Visual Object:
    1. The Visual Object Space-Time.
    2. The Visual Object.
  9. The Audio-Visual Objects including, for each Audio-Visual Object:
    1. The Audio-Visual Object Space-Time.
    2. The Audio-Visual Object.

3      Syntax

https://schemas.mpai.community/OSD/V1.3/data/AudioVisualBasicSceneDescriptors.json

4      Semantics

Label Size Description
Header N1 Bytes Audio-Visual Basic Scene Descriptors Header
– Standard-AVBasicSceneDescriptors 9 Bytes The characters “OSD-BSD-V”
– Version N2 Bytes Major version – 1 or 2 characters
– Dot-separator 1 Byte The character “.”
– Subversion N3 Bytes Minor version – 1 or 2 characters
MInstanceID N4 Bytes Identifier of M-Instance.
AVBasicSceneDescriptorsID N5 Bytes Identifier of the AV Object.
AVBasicSceneSpaceTime N6 Bytes Data about AVScene’s Space and Time
3DModelObjectCount N7 Bytes Number of 3D Model Objects in Scene
3DModelObjectsData[] N8 Bytes Set of 3D Model Objects
– 3DModelObjectID and/or Object N9 Bytes 3D Model Object ID and/or Object
– 3DModelObjectSpaceTime N10 Bytes Space-Time of 3D Model Object
SpeechObjectCount N11 Bytes Number of Speech Objects in Scene
SpeechObjectsData[] N12 Bytes Set of Speech Objects
– SpeechObjectID and/or Object N13 Bytes Speech Object ID and/or Object
– SpeechObjectSpaceTime N14 Bytes Space-Time of Speech Object
AudioObjectCount N15 Bytes Number of Audio Objects in Scene
AudioObjectsData[] N16 Bytes Set of Audio Objects
– AudioObjectID and/or Object N17 Bytes Audio Object ID and/or Object
– AudioObjectSpaceTime N18 Bytes Space-Time of Audio Object
VisualObjectCount N19 Bytes Number of Visual Objects in Scene
VisualObjectsData[] N20 Bytes Set of Visual Objects
– VisualObjectID and/or Object N21 Bytes Visual Object ID and/or Object
– VisualObjectSpaceTime N22 Bytes Space-Time of Visual Object
AudioVisualObjectCount N23 Bytes Number of Audio-Visual Objects in Scene
AudioVisualObjectsData[] N24 Bytes Set of Audio-Visual Objects
– AudioVisualObjectID and/or Object N25 Bytes Audio-Visual Object ID and/or Object
– AudioObjectSpaceTime N26 Bytes Space-Time of Audio-Visual Object
DescrMetadata N27 Bytes Descriptive Metadata

5     Conformance Testing

A Data instance Conforms with MPAI-OSD V1.2 Audio-Visual Basic Scene Descriptors (OSD-BSD) if:

  1. The Data validates against the Audio-Visual Basic Scene Descriptors’ JSON Schema.
  2. All Data in the  Audio-Visual Basic Scene Descriptors’ JSON Schema
    1. Have the specified type
    2. Validate against their JSON Schemas
    3. Conform with their Data Qualifiers if present.

6     Performance Assessment