| 1 Definition | 2 Functional Requirements | 3 Syntax |
| 4 Semantics | 5 Conformance Testing | 6 Performance Assessment |
1 Definition
A Data Type including the 3D Model, Speech, Audio, Visual and Audio-Visual Objects of an Audio-Visual Scene and their arrangement in the Scene.
2 Functional Requirements
Audio-Visual Basic Scene Descriptors include:
- The ID of the Virtual Space where the Audio-Visual Basic Scene is located.
- The ID of the Audio-Visual Basic Scene Descriptors.
- The Space-Time info of the Audio-Visual Basic Scene.
- The number of:
- 3D Model Objects in the Audio-Visual Basic Scene.
- Speech Objects in the Audio-Visual Basic Scene.
- Audio Objects in the Audio-Visual Basic Scene.
- Visual Objects in the Audio-Visual Basic Scene.
- Audio-Visual Objects in the Audio-Visual Basic Scene.
- The 3D Model Objects including, for each 3D Model Object:
- The 3D Model Object Space-Time.
- The 3D Model Object.
- The Speech Objects including, for each Speech Object:
- The Speech Object Space-Time.
- The Speech Object.
- The Audio Objects including, for each Audio Object:
- The Audio Object Space-Time.
- The Audio Object.
- The Visual Objects including, for each Visual Object:
- The Visual Object Space-Time.
- The Visual Object.
- The Audio-Visual Objects including, for each Audio-Visual Object:
- The Audio-Visual Object Space-Time.
- The Audio-Visual Object.
3 Syntax
https://schemas.mpai.community/OSD/V1.3/data/AudioVisualBasicSceneDescriptors.json
4 Semantics
| Label | Size | Description |
| Header | N1 Bytes | Audio-Visual Basic Scene Descriptors Header |
| – Standard-AVBasicSceneDescriptors | 9 Bytes | The characters “OSD-BSD-V” |
| – Version | N2 Bytes | Major version – 1 or 2 characters |
| – Dot-separator | 1 Byte | The character “.” |
| – Subversion | N3 Bytes | Minor version – 1 or 2 characters |
| MInstanceID | N4 Bytes | Identifier of M-Instance. |
| AVBasicSceneDescriptorsID | N5 Bytes | Identifier of the AV Object. |
| AVBasicSceneSpaceTime | N6 Bytes | Data about AVScene’s Space and Time |
| 3DModelObjectCount | N7 Bytes | Number of 3D Model Objects in Scene |
| 3DModelObjectsData[] | N8 Bytes | Set of 3D Model Objects |
| – 3DModelObjectID and/or Object | N9 Bytes | 3D Model Object ID and/or Object |
| – 3DModelObjectSpaceTime | N10 Bytes | Space-Time of 3D Model Object |
| SpeechObjectCount | N11 Bytes | Number of Speech Objects in Scene |
| SpeechObjectsData[] | N12 Bytes | Set of Speech Objects |
| – SpeechObjectID and/or Object | N13 Bytes | Speech Object ID and/or Object |
| – SpeechObjectSpaceTime | N14 Bytes | Space-Time of Speech Object |
| AudioObjectCount | N15 Bytes | Number of Audio Objects in Scene |
| AudioObjectsData[] | N16 Bytes | Set of Audio Objects |
| – AudioObjectID and/or Object | N17 Bytes | Audio Object ID and/or Object |
| – AudioObjectSpaceTime | N18 Bytes | Space-Time of Audio Object |
| VisualObjectCount | N19 Bytes | Number of Visual Objects in Scene |
| VisualObjectsData[] | N20 Bytes | Set of Visual Objects |
| – VisualObjectID and/or Object | N21 Bytes | Visual Object ID and/or Object |
| – VisualObjectSpaceTime | N22 Bytes | Space-Time of Visual Object |
| AudioVisualObjectCount | N23 Bytes | Number of Audio-Visual Objects in Scene |
| AudioVisualObjectsData[] | N24 Bytes | Set of Audio-Visual Objects |
| – AudioVisualObjectID and/or Object | N25 Bytes | Audio-Visual Object ID and/or Object |
| – AudioObjectSpaceTime | N26 Bytes | Space-Time of Audio-Visual Object |
| DescrMetadata | N27 Bytes | Descriptive Metadata |
5 Conformance Testing
A Data instance Conforms with MPAI-OSD V1.2 Audio-Visual Basic Scene Descriptors (OSD-BSD) if:
- The Data validates against the Audio-Visual Basic Scene Descriptors’ JSON Schema.
- All Data in the Audio-Visual Basic Scene Descriptors’ JSON Schema
- Have the specified type
- Validate against their JSON Schemas
- Conform with their Data Qualifiers if present.