1 Definition | 2 Functional Requirements | 3 Syntax |
4 Semantics | 5 Conformance Testing | 6 Performance Assessment |
1 Definition
A Data Type including the 3D Model, Speech, Audio, Visual and Audio-Visual Objects of an Audio-Visual Scene and their arrangement in the Scene.
2 Functional Requirements
Audio-Visual Basic Scene Descriptors include:
- The ID of the Virtual Space where the Audio-Visual Basic Scene is located.
- The ID of the Audio-Visual Basic Scene Descriptors.
- The Space-Time info of the Audio-Visual Basic Scene.
- The number of:
- 3D Model Objects in the Audio-Visual Basic Scene.
- Speech Objects in the Audio-Visual Basic Scene.
- Audio Objects in the Audio-Visual Basic Scene.
- Visual Objects in the Audio-Visual Basic Scene.
- Audio-Visual Objects in the Audio-Visual Basic Scene.
- The 3D Model Objects including, for each 3D Model Object:
- The 3D Model Object Space-Time.
- The 3D Model Object.
- The Speech Objects including, for each Speech Object:
- The Speech Object Space-Time.
- The Speech Object.
- The Audio Objects including, for each Audio Object:
- The Audio Object Space-Time.
- The Audio Object.
- The Visual Objects including, for each Visual Object:
- The Visual Object Space-Time.
- The Visual Object.
- The Audio-Visual Objects including, for each Audio-Visual Object:
- The Audio-Visual Object Space-Time.
- The Audio-Visual Object.
3 Syntax
https://schemas.mpai.community/OSD/V1.3/data/AudioVisualBasicSceneDescriptors.json
4 Semantics
Label | Size | Description |
Header | N1 Bytes | Audio-Visual Basic Scene Descriptors Header |
– Standard-AVBasicSceneDescriptors | 9 Bytes | The characters “OSD-BSD-V” |
– Version | N2 Bytes | Major version – 1 or 2 characters |
– Dot-separator | 1 Byte | The character “.” |
– Subversion | N3 Bytes | Minor version – 1 or 2 characters |
MInstanceID | N4 Bytes | Identifier of M-Instance. |
AVBasicSceneDescriptorsID | N5 Bytes | Identifier of the AV Object. |
AVBasicSceneSpaceTime | N6 Bytes | Data about AVScene’s Space and Time |
3DModelObjectCount | N7 Bytes | Number of 3D Model Objects in Scene |
3DModelObjectsData[] | N8 Bytes | Set of 3D Model Objects |
– 3DModelObjectID and/or Object | N9 Bytes | 3D Model Object ID and/or Object |
– 3DModelObjectSpaceTime | N10 Bytes | Space-Time of 3D Model Object |
SpeechObjectCount | N11 Bytes | Number of Speech Objects in Scene |
SpeechObjectsData[] | N12 Bytes | Set of Speech Objects |
– SpeechObjectID and/or Object | N13 Bytes | Speech Object ID and/or Object |
– SpeechObjectSpaceTime | N14 Bytes | Space-Time of Speech Object |
AudioObjectCount | N15 Bytes | Number of Audio Objects in Scene |
AudioObjectsData[] | N16 Bytes | Set of Audio Objects |
– AudioObjectID and/or Object | N17 Bytes | Audio Object ID and/or Object |
– AudioObjectSpaceTime | N18 Bytes | Space-Time of Audio Object |
VisualObjectCount | N19 Bytes | Number of Visual Objects in Scene |
VisualObjectsData[] | N20 Bytes | Set of Visual Objects |
– VisualObjectID and/or Object | N21 Bytes | Visual Object ID and/or Object |
– VisualObjectSpaceTime | N22 Bytes | Space-Time of Visual Object |
AudioVisualObjectCount | N23 Bytes | Number of Audio-Visual Objects in Scene |
AudioVisualObjectsData[] | N24 Bytes | Set of Audio-Visual Objects |
– AudioVisualObjectID and/or Object | N25 Bytes | Audio-Visual Object ID and/or Object |
– AudioObjectSpaceTime | N26 Bytes | Space-Time of Audio-Visual Object |
DescrMetadata | N27 Bytes | Descriptive Metadata |
5 Conformance Testing
A Data instance Conforms with MPAI-OSD V1.2 Audio-Visual Basic Scene Descriptors (OSD-BSD) if:
- The Data validates against the Audio-Visual Basic Scene Descriptors’ JSON Schema.
- All Data in the Audio-Visual Basic Scene Descriptors’ JSON Schema
- Have the specified type
- Validate against their JSON Schemas
- Conform with their Data Qualifiers if present.