1 Definition
A Data Type including a Speech Scene’s Speech Objects, and Space-Time and arrangement.
2 Functional Requirements
Speech Basic Scene Geometry include:
- The ID of a Virtual Space (M-Instance) where the Speech Basic Scene is or will be located.
- The ID of the Speech Basic Scene Descriptors.
- The number of Speech Objects in the Audio Basic Scene.
- The Space-Time Attributes of the Speech Basic Scene Descriptors.
- For each Speech Object in the Audio Basic Scene:
- The Space-Time info.
- The Speech Objects.
3 Syntax
https://schemas.mpai.community/MMC/V2.2/data/SpeechBasicSceneDescriptors.json
4 Semantics
Label | Size | Description |
Header | N1 Bytes | Speech Basic Scene Descriptors Header |
– Standard-SpeechSceneDescriptors | 9 Bytes | The characters “OSD-SBD-V” |
– Version | N2 Bytes | Major version – 1 or 2 characters |
– Dot-separator | 1 Byte | The character “.” |
– Subversion | N3 Bytes | Minor version – 1 or 2 characters. |
MInstanceID | N4 Bytes | Identifier of M-Instance. |
SpeechBasicSceneDescriptorsID | N5 Bytes | Identifier of the Speech Basic Scene Geometry. |
SpeechObjectCount | N6 Bytes | Number of Speech Objects in Speech Basic Scene. |
SpeechBasicSceneSpaceTime | N8 Bytes | Space and Time of Speech Basic Scene Geometry. |
SpeechBasicSceneSpeechObjects[] | N9 Bytes | Set of Speech Objects. |
– SpeechObjectSpaceTime | N10 Bytes | Space and Time of Speech Object. |
– SpeechObjectData | N12 Bytes | Data of Speech Object (length and URI). |
DescrMetadata | N17 Bytes | Descriptive Metadata. |