1      Definition

A Data Type including a Speech Scene’s Speech Objects, and Space-Time and arrangement.

2      Functional Requirements

Speech Basic Scene Geometry include:

  1. The ID of a Virtual Space (M-Instance) where the Speech Basic Scene is or will be located.
  2. The ID of the Speech Basic Scene Descriptors.
  3. The number of Speech Objects in the Audio Basic Scene.
  4. The Space-Time Attributes of the Speech Basic Scene Descriptors.
  5. For each Speech Object in the Audio Basic Scene:
    1. The Space-Time info.
    2. The Speech Objects.

3      Syntax

https://schemas.mpai.community/MMC/V2.2/data/SpeechBasicSceneDescriptors.json

4     Semantics

Label Size Description
Header N1 Bytes Speech Basic Scene Descriptors Header
– Standard-SpeechSceneDescriptors 9 Bytes The characters “OSD-SBD-V”
– Version N2 Bytes Major version – 1 or 2 characters
– Dot-separator 1 Byte The character “.”
– Subversion N3 Bytes Minor version – 1 or 2 characters.
MInstanceID N4 Bytes Identifier of M-Instance.
SpeechBasicSceneDescriptorsID N5 Bytes Identifier of the Speech Basic Scene Geometry.
SpeechObjectCount N6 Bytes Number of Speech Objects in Speech Basic Scene.
SpeechBasicSceneSpaceTime N8 Bytes Space and Time of Speech Basic Scene Geometry.
SpeechBasicSceneSpeechObjects[] N9 Bytes Set of Speech Objects.
– SpeechObjectSpaceTime N10 Bytes Space and Time of Speech Object.
– SpeechObjectData N12 Bytes Data of Speech Object (length and URI).
DescrMetadata N17 Bytes Descriptive Metadata.