1     Definition 2     Functional Requirements 3     Syntax
4     Semantics 5    Conformance Testing 6     Performance Assessment

1      Definition

A Data Type including a Speech Scene’s Speech Objects, and Space-Time and arrangement.

2      Functional Requirements

Speech Basic Scene Geometry include:

  1. The ID of a Virtual Space (M-Instance) where the Speech Basic Scene is or will be located.
  2. The ID of the Speech Basic Scene Descriptors.
  3. The number of Speech Objects in the Audio Basic Scene.
  4. The Space-Time Attributes of the Speech Basic Scene Descriptors.
  5. For each Speech Object in the Speech Basic Scene:
    1. The Space-Time info.
    2. The Speech Objects.

3      Syntax

https://schemas.mpai.community/MMC/V2.3/data/SpeechBasicSceneDescriptors.json

4      Semantics

Label Size Description
Header N1 Bytes Speech Basic Scene Descriptors Header
– Standard-SpeechSceneDescriptors 9 Bytes The characters “OSD-SBD-V”
– Version N2 Bytes Major version – 1 or 2 characters
– Dot-separator 1 Byte The character “.”
– Subversion N3 Bytes Minor version – 1 or 2 characters.
MInstanceID N4 Bytes Identifier of M-Instance.
SpeechBasicSceneDescriptorsID N5 Bytes Identifier of the Speech Basic Scene Descriptors.
SpeechObjectCount N6 Bytes Number of Speech Objects in Speech Basic Scene.
SpeechBasicSceneSpaceTime N8 Bytes Space-Time of Speech Basic Scene Descriptors.
SpeechBasicSceneSpeechData[] N9 Bytes Set of Speech Data.
– SpeechDataSpaceTime N10 Bytes Space and Time of Speech Data.
– SpeechtObject N12 Bytes Speech Object
DescrMetadata N17 Bytes Descriptive Metadata.

5     Conformance Testing

A Data instance Conforms with MPAI-MMC V2.3 Speech Basic Scene Descriptors (MMC-SBD) if:

  1. The Data validates against the Speech Basic Scene Descriptors’ JSON Schema.
  2. All Data in the  Speech Basic Scene Descriptors’ JSON Schema
    1. Have the specified type.
    2. Validate against their JSON Schemas.
    3. Conform with their Data Qualifiers if present.

6     Performance Assessment