1     Definition 2     Functional Requirements 3     Syntax
4     Semantics 5    Conformance Testing 6     Performance Assessment

1      Definition

A Data Type including the Speech Objects of a scene, their sub-scenes, and their arrangement in the scene.

2      Functional Requirements

Speech Scene Descriptors include

  1. Speech Objects
  2. The Descriptors of the Speech Scenes includes in the Speech Scene called Speech Sub-Scenes.
  3. Rights that may be exercised on the Speech Scene.

Scenes may be hierarchical, i.e., they may contain Speech Objects and Speech Scenes.

3      Syntax

https://schemas.mpai.community/OSD/V1.4/data/SpeechSceneDescriptors.json

4      Semantics

Label Description
Header Speech Scene Descriptors Header
– Standard-SpeechSceneDescriptors The characters “OSD-SSD-V”
– Version Major version – 1 or 2 characters
– Dot-separator The character “.”
– Subversion Minor version – 1 or 2 characters
MInstanceID Identifier of M-Instance.
SceneDescriptorsID Identifier of Scene Descriptors.
SceneDescriptorsSpaceTime Space and Time of Scene Descriptors.
ObjectCount Number of Objects in Scene.
Objects[] Set of Objects.
– Object or ObjectID Object in the Scene of its ID.
– ObjectSpaceTime Space Time of Object.
SubSceneCount Number of Sub-Scenes in Scene.
SubScenes[] Set of Sub-Scenes in the Scene.
– SubScene or SubSceneID Sub-Scene in the Scene or its ID.
– SubSceneSpaceTime Space Time of Sub-Scene.
DescrMetadata Descriptive Metadata

5     Conformance Testing

A Data instance Conforms with Speech Scene Descriptors (OSD-SSD) if:

  1. The Data validates against the Scene Descriptors’ JSON Schema.
  2. All Data in the  Scene Descriptors’ JSON Schema
    1. Have the specified type
    2. Validate against their JSON Schemas
    3. Conform with their Data Qualifiers.

6     Performance Assessment