1     Definition 2     Functional Requirements 3     Syntax
4     Semantics 5    Conformance Testing 6     Performance Assessment

1      Definition

A Data Type composed of:

  1. Content represented as Data whose rendering has vocal attributes.
  2. Speech Qualifier.
  3. Space-Time information.

2      Functional Requirements

A Speech Object includes:

  1. The ID of a Virtual Space (M-Instance) where it is or is intended to be located.
  2. The ID of the Speech Object.
  3. The ID(s) of Parent Object(s) supporting two cases:
    1. The Parent Object has spawned two (or more) Objects. That is, two Objects are now distinguished where only one was before.
    2. Two (or more) Parent Objects have merged into one.
  4. The Space-Time information of all Parent Objects in the M-Instance.
  5. The Speech Object Space-Time information.
  6. The Speech Data Qualifier.
  7. The Speech Data Annotations, including:
    1. Annotation
    2. Annotation Space-Time
    3. Process Action IDs
  8. The Speech Data Length and URI:
    1. The length in Bytes of the Speech Data.
    2. The URI of the Speech Data.

3      Syntax

https://schemas.mpai.community/MMC/V2.3/data/SpeechObject.json

4      Semantics

Label Size Description
Header N1 Bytes Speech Object Data Header
– Standard-SpeechObject 9 Bytes The characters “MMC-SPO-V”
– Version N2 Bytes Major version – 1 or 2 characters
– Dot-separator 1 Byte The character “.”
– Subversion N3 Bytes Minor version – 1 or 2 characters
MInstanceID N4 Bytes Identifier of M-Instance.
SpeechObjectID N5 Bytes Identifier of the Speech Object.
ParentSpeechObjects[] N6 Bytes Identifier(s) of Parent Speech Objects.
ParentAudioObjectID N7 Bytes ID of a Parent Audio Object
ParentAudioDataSpaceTime N8 Bytes Space Time info of Parent Audio  Data
SpeechDataSpace-Time N9 Bytes Space-Time info of Data Object.
SpeechDataQualifier
N10 Bytes Speech Data Qualifier.
SpeechDataAnnotations[] N11 Bytes Annotations of Speech Data
– Annotation N12 Bytes ID of Annotation
– AnnotationSpaceTime N13 Bytes Where/when Annotation is attached.
– ProcessActionID N14 Bytes What is possible to do with the Annotation
SpeechDataLength N15 Bytes Number of Bytes of Speech Data
SpeechDataURI N16 Bytes URI of Data of Speech Data
DescrMetadata N17 Bytes Descriptive Metadata

5     Conformance Testing

A Data instance Conforms with MPAI-MMC V2.3 Speech Object (MMC-SPO) if:

  1. The Data validates against the Speech Object’s JSON Schema.
  2. All Data in the  Speech Object’s JSON Schema
    1. Have the specified type
    2. Validate against their JSON Schemas
    3. Conform with their Data Qualifiers if present.

6     Performance Assessment