1 Definition | 2 Functional Requirements | 3 Syntax |
4 Semantics | 5 Conformance Testing | 6 Performance Assessment |
1 Definition
A Data Type composed of:
- Content represented as Data whose rendering has vocal attributes.
- Speech Qualifier.
- Space-Time information.
2 Functional Requirements
A Speech Object includes:
- The ID of a Virtual Space (M-Instance) where it is or is intended to be located.
- The ID of the Speech Object.
- The ID(s) of Parent Object(s) supporting two cases:
- The Parent Object has spawned two (or more) Objects. That is, two Objects are now distinguished where only one was before.
- Two (or more) Parent Objects have merged into one.
- The Space-Time information of all Parent Objects in the M-Instance.
- The Speech Object Space-Time information.
- The Speech Data Qualifier.
- The Speech Data Annotations, including:
- Annotation
- Annotation Space-Time
- Process Action IDs
- The Speech Data Length and URI:
- The length in Bytes of the Speech Data.
- The URI of the Speech Data.
3 Syntax
https://schemas.mpai.community/MMC/V2.3/data/SpeechObject.json
4 Semantics
Label | Size | Description |
Header | N1 Bytes | Speech Object Data Header |
– Standard-SpeechObject | 9 Bytes | The characters “MMC-SPO-V” |
– Version | N2 Bytes | Major version – 1 or 2 characters |
– Dot-separator | 1 Byte | The character “.” |
– Subversion | N3 Bytes | Minor version – 1 or 2 characters |
MInstanceID | N4 Bytes | Identifier of M-Instance. |
SpeechObjectID | N5 Bytes | Identifier of the Speech Object. |
ParentSpeechObjects[] | N6 Bytes | Identifier(s) of Parent Speech Objects. |
– ParentAudioObjectID | N7 Bytes | ID of a Parent Audio Object |
– ParentAudioDataSpaceTime | N8 Bytes | Space Time info of Parent Audio Data |
SpeechDataSpace-Time | N9 Bytes | Space-Time info of Data Object. |
SpeechDataQualifier |
N10 Bytes | Speech Data Qualifier. |
SpeechDataAnnotations[] | N11 Bytes | Annotations of Speech Data |
– Annotation | N12 Bytes | ID of Annotation |
– AnnotationSpaceTime | N13 Bytes | Where/when Annotation is attached. |
– ProcessActionID | N14 Bytes | What is possible to do with the Annotation |
SpeechDataLength | N15 Bytes | Number of Bytes of Speech Data |
SpeechDataURI | N16 Bytes | URI of Data of Speech Data |
DescrMetadata | N17 Bytes | Descriptive Metadata |
5 Conformance Testing
A Data instance Conforms with MPAI-MMC V2.3 Speech Object (MMC-SPO) if:
- The Data validates against the Speech Object’s JSON Schema.
- All Data in the Speech Object’s JSON Schema
- Have the specified type
- Validate against their JSON Schemas
- Conform with their Data Qualifiers if present.
6 Performance Assessment