| 1 Definition | 2 Functional Requirements | 3 Syntax |
| 4 Semantics | 5 Conformance Testing | 6 Performance Assessment |
1 Definition
A Data Type representing characteristic elements extracted from the input speech, specifically Pitch, Intensity, Tempo, Personal Status, and NNSpeechFeatures in a period of time.
2 Functional Requirements
Speech Descriptors may include Neural Network Descriptors.
3 Syntax
https://schemas.mpai.community/MMC/V2.4/data/SpeechDescriptors.json
4 Semantics
| Label | Description |
| Header | Speech Descriptors Header |
| – Standard – SpeechDescriptors | The characters “MMC-SPD-V” |
| – Version | Major version – 1 or 2 characters |
| – Dot-separator | The character “.” |
| – Subversion | Minor version – 1 or 2 characters |
| MInstanceID | ID of the Metaverse Instance. |
| SpeechDescriptorsID | ID of Speech Descriptors. |
| SpeechDescriptorsData | Data associated with Input Text. |
| NNSpeechFeatures | The output vector of a neural-network using Speech as input. |
| Duration | The Time in which the Speech Descriptors are computed. |
| Pitch | Real number measuring the fundamental frequency of Speech in Hz (Hertz). |
| Intensity | Real number measuring the Energy of Speech in dBs (decibel). |
| Tempo | Real number measuring the rate at which specified linguistic units (Phonemes, Syllables, or Words) are produced. |
| Personal Status | The Speech Personal Status carried by the input speech. |
5 Conformance Testing
A Data instance Conforms with MPAI-MMC V2.3 Speech Descriptors (MMC-SPD) if:
- The Data validates against the Speech Descriptors’ JSON Schema.
- All Data in the Speech Descriptors’ JSON Schema
- Have the specified type
- Validate against their JSON Schemas
- Conform with their Data Qualifiers if present.
6 Performance Assessment