Definition
A Data Type representing the prosody of a Speech Segment in terms of pitch, duration, and intensity per phoneme.
Syntax
https://schemas.mpai.community/MMC/V1.0/data/SpeechDescriptors.json
Semantics
Name | Definition |
pitch | Indicates the fundamental frequency of Speech expressed as a real number indicating frequency as Hz (Hertz). |
tone | Tone is a variation in the pitch of the voice while speaking expressed as human readable words as in Table 36. |
ToneType | Indicates the Tone that the input speech carries. |
intonation | A variation of the pitch, intensity and speed within a time period measured in seconds. |
intensity | Energy of Speech expressed as a real number indicating dBs (decibel). |
speed | Indicates the Speech Rate as a real number indicating specified linguistic units (e.g., Phonemes, Syllables, or Words) per second. |
emotion | Indicates the Emotion that the input speech carries. |
EmotionType | Indicates the Emotion that the input speech carries. |
toneName | Specifies the name of a Tone. |
toneSetName | Name of the Tone set which contains the Tone. Tone set is used as a baseline, but other sets are possible. |
Note: The semantics of “tone” defines a basic set of elements characterising tone. Elements can be added to the basic set or new sets defined using the registration procedure defined Personal Status.
Table 1 – Basic Tones
TONE CATEGORIES | ADJECTIVAL | Semantics |
FORMALITY | formal informal |
serious, official, polite everyday, relaxed, casual |
ASSERTIVENESS | assertive factual hesitant |
certain about content neutral about content uncertain about content |
REGISTER (per situation or use case) | conversational directive |
appropriate to an informal speaking related to commands or requests for action |