Definition

A Data Type representing the prosody of a Speech Segment in terms of pitch, duration, and intensity per phoneme.

Syntax

https://schemas.mpai.community/MMC/V1.0/data/SpeechDescriptors.json

Semantics

Name Definition
pitch Indicates the fundamental frequency of Speech expressed as a real number indicating frequency as Hz (Hertz).
tone Tone is a variation in the pitch of the voice while speaking expressed as human readable words as in Table 36.
ToneType Indicates the Tone that the input speech carries.
intonation A variation of the pitch, intensity and speed within a time period measured in seconds.
intensity Energy of Speech expressed as a real number indicating dBs (decibel).
speed Indicates the Speech Rate as a real number indicating specified linguistic units (e.g., Phonemes, Syllables, or Words) per second.
emotion Indicates the Emotion that the input speech carries.
EmotionType Indicates the Emotion that the input speech carries.
toneName Specifies the name of a Tone.
toneSetName Name of the Tone set which contains the Tone. Tone set is used as a baseline, but other sets are possible.

Note: The semantics of “tone” defines a basic set of elements characterising tone. Elements can be added to the basic set or new sets defined using the registration procedure defined Personal Status.

Table 1 – Basic Tones

TONE CATEGORIES ADJECTIVAL Semantics
FORMALITY formal
informal
serious, official, polite
everyday, relaxed, casual
ASSERTIVENESS assertive
factual
hesitant
certain about content
neutral about content
uncertain about content
REGISTER (per situation or use case) conversational
directive
appropriate to an informal speaking
related to commands or requests for action