Definition

Data representing various features of a Speech Segment, including speaker identity, prosody, and additional vocal elements including tension, whispery quality, or creaky voice.

Syntax

https://schemas.mpai.community/MMC/V2.1/data/SpeechDescriptors.json

Semantics

Name Definition
SpeechFeatures Characteristic elements extracted from the input speech, specifically pitch, tone, intonation, intensity, speed, emotion, and NNspeechFeatures.
NNSpeechFeatures Specifically neural-network-based characteristic elements extracted from the input speech by Neural Network
pitch Indicates the fundamental frequency of Speech expressed as a real number indicating frequency as Hz (Hertz).
tone Tone is a variation in the pitch of the voice while speaking expressed as human readable words as in Table 20.
ToneType Indicates the Tone that the input speech carries.
intonation A variation of the pitch, intensity and speed within a time period measured in seconds.
intensity Energy of Speech expressed as a real number indicating dBs (decibel).
speed Indicates the Speech Rate as a real number indicating specified linguistic units (e.g., Phonemes, Syllables, or Words) per second.
emotion Indicates the Emotion that the input speech carries.
EmotionType Indicates the Emotion that the input speech carries.
toneName Specifies the name of a Tone.
toneSetName Name of the Tone set which contains the Tone. Tone set is used as a baseline, but other sets are possible.

Note: The semantics of “tone” defines a basic set of elements characterising tone. Elements can be added to the basic set or new sets defined using the registration procedure defined for Emotion Sets.

Table – Basic Tones

TONE CATEGORIES ADJECTIVAL Semantics
FORMALITY formal
informal
serious, official, polite
everyday, relaxed, casual
ASSERTIVENESS assertive
factual
hesitant
certain about content
neutral about content
uncertain about content
REGISTER (per situation or use case) conversational
directive
appropriate to an informal speaking
related to commands or requests for action