<-Scope     Go to ToC       References ->

The Terms used in this standard whose first letter is capital have the meaning defined in Table 1. All MPAI-defined Terms are accessible online..

Table 1Table of terms and definitions

Term Definition
Access Copy Files Set of files providing the information stored in an audio tape recording, including Restored Audio Files, suitable for audio information access, but not for long-term preservation.
Audio Block A set of consecutive Audio samples.
Audio Channel A sequence of Audio Blocks.
Audio Data Digital representation of an analogue audio signal sampled at a frequency between 8-192 kHz with a number of bits/sample between 8 and 64.
Audio File An Audio Object having a File Transport.
Audio Object Audio Data and optional metadata regarding Sub-Types, Formats and Attributes of the Audio Data.
Audio Scene Geometry A Data Type describing the spatial arrangement of the Audio Objects and Sub-Scenes of a Scene.
Audio Segment An Audio Block with Start Time and an End Time Labels corresponding to the time of the first and last sample of the Audio Segment, respectively.
Audio-Visual File An Audio-Visual Object having a File Transport.
Capstan The capstan is a rotating spindle used to move recording tape through the mechanism of a tape recorder.
Damaged List A list of strings of Texts corresponding to the Damaged Segments (if any) requiring replacement with synthetic segments.
Damaged Section An Audio Segment which is damaged in its entirety and is contained in a Damaged Segment.
Damaged Segment An Audio Segment containing only speech (and not containing music or other sounds) which is either damaged in its entirety or contains one or more Damaged Sections specified in the Damaged List.
Degree Strength of a feature, specifically, with respect to Emotion, “High,” “Medium,” or “Low.”
Editing List The description of the speed, equalisation and reading backwards corrections occurred during the restoration process.
Emotion A Data Type representing the internal status of a human or avatar resulting from their interaction with the context or subsets of it, such as “Angry”, and “Sad”.
Emotionless Speech An Audio File containing speech without music and other sounds, and in which little or no identifiable emotion is perceptible by native listeners.
Irregularity An event of interest to preservation in Table 26 and Table 27
Irregularity File A JSON file containing information about Irregularities of the ARP inputs.
Irregularity Image An image corresponding to an Irregularity.
JSON JavaScript object notation [18].
Microphone Array Geometry Description of the position of each microphone comprising the microphone array and specific characteristics such as microphone type, look directions, and the array type.
Model Utterance An Audio Segment used as a model or demonstration of the Emotion to be added to Emotionless Speech in order to produce Speech with Emotion.
Multichannel Audio A data structure containing at least 2 time-aligned interleaved Audio Channels.
Multichannel Audio Stream A data structure containing Audio Objects packaged with Audio Scene Geometry.
Neural Network Speech Model A Neural Network Model trained on Speech Segments for Modelling and used to synthesize replacements for the entire Damaged Segment or Damaged Sections within it.
Passthrough AIM An AIM with the same input and output data of an AIM without executing the Function of that AIM. E.g., a Noise Cancellation AIM that does not cancel the noise.
Preservation Audio File The input Audio File resulting from the digitisation of an audio open-reel tape to be preserved and, in case, restored.
Preservation Audio-Visual File The input Audio-Visual File produced by a camera pointed to the playback head of the magnetic tape recorder and the synchronised Audio resulting from the tape digitisation process.
Preservation Image A Video frame extracted from Preservation Audio-Visual File.
Preservation Master Files Set of files providing the information stored in an audio tape recording without any restoration. As soon as the original analogue recordings is no more accessible, it becomes the new item for long-term preservation.
Restored Audio Files Set of Audio Files derived from the Preservation Audio File, where potential speed, equalisation or reading backwards errors that occurred in the digitisation process have been corrected.
Restored Speech Segment An Audio Segment in which the entire segment has been replaced by a synthetic speech segment, or in which each Damaged Segment has been replaced by a synthetic speech segment.
Speech Features Descriptor representing a variety of information elements incorporated in a Speech Segment, e.g., personal identity, Personal Status, additional factors such as vocal tension, creakiness, whispery quality, etc.
Speech Segments for Modelling A set of Audio Files containing speech segments used to train the Neural Network Speech Model.
Speech With Emotion File An Audio File containing speech with emotional features.
Spherical Coordinate System A coordinate system where the position of a point is specified by three numbers: the radial distance of that point from a fixed origin, its polar angle measured from a fixed zenith direction, and the azimuthal angle of its orthogonal projection on a reference plane.
Spherical Grid Resolution The maximum spherical angle between any two neighbouring sampled points on a sphere.
Text List List of texts to be converted into speech by the Speech Synthesis for Restoration AIM.
Time Code Number of ms from 1970-01-01T00:00:00.000 according to [8].
Time Label A measure of time from a context-dependent zero time expressed as HH:mm:ss.SSS.
Transform Audio An Audio Object whose data are represented in the Frequency Domain.
Enhanced Transform Audio Transform Audio whose samples are Enhanced Transform Audio samples.
Useful Signal Digital signal resulting from the A/D conversion of the analogue signal recorded in an audio tape.

<-Scope     Go to ToC       References ->