
1. Technical Specifications 2. Reference Software 3. Conformance Testing 4. Performance Assessment

1. Technical Specifications

Table 1 provides the links to the specifications and the JSON syntax of all AIMs specified by Technical Specification: Context-based Audio Enhancement (MPAI-CAE) – Use Cases (CAE-USC) V2.4. The AI Modules specified by CAE-USC V2.4 supersede those specified by previous versions; the superseded AIMs may still be used if their Version is explicitly signalled. AIMs in bold are Composite.

Table 1 – Specifications and JSON syntax of AIMs used by CAE-USC V2.4

Acronym | AIM Name | JSON
CAE-AAP | Audio Analysis for Preservation | File
CAE-AAT | Audio Analysis Transform | File
CAE-AMX | Audio Descriptors Multiplexing | File
CAE-AOI | Audio Object Identification | File
CAE-ASE | Audio Separation and Enhancement | File
CAE-ASL | Audio Source Localisation | File
CAE-AST | Audio Synthesis Transform | File
CAE-EFP | Emotion Feature Production | File
CAE-NEI | Neural Emotion Insertion | File
CAE-NCM | Noise Cancellation Module | File
CAE-PAP | Packaging for Audio Preservation | File
CAE-PEI | Prosodic Emotion Insertion | File
CAE-SFD | Sound Field Description | File
CAE-SDS | Speech Detection and Separation | File
CAE-SF1 | Speech Feature Analysis 1 | File
CAE-SF2 | Speech Feature Analysis 2 | File
CAE-SMC | Speech Model Creation | File
CAE-SRA | Speech Restoration Assembly | File
CAE-TAR | Tape Audio Restoration | File
CAE-TIC | Tape Irregularity Classification | File
CAE-VAP | Video Analysis for Preservation | File

2. Reference Software

As a rule, MPAI provides Reference Software implementing the AI Modules. The Reference Software is released with the BSD-3-Clause licence and the following disclaimers:

  1. The CAE-USC V2.4 Reference Software Implementation, if in source code, is released with the BSD-3-Clause licence.
  2. The purpose of this Reference Software is to provide a working Implementation of CAE-USC V2.4, not to provide a ready-to-use product.
  3. MPAI disclaims the suitability of the Software for any other purposes and does not guarantee that it is secure.
  4. Use of this Reference Software may require acceptance of licences from the respective copyright holders. Users shall verify that they have the right to use any third-party software required by this Reference Software.

Note that at this stage CAE-USC V2.4 specifies Reference Software only for some AIMs.

3. Conformance Testing

An Implementation of an AI Module conforms with CAE-USC V2.4 if it accepts as input and produces as output Data and/or Data Objects (the combination of Data of a Data Type and its Qualifier) conforming with those specified by CAE-USC V2.4.

The Conformance is expressed by one of the following two statements:

  1. “Data conforms with the relevant (Non-MPAI) standard” – for Data.
  2. “Data validates against the Data Type Schema” – for Data Object.

The latter statement implies that:

  1. Any Sub-Type of the Data conforms with the relevant Sub-Type specification of the applicable Qualifier.
  2. Any Content and Transport Format of the Data conform with the relevant Format specification of the applicable Qualifier.
  3. Any Attribute of the Data
    1. Conforms with the relevant (Non-MPAI) standard – for Data, or
    2. Validates against the Data Type Schema – for Data Object.

The method to Test the Conformance of an instance of Data or Data Object is specified in the Data Types chapter.
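As a rough illustration of the second statement above, the following Python sketch validates a hypothetical Data Object instance against a hypothetical Data Type Schema using the jsonschema package. The schema, the instance, and the field names are assumptions made for illustration only; the normative JSON Schemas are those linked in Table 1.

```python
from jsonschema import Draft202012Validator

# Hypothetical Data Type Schema; the normative JSON Schemas are linked in Table 1.
DATA_TYPE_SCHEMA = {
    "type": "object",
    "properties": {
        "Qualifier": {
            "type": "object",
            "properties": {
                "SubType": {"type": "string"},
                "Format": {"type": "string"},
            },
            "required": ["SubType", "Format"],
        },
        "Data": {"type": "string"},
    },
    "required": ["Qualifier", "Data"],
}

# Hypothetical Data Object instance (Data of a Data Type plus its Qualifier).
data_object = {
    "Qualifier": {"SubType": "Audio", "Format": "WAV"},
    "Data": "base64-encoded payload goes here",
}

validator = Draft202012Validator(DATA_TYPE_SCHEMA)
errors = sorted(validator.iter_errors(data_object), key=lambda e: list(e.path))
if not errors:
    print("Data Object validates against the Data Type Schema")
else:
    for error in errors:
        print(f"Conformance failure at {list(error.path)}: {error.message}")
```

A complete Conformance Test would apply the same check to every input and output Data Object of the AIM under test, together with the Sub-Type, Format, and Attribute checks listed above.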

Note that at this stage CAE-USC V2.4 specifies Conformance Testing only for some AIMs.

4. Performance Assessment

Performance is an umbrella term used to describe a variety of attributes – some specific to the application domain the Implementation intends to address. Therefore, Performance Assessment Specifications provide methods and procedures to measure how well an AIW or an AIM performs its function. Performance Assessment of an Implementation includes methods and procedures for all or a subset of the following characteristics:

  1. Quality – for instance, how well a Face Identity Recognition AIM recognises faces, how precise or error-free are the changes in a Visual Scene detected by a Visual Change Detection AIM, or how satisfactory are the responses provided by an Answer to Multimodal Question AIW (see the sketch after this list).
  2. Robustness – for instance, how robust the operation of an Implementation is with respect to duration of operation, load scaling, etc.
  3. Extensibility – for instance, the degree of confidence a user can have in an Implementation when it deals with data outside of its stated application scope.
  4. Bias – for instance, how dependent the inference is on specific features of the training data, as in Company Performance Prediction, where the accuracy of the prediction may change widely based on the size or the geographic position of a Company, or in face recognition for Television Media Analysis.
  5. Legality – for instance, in which jurisdictions the use of an AIM or an AIW complies with a regulation, e.g., the European AI Act.
  6. Ethics – for instance, the conformity of an AIM or an AIW to a target ethical standard.
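As an illustration of how a Quality characteristic might be quantified, the sketch below computes the identification accuracy of a hypothetical face-identity recogniser over a labelled test set. The recogniser interface, the test data, and the choice of metric are assumptions made for illustration and are not part of any MPAI Performance Assessment Specification.

```python
from typing import Callable, Iterable, Tuple

def identification_accuracy(
    recognise: Callable[[str], str],
    labelled_samples: Iterable[Tuple[str, str]],
) -> float:
    """Fraction of test samples for which the hypothetical recogniser
    returns the expected identity label."""
    total = 0
    correct = 0
    for sample_path, expected_identity in labelled_samples:
        total += 1
        if recognise(sample_path) == expected_identity:
            correct += 1
    return correct / total if total else 0.0

# Hypothetical usage with a stub recogniser and a toy test set.
test_set = [("img_001.png", "alice"), ("img_002.png", "bob"), ("img_003.png", "alice")]
stub_recogniser = lambda path: "alice"  # stands in for a real Face Identity Recognition AIM
print(f"Quality (accuracy): {identification_accuracy(stub_recogniser, test_set):.2f}")
```

A Performance Assessment Specification would additionally fix the test set, the test conditions, and the thresholds that an Implementation must meet for the characteristic being assessed.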

Note that at this stage CAE-USC V2.4 specifies Performance Assessment only for some AIMs.
