1     Functions

Audio Scene Description (CAE-ASD):

Receives Multichannel Audio
Computes Audio Objects and Audio Scene Geometry
Produces Audio Scene Descriptors

2      Reference Architecture

Figure 1 depicts the Reference Architecture of the Audio Scene Description AIM.

Figure 1 – The Audio Scene Description (CAE-ASD) AIM

3      I/O Data

Table 1 specifies the Input and Output Data of the Audio Scene Description AIM.

Table 1 – I/O Data of the Audio Scene Description (CAE-ASD) AIM

Input                     Description
Multichannel Audio        Input Audio (with associated Microphone Array info)
Output                    Description
Audio Scene Descriptors   Output Audio Scene Descriptors
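
The I/O Data above can be pictured as two container types. The following is only an illustrative sketch: the class names, field names, and the choice of plain Python lists are assumptions made for this example and are not defined by this specification. In particular, the idea that Audio Scene Descriptors carry the Audio Objects together with the Audio Scene Geometry is inferred from the Functions section, not from a normative data format.

```python
from dataclasses import dataclass
from typing import List, Tuple

Position = Tuple[float, float, float]  # hypothetical (x, y, z) in metres


@dataclass
class MicrophoneArrayInfo:
    # Assumption: one position per microphone in the array.
    positions_m: List[Position]


@dataclass
class MultichannelAudio:
    # Assumption: one list of samples per channel, all at the same rate.
    sample_rate_hz: int
    channels: List[List[float]]
    array_info: MicrophoneArrayInfo

    def __post_init__(self) -> None:
        # The input is described as audio "with associated Microphone Array
        # info", so this sketch requires one microphone per channel.
        if len(self.channels) != len(self.array_info.positions_m):
            raise ValueError("channel count must match microphone count")


@dataclass
class AudioSceneDescriptors:
    # Assumption: the output bundles what the AIM computes.
    audio_objects: List[List[float]]
    scene_geometry: List[Position]


array_info = MicrophoneArrayInfo(positions_m=[(0.0, 0.0, 0.0), (0.1, 0.0, 0.0)])
audio_in = MultichannelAudio(48000, [[0.0, 0.1], [0.0, 0.2]], array_info)
print(len(audio_in.channels), "channels at", audio_in.sample_rate_hz, "Hz")
```

The validation in `__post_init__` is a design choice of the sketch; a real implementation would follow the data formats referenced by the specification.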

4      SubAIMs

Audio Scene Description (CAE-ASD) is a Composite AIM whose structure is depicted in Figure 2.

Figure 2 – The Audio Scene Description (CAE-ASD) Composite AIM

The specifications of the CAE-ASD Basic AIMs are provided in Table 2.

Table 2 – Basic AIMs of Audio Scene Description (CAE-ASD)

AIM       Name
CAE-AAT   Audio Analysis Transform
CAE-ASL   Audio Source Localisation
CAE-ASE   Audio Separation and Enhancement
CAE-AST   Audio Synthesis Transform
CAE-ADM   Audio Descriptors Multiplexing
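
As a rough illustration of how a Composite AIM delegates work to its Basic AIMs, the sketch below registers the five Basic AIMs listed above and passes a data record through placeholder stages. The linear chain in table order is an assumption made for the example; the actual connections are those shown in Figure 2, which is not reproduced here. The stage functions are hypothetical stand-ins that only record which Basic AIM handled the data and perform no signal processing.

```python
from typing import Callable, Dict, List

# Acronyms and names of the Basic AIMs, as listed in the table above.
BASIC_AIMS: Dict[str, str] = {
    "CAE-AAT": "Audio Analysis Transform",
    "CAE-ASL": "Audio Source Localisation",
    "CAE-ASE": "Audio Separation and Enhancement",
    "CAE-AST": "Audio Synthesis Transform",
    "CAE-ADM": "Audio Descriptors Multiplexing",
}

Stage = Callable[[dict], dict]


def make_placeholder(acronym: str) -> Stage:
    """Return a hypothetical stage that only appends its acronym to a trace."""
    def stage(data: dict) -> dict:
        return {**data, "trace": data.get("trace", []) + [acronym]}
    return stage


def run_composite(data: dict, order: List[str]) -> dict:
    """Pass data through the named Basic AIMs in the given (assumed) order."""
    for acronym in order:
        if acronym not in BASIC_AIMS:
            raise KeyError(f"unknown Basic AIM: {acronym}")
        data = make_placeholder(acronym)(data)
    return data


result = run_composite({"input": "Multichannel Audio"}, list(BASIC_AIMS))
print(result["trace"])
```

Keeping the order as a parameter lets the same sketch be rewired once the connections of Figure 2 are known.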

5     JSON Metadata

https://schemas.mpai.community/CAE1/V2.2/AIMs/AudioSceneDescription.json
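
The JSON Metadata at the URL above can be retrieved and inspected with standard tooling. The sketch below uses only the Python standard library and makes no assumption about the fields the document contains: it downloads the text and lists whatever top-level keys are present. The function names are chosen for this example, and the download is not executed automatically.

```python
import json
import urllib.request
from typing import List

METADATA_URL = (
    "https://schemas.mpai.community/CAE1/V2.2/AIMs/AudioSceneDescription.json"
)


def fetch_text(url: str, timeout: float = 10.0) -> str:
    """Download a document and return it as text (assumes UTF-8 encoding)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


def top_level_keys(json_text: str) -> List[str]:
    """Return the sorted top-level keys of a JSON object."""
    document = json.loads(json_text)
    if not isinstance(document, dict):
        raise ValueError("expected a JSON object at the top level")
    return sorted(document)
```

A typical use would be `top_level_keys(fetch_text(METADATA_URL))`, which requires network access.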

6      Profiles

No Profiles