1    Function

Audio Source Localisation (CAE-ASL):

Receives Transform Multichannel Audio (with associated Microphone Array information)
Detects Audio Objects in the Audio Scene.
Determines Audio Objects’ Spatial Attitudes
Produces Audio Spatial Attitudes

2     Reference Architecture

Figure 1 depicts the Reference Architecture of the Audio Source Localisation AIM.

Figure 1 – Audio Source Localisation AIM

3    I/O Data

Table 1 specifies the Input and Output Data of the Audio Source Localisation AIM.

Table 1 – Audio Source Localisation AIM

Input Description
Multichannel Audio (Trf) The result of the application of the Fast Fourier Transform to the Multichannel Audio (with associated Microphone Array info).
Output Description
Audio Spatial Attitudes The Orientations and Directions of Audio Objects.

4    SubAIMs

No SubAIMs.

5. JSON Syntax

https://schemas.mpai.community/CAE1/V2.2/AIMs/AudioSourceLocalisation.json

6     Profiles

No Profiles