1     Function

Audio Analysis Transform (CAE-AAT):

Receives Multichannel Audio
Transforms Multichannel Audio into frequency bands via Fast Fourier Transform (FFT). The operations of the subsequent AIMs are carried out in discrete frequency bands. When such a configuration is used, a 50% overlap between subsequent Audio Blocks must be employed.
Produces Transform Multichannel Audio, a data structure comprising complex valued audio samples in the frequency domain.

2     Reference Architecture

Figure 1 depicts the Reference Architecture of the Audio Analysis Transform AIM.

Figure 1 – The Audio Analysis Transform AIM

4    I/O Data

Table 1 specifies the Input and Output Data of the Audio Analysis Transform AIM.

Table 1 – I/O Data of the Audio Analysis Transform AIM

Input Description
Multichannel Audio The output Audio of the Microphone Array.
Output Description
 Multichannel Audio  (Transform) The result of the application of the Fast Fourier Transform to Multichannel Audio.

5    SubAIMs

No SubAIMs.

6     JSON Metadata

https://schemas.mpai.community/CAE/V2.2/AIMs/AudioAnalysisTransform.json