1 Function
Audio Analysis Transform (CAE-AAT):
Receives | Multichannel Audio |
Transforms | Multichannel Audio into frequency bands via Fast Fourier Transform (FFT). The operations of the subsequent AIMs are carried out in discrete frequency bands. When such a configuration is used, a 50% overlap between subsequent Audio Blocks must be employed. |
Produces | Transform Multichannel Audio, a data structure comprising complex valued audio samples in the frequency domain. |
2 Reference Architecture
Figure 1 depicts the Reference Architecture of the Audio Analysis Transform AIM.
Figure 1 – The Audio Analysis Transform AIM
4 I/O Data
Table 1 specifies the Input and Output Data of the Audio Analysis Transform AIM.
Table 1 – I/O Data of the Audio Analysis Transform AIM
Input | Description |
Audio Object | The output Audio of the Microphone Array. |
Output | Description |
AudioObject (Transform) | The result of the application of the Fast Fourier Transform to Multichannel Audio. |
5 SubAIMs
No SubAIMs.
6 JSON Metadata
https://schemas.mpai.community/CAE/V2.2/AIMs/AudioAnalysisTransform.json