<-AI Workflows Go to ToC Data Types->

Function Ref. Model I/O Data SubAIMs JSON MData Profiles Ref. Software Conformance Performance

1. Function

The 6DF Decompression (CAE-6DC) AIM AIM linearly interpolates the selected time-domain Diffuse Field using the Interpolation Parameters and performs a parametric interpolation in the frequency domain on the Selected Dominant Plane Wave Parameters.  It produces the interpolated time-domain Diffuse Field and the Time-Domain Interpolated Direct Field.

Receives Interpolation Parameters
Selected Time Domain Diffuse Field
Selected Dominant Plane Wave Parameters
Produces Time Domain Interpolated Diffuse Sound Field
Time Domain Interpolated Direct Sound Field

2. Reference Model

Figure 1 depicts the Reference Model of the 6DF Interpolation (CAE-6DI) AIM.

 

Figure 1 – Reference Model of the 6DF Interpolation (CAE-6DI) AIM

3. Input/Output Data

Table 1 – Input/Output Data of 6DF Decompression (CAE-6DC) AIM

Input Description
Interpolation Parameters Spatial and temporal weighting coefficients which are calculated dynamically based on the listener’s real-time position relative to the physical microphone arrays.
Selected Time Domain Diffuse Field Multi-channel, non-directional ambient audio signal extracted from the physical microphone arrays.
Selected Dominant Plane Wave Parameters These parameters consist of the estimated directions of arrival (DoAs), amplitudes, and distances of the primary, highly directional sound sources extracted from the active spherical microphone arrays.
Output Description
TD Interpolated Diffuse Sound Field The multi-channel, time-domain audio output representing the reconstructed ambient acoustics of the environment at the user’s virtual coordinates, generated by applying proximity-based linear interpolation weights directly to the time-domain diffuse signals extracted from the neighboring physical microphone arrays.
TD Interpolation Direct Sound Field The multi-channel, time-domain audio output representing the localized, direct-path sound waves at the user’s virtual coordinates, synthesized from the interpolated plane-wave parameters (DoA and amplitude) and converted back into the time domain via the Inverse Short-Time Fourier Transform (ISTFT).

4. SubAIMs

4.1 Reference Model

Figure 1 depicts the Reference Model of the 6DF Interpolation (CAE-6DI) Composite AIM.

Figure 2 – Reference Model of the 6DF Interpolation (CAE-6DI) Composite AIM

4.2 Operation

4.3 Functions of AI Modules

Table 2 specifies the Function of the AI Modules.

Table 2 – Functions of Health Back End AI Modules

AI Module Description
Plane Wave Parameter Interpolation The PWI module performs spatial interpolation on parametric representations of a sound field, such as estimated Directions of Arrival (DoAs), source distances, and amplitudes, estimated at discrete, static physical microphone array positions by operating directly on these spatial metadata parameters rather than raw audio waveforms. This module enables the calculation of a spatially coherent, localized sound field representation at any arbitrary virtual listening coordinate within a 6DF environment.
Plane Wave Synthesis The PWS AIM reconstructs a continuous physical sound field at the user’s virtual coordinates by synthesizing discrete directional plane waves from the interpolated parameters provided by the PWI. These synthesized plane waves are mathematically projected onto target rendering formats
Linear Interpolation The LIP AIM calculates proximity-based weighting coefficients to blend diffuse field acoustic parameters between adjacent physical recording nodes.
Inverse Short-Time Fourier Transform The ISTFT AIM the reconstructs the spatial audio channels from the time-frequency domain back into continuous time-domain waveforms. Utilizing an overlap-add (OLA) synthesis method.

4.4 I/O Data of AI Modules

Table 3 specifies the Input and Output Data of the AI Modules.

Table 3 – Input and Output Data of the AI Modules

AI Module Receives Produces
Plane Wave Parameter Interpolation Interpolation Parameters Interpolated Sparse Plane Wave
Selected Dominant Plane Wave Parameters
Plane Wave Synthesis Interpolated Sparse Plane Wave FD Interpolated Direct Sound Field
Linear Interpolation Interpolation Parameters TD Interpolated Diffuse Sound Field
Selected TD Diffuded Field
Inverse Short-Time Fourier Transform FD Interpolated Direct Sound Field TD Interpolated Direct Sound Field

4.5 AIMs and JSON Metadata

Table 4 provides the links to the AIW and AIM specifications and to the JSON schemas. AIMs/1 indicates that the column contains Composite AIMs and AIM2 indicates that the column contains their SubAIMs.

Table 4 – AIMs and JSON Metadata

AIM1 AIM2 Name JSON
CAE-6DF 6DF Interpolation File
CAE-PWI Plane Wave Parameter Interpolation File
CAE-PWS Plane Wave Synthesis File
CAE-LIP Linear Interpolation File
CAE-ISF Inverse Short-Time Fourier Transform File

5. JSON Metadata

https://schemas.mpai.community/CAE2/V1.1/AIMs/6DFInterpolation.json

6. Profiles

No Profiles

7. Reference Software

8. Conformance Testing

Table 2 – Conformance Testing Method for CAE-AAT AIM

Receives Compressed Diffuse Dominant Shall validate against the Audio Object schema.
The Qualifier shall validate against the Audio Qualifier schema.
Produces Decompressed Diffuse Dominant Shall validate against the Audio Object schema.
The Qualifier shall validate against the Audio Qualifier schema.

9. Performance Assessment

 

<-AI Workflows Go to ToC Data Types->