CAE-6DF V 1.0 AIMs - 6DF Interpolation

<-AI Workflows Go to ToC Data Types->

Function

Ref. Model

1. Function

The 6DF Decompression (CAE-6DC) AIM AIM linearly interpolates the selected time-domain Diffuse Field using the Interpolation Parameters and performs a parametric interpolation in the frequency domain on the Selected Dominant Plane Wave Parameters. It produces the interpolated time-domain Diffuse Field and the Time-Domain Interpolated Direct Field.

Receives	Interpolation Parameters
	Selected Time Domain Diffuse Field
	Selected Dominant Plane Wave Parameters
Produces	Time Domain Interpolated Diffuse Sound Field
	Time Domain Interpolated Direct Sound Field

2. Reference Model

Figure 1 depicts the Reference Model of the 6DF Interpolation (CAE-6DI) AIM.

Figure 1 – Reference Model of the 6DF Interpolation (CAE-6DI) AIM

3. Input/Output Data

Table 1 – Input/Output Data of 6DF Decompression (CAE-6DC) AIM

Input	Description
Interpolation Parameters	Spatial and temporal weighting coefficients which are calculated dynamically based on the listener’s real-time position relative to the physical microphone arrays.
Selected Time Domain Diffuse Field	Multi-channel, non-directional ambient audio signal extracted from the physical microphone arrays.
Selected Dominant Plane Wave Parameters	These parameters consist of the estimated directions of arrival (DoAs), amplitudes, and distances of the primary, highly directional sound sources extracted from the active spherical microphone arrays.
Output	Description
TD Interpolated Diffuse Sound Field	The multi-channel, time-domain audio output representing the reconstructed ambient acoustics of the environment at the user’s virtual coordinates, generated by applying proximity-based linear interpolation weights directly to the time-domain diffuse signals extracted from the neighboring physical microphone arrays.
TD Interpolation Direct Sound Field	The multi-channel, time-domain audio output representing the localized, direct-path sound waves at the user’s virtual coordinates, synthesized from the interpolated plane-wave parameters (DoA and amplitude) and converted back into the time domain via the Inverse Short-Time Fourier Transform (ISTFT).

4. SubAIMs

4.1 Reference Model

Figure 1 depicts the Reference Model of the 6DF Interpolation (CAE-6DI) Composite AIM.

Figure 2 – Reference Model of the 6DF Interpolation (CAE-6DI) Composite AIM

4.2 Operation

4.3 Functions of AI Modules

Table 2 specifies the Function of the AI Modules.

Table 2 – Functions of Health Back End AI Modules

AI Module	Description
Plane Wave Parameter Interpolation	The PWI module performs spatial interpolation on parametric representations of a sound field, such as estimated Directions of Arrival (DoAs), source distances, and amplitudes, estimated at discrete, static physical microphone array positions by operating directly on these spatial metadata parameters rather than raw audio waveforms. This module enables the calculation of a spatially coherent, localized sound field representation at any arbitrary virtual listening coordinate within a 6DF environment.
Plane Wave Synthesis	The PWS AIM reconstructs a continuous physical sound field at the user’s virtual coordinates by synthesizing discrete directional plane waves from the interpolated parameters provided by the PWI. These synthesized plane waves are mathematically projected onto target rendering formats
Linear Interpolation	The LIP AIM calculates proximity-based weighting coefficients to blend diffuse field acoustic parameters between adjacent physical recording nodes.
Inverse Short-Time Fourier Transform	The ISTFT AIM the reconstructs the spatial audio channels from the time-frequency domain back into continuous time-domain waveforms. Utilizing an overlap-add (OLA) synthesis method.

4.4 I/O Data of AI Modules

Table 3 specifies the Input and Output Data of the AI Modules.

Table 3 – Input and Output Data of the AI Modules

AI Module	Receives	Produces
Plane Wave Parameter Interpolation	Interpolation Parameters	Interpolated Sparse Plane Wave
	Selected Dominant Plane Wave Parameters
Plane Wave Synthesis	Interpolated Sparse Plane Wave	FD Interpolated Direct Sound Field
Linear Interpolation	Interpolation Parameters	TD Interpolated Diffuse Sound Field
	Selected TD Diffuded Field
Inverse Short-Time Fourier Transform	FD Interpolated Direct Sound Field	TD Interpolated Direct Sound Field

4.5 AIMs and JSON Metadata

Table 4 provides the links to the AIW and AIM specifications and to the JSON schemas. AIMs/1 indicates that the column contains Composite AIMs and AIM2 indicates that the column contains their SubAIMs.

Table 4 – AIMs and JSON Metadata

AIM1	AIM2	Name	JSON
CAE-6DF		6DF Interpolation	File
	CAE-PWI	Plane Wave Parameter Interpolation	File
	CAE-PWS	Plane Wave Synthesis	File
	CAE-LIP	Linear Interpolation	File
	CAE-ISF	Inverse Short-Time Fourier Transform	File

5. JSON Metadata

https://schemas.mpai.community/CAE2/V1.1/AIMs/6DFInterpolation.json

6. Profiles

No Profiles

7. Reference Software

8. Conformance Testing

Table 2 – Conformance Testing Method for CAE-AAT AIM

Receives	Compressed Diffuse Dominant	Shall validate against the Audio Object schema. The Qualifier shall validate against the Audio Qualifier schema.


Produces	Decompressed Diffuse Dominant	Shall validate against the Audio Object schema. The Qualifier shall validate against the Audio Qualifier schema.

9. Performance Assessment