1     Functions

Audio Object Identification (CAE-AOI):

Receives Audio Scene Geometry The spatial arrangements of the Audio Objects.
Audio Objects The individual input Audio Objects
Identifies The Audio Objects.
Produces The Audio Instance IDs The Instance Identifier of the Audio Objects.

2      Reference Architecture

Figure 1 depicts the Reference Architecture of the Audio Object Identification AIM.

Figure 1 – Audio Object Identification AIM

Note that the Audio Object Identification AIM should be able to parse either an Audio-Visual Scene Geometry or its Audio Scene Geometry subset.

3      I/O Data

Table 1 specifies the Input and Output Data of the Audio Object Identification AIM.

Table 1 – I/O Data of the Audio Object Identification AIM

Input Description
Audio Scene Geometry The digital representation of the spatial arrangement of the Visual Objects of the Scene.
Audio Objects The Audio Objects in the Audio Scene Geometry with an identifiable source target of identification.
Output Description
Audio Instance Identifier The Instance Identifier of the specific Audio Object.

4      SubAIMs

No SubAIMs.

5     JSON Metadata

https://schemas.mpai.community/CAE1/V2.2/AIMs/AudioObjectIdentification.json

6     Profiles

No Profiles.