1 Functions
Audio Object Identification (CAE-AOI):
| Receives | Audio Scene Geometry | The spatial arrangements of the Audio Objects. |
| Audio Objects | The individual input Audio Objects | |
| Identifies | The Audio Objects. | |
| Produces | The Audio Instance IDs | The Instance Identifier of the Audio Objects. |
2 Reference Architecture
Figure 1 depicts the Reference Architecture of the Audio Object Identification AIM.

Figure 1 – Audio Object Identification AIM
Note that the Audio Object Identification AIM should be able to parse either an Audio-Visual Scene Geometry or its Audio Scene Geometry subset.
3 I/O Data
Table 1 specifies the Input and Output Data of the Audio Object Identification AIM.
Table 1 – I/O Data of the Audio Object Identification AIM
| Input | Description |
| Audio Scene Geometry | The digital representation of the spatial arrangement of the Visual Objects of the Scene. |
| Audio Objects | The Audio Objects in the Audio Scene Geometry with an identifiable source target of identification. |
| Output | Description |
| Audio Instance Identifier | The Instance Identifier of the specific Audio Object. |
4 SubAIMs
No SubAIMs.
5 JSON Metadata
https://schemas.mpai.community/CAE1/V2.2/AIMs/AudioObjectIdentification.json
6 Profiles
No Profiles.