1. Functions

Speaker Identity Recognition (MMC-SIR):

Receives Auxiliary Text related to the Speech Object.
Speech Object of which the Speaker id requested.
Speech Time for which a Speaker ID is requested.
Produces Speaker Identifier

2      Reference Architecture

The Reference Architecture is depicted in Figure 1.

Figure 1 – The Speaker Identity Recognition AIM

3      I/O Data

Table 1 specifies the Input and Output Data of the Visual Scene Description AIM.

Table 1 – I/O Data of the Visual Scene Description AIM

Input Description
Auxiliary Text Text with content related to Speaker ID.
Speech Object Speech Object emitted by the Speaker.
Speech Time The start and end time of the Speech
Speech Overlap Number of speakers
Output Description
Speaker Identifier The Visual Descriptors of the Visual Scene.

5     JSON Metadata

https://schemas.mpai.community/MMC/V2.2/AIMs/SpeakerIdentityRecognition.json

5     SubAIMs

No SubAIMs

6. Profiles

No Profiles