1. Functions

Speaker Identity Recognition (MMC-SIR)

  1. Receives Speech Object.
  2. Produces Speaker Identifier.

2      Reference Architecture

The Reference Architecture is depicted in Figure 1.

Figure 1 – The Speaker Identity Recognition AIM

3      I/O Data

Table 1 specifies the Input and Output Data of the Visual Scene Description AIM.

Table 1 – I/O Data of the Visual Scene Description AIM

Input Description
Speech Object Speech Object emitted by the Speaker.
Speech Time The start and end time of the Speech
Speech Overlap Number of speakers
Output Description
Speaker Identifier The Visual Descriptors of the Visual Scene.

5     JSON Metadata

https://schemas.mpai.community/MMC/V2.2/AIMs/SpeakerIdentityRecognition.json

5     SubAIMs

No SubAIMs

6. Profiles

No Profiles