1. Functions

Extracts Speaker Time, Speech Object, and Speech Overlap from a Speech file.

2. Reference Architecture

Figure 1 depicts the Reference Architecture of the Audio Diarisation AIM.

 

Figire 1 – Reference Model of Audio Diarisation AIM

3. I/O Data

Table 1 specifies the Input and Output Data of the Audio Diarisation AIM.

Table 1 – I/O Data of the Audio Diarisation AIM

Input Description
Speech File Input Speech  file
Output Description
Speaker Time Time Speaker starts speaking
Speech Overlap Number of speakers
Speech Object Speaker’s Speech Object

4. SubAIMs

No SubAIMs

5. JSON Metadata

https://schemas.mpai.community/MMC/V2.2/AIMs/AudioDiarisation.json

6. Profiles

No Profiles