<- Avatar-Based Videoconference– Go to ToC Data Types->
AI Modules (AIM) are interconnected processing units composing an AI Workflow (AW). There are two types of AIM:
- Composite AIMs are AI Modules composed of multiple AI Modules.
- Basic AIMs do not include (or do not expose) AIMs inside.
This Chapter specifies the Composite AIMs of Table 1 .
Table 1 – Composite AIMs of MPAI-PAF
Acronym | AIM Name | Versions |
|
PAF-PSD | Personal Status Display | V1.0 |
This Chapter also specifies the Basic AIMs of Table 2.
Table 2 – Basic AIMs of MPAI PAF- Note that V1.1 is currently not approved.
Acronym | AIM Name |
PAF-AVR | Audio-Visual Scene Rendering |
PAF-FIR | Face Identity Recognition |
PAF-IBD | Input Body Description |
PAF-IFD | Input Face Description |
PAF-PFI | PS-Face Interpretation |
PAF-PGI | PS-Gesture Interpretation |
PAF-PMX | Portable Avatar Multiplexing |
PAF-PSD | Personal Status Display |
This Chapter also provides the full set of AIWs, Composite AIMs, and AIMs used by the four Avatar-Based Videoconference subsystems.
Videoconference Client Transmitter
AIW and AIMs | Name | JSON | ||
PA-CTX | Videoconference Client Transmitter | X | ||
– | OSD-AVS | Audio-Visual Scene Description | X | |
– | CAE-ASD | Audio Scene Description | X | |
– | CAE-AAT | Audio Analysis Transform | X | |
– | CAE-ASL | Audio Source Localisation | X | |
– | CAE-ASE | Audio Separation and Enhancement | X | |
– | CAE-AST | Audio Synthesis Transform | X | |
– | CAE-AMX | Audio Descriptor Multiplexing | X | |
– | OSD-VSD | Visual Scene Description | X | |
– | OSD-AVA | Audio-Visual Alignment | X | |
– | MMC-ASR | Automatic Speech Recognition | X | |
– | MMC-NLU | Natural Language Understanding | X | |
– | MMC-PSE | Personal Status Extraction | X | |
– | MMC-ITD | Input Text Description | X | |
– | MMC-ISD | Input Speech Description | X | |
– | PAF-IFD | Input Face Description | X | |
– | PAF-IBD | Input Body Description | X | |
– | MMC-PTI | PS-Text Interpretation | X | |
– | MMC-PSI | PS-Speech Interpretation | X | |
– | PAF-PFI | PS-Face Interpretation | X | |
– | PAF-PGI | PS-Gesture Interpretation | X | |
– | MMC-PMX | Personal Status Multiplexing | X |
Avatar Videoconference Server
AIW and AIMs | Name | JSON | |
PAF-AVS | Avatar Videoconference Server | X | |
– | PAF-PDX | Portable Avatar Demultiplexing | X |
– | MMC-TST | Text and Speech Translation | X |
– | PAF-SPA | Service Participant Authentication | X |
– | PAF-PMX | Portable Avatar Multiplexing | X |
Virtual Meeting Secretary
AIW and AIMs | Name and AIW/AIM Specification | JSON | ||
MMC-VMS | Virtual Meeting Secretary | X | ||
– | PAF-PDX | Portable Avatar Demultiplexing | X | |
– | MMC-ASR | Automatic Speech Recognition | X | |
– | MMC-NLU | Natural Language Understanding | X | |
– | MMC-PSE | Personal Status Extraction | X | |
– | MMC-ITD | Input Text Description | X | |
– | MMC-ISD | Input Speech Description | X | |
– | PAF-IFD | Input Face Description | X | |
– | PAF-IBD | Input Body Description | X | |
– | MMC-PTI | PS-Text Interpretation | X | |
– | MMC-PSI | PS-Speech Interpretation | X | |
– | PAF-PFI | PS-Face Interpretation | X | |
– | PAF-PGI | PS-Gesture Interpretation | X | |
– | MMC-PMX | Personal Status Multiplexing | X | |
– | MMC-SCM | Summary Creation Module | X | |
– | MMC-EDP | Entity Dialogue Processing | X | |
– | PAF-PSD | Personal Status Display | X | |
– | MMC-TTS | Text-to-Speech | X | |
– | PAF-IFD | Input Face Description | X | |
– | PAF-IBD | Input Body Description | X | |
– | PAF-PMX | Portable Avatar Multiplexing | X |
Videoconference Client Receiver
AIW and AIMs | Name and AIW/AIM Specification | JSON | |
PAF-CRX | Videoconference Client Receiver | X | |
– | PAF-PDX | Portable Avatar Demultiplexing | X |
– | PAF-VSC | Visual Scene Creation | X |
– | PAF-ASC | Audio Scene Creation | X |
– | PAF-AVR | Audio-Visual Scene Rendering | X |