<- Avatar-Based Videoconference– Go to ToC Data Types->
AI Modules (AIM) are interconnected processing units composing an AI Workflow (AW). There are two types of AIM:
- Composite AIMs are AI Modules composed of multiple AI Modules.
- Basic AIMs do not include (or do not expose) AIMs inside.
This Chapter specifies the Composite AIMs of Table 1 .
Table 1 – Composite AIMs of MPAI-PAF
| Acronym | AIM Name | Versions |
|
| PAF-PSD | Personal Status Display | V1.0 | |
This Chapter also specifies the Basic AIMs of Table 2.
Table 2 – Basic AIMs of MPAI PAF- Note that V1.1 is currently not approved.
| Acronym | AIM Name |
| PAF-AVR | Audio-Visual Scene Rendering |
| PAF-FIR | Face Identity Recognition |
| PAF-IBD | Input Body Description |
| PAF-IFD | Input Face Description |
| PAF-PFI | PS-Face Interpretation |
| PAF-PGI | PS-Gesture Interpretation |
| PAF-PMX | Portable Avatar Multiplexing |
| PAF-PSD | Personal Status Display |
This Chapter also provides the full set of AIWs, Composite AIMs, and AIMs used by the four Avatar-Based Videoconference subsystems.
Videoconference Client Transmitter
| AIW and AIMs | Name | JSON | ||
| PA-CTX | Videoconference Client Transmitter | X | ||
| – | OSD-AVS | Audio-Visual Scene Description | X | |
| – | CAE-ASD | Audio Scene Description | X | |
| – | CAE-AAT | Audio Analysis Transform | X | |
| – | CAE-ASL | Audio Source Localisation | X | |
| – | CAE-ASE | Audio Separation and Enhancement | X | |
| – | CAE-AST | Audio Synthesis Transform | X | |
| – | CAE-AMX | Audio Descriptor Multiplexing | X | |
| – | OSD-VSD | Visual Scene Description | X | |
| – | OSD-AVA | Audio-Visual Alignment | X | |
| – | MMC-ASR | Automatic Speech Recognition | X | |
| – | MMC-NLU | Natural Language Understanding | X | |
| – | MMC-PSE | Personal Status Extraction | X | |
| – | MMC-ITD | Input Text Description | X | |
| – | MMC-ISD | Input Speech Description | X | |
| – | PAF-IFD | Input Face Description | X | |
| – | PAF-IBD | Input Body Description | X | |
| – | MMC-PTI | PS-Text Interpretation | X | |
| – | MMC-PSI | PS-Speech Interpretation | X | |
| – | PAF-PFI | PS-Face Interpretation | X | |
| – | PAF-PGI | PS-Gesture Interpretation | X | |
| – | MMC-PMX | Personal Status Multiplexing | X | |
Avatar Videoconference Server
| AIW and AIMs | Name | JSON | |
| PAF-AVS | Avatar Videoconference Server | X | |
| – | PAF-PDX | Portable Avatar Demultiplexing | X |
| – | MMC-TST | Text and Speech Translation | X |
| – | PAF-SPA | Service Participant Authentication | X |
| – | PAF-PMX | Portable Avatar Multiplexing | X |
Virtual Meeting Secretary
| AIW and AIMs | Name and AIW/AIM Specification | JSON | ||
| MMC-VMS | Virtual Meeting Secretary | X | ||
| – | PAF-PDX | Portable Avatar Demultiplexing | X | |
| – | MMC-ASR | Automatic Speech Recognition | X | |
| – | MMC-NLU | Natural Language Understanding | X | |
| – | MMC-PSE | Personal Status Extraction | X | |
| – | MMC-ITD | Input Text Description | X | |
| – | MMC-ISD | Input Speech Description | X | |
| – | PAF-IFD | Input Face Description | X | |
| – | PAF-IBD | Input Body Description | X | |
| – | MMC-PTI | PS-Text Interpretation | X | |
| – | MMC-PSI | PS-Speech Interpretation | X | |
| – | PAF-PFI | PS-Face Interpretation | X | |
| – | PAF-PGI | PS-Gesture Interpretation | X | |
| – | MMC-PMX | Personal Status Multiplexing | X | |
| – | MMC-SCM | Summary Creation Module | X | |
| – | MMC-EDP | Entity Dialogue Processing | X | |
| – | PAF-PSD | Personal Status Display | X | |
| – | MMC-TTS | Text-to-Speech | X | |
| – | PAF-IFD | Input Face Description | X | |
| – | PAF-IBD | Input Body Description | X | |
| – | PAF-PMX | Portable Avatar Multiplexing | X | |
Videoconference Client Receiver
| AIW and AIMs | Name and AIW/AIM Specification | JSON | |
| PAF-CRX | Videoconference Client Receiver | X | |
| – | PAF-PDX | Portable Avatar Demultiplexing | X |
| – | PAF-VSC | Visual Scene Creation | X |
| – | PAF-ASC | Audio Scene Creation | X |
| – | PAF-AVR | Audio-Visual Scene Rendering | X |