1     Functions of Personal Status Display

2     Reference Model of Personal Status Display

3     I/O Data of Personal Status Display

4     Functions of AI Modules of Personal Status Display

5     I/O Data of AI Modules of Personal Status Display

6     AIMs and JSOM Metadata of of Personal Status Display

1      Functions of Personal Status Display

Personal Status Display (PSD) is a Composite AIM receiving Text, Personal Status, Speech Model and Avatar Model and generating the Personal Avatar with Face and Body Descriptors of an Avatar producing Text and uttering Speech with the intended Personal Status while the avatar’s Face and Gesture show the intended Personal Status. The Personal Status driving the avatar can be extracted from a human or can be synthetically generated by a machine as a result of its conversation with a human or another avatar. Several Use Cases from MPAI-MMC and other MPAI standards use this Composite AIM as a replacement for the combination of the AIMs depicted in Figure 1.

2      Reference Model of Personal Status Display

Figure 1 depicts the AIMs required to implement Personal Status Display.

Figure 1 – Reference Model of Personal Status Display

The Personal Status Display operates as follows:

  1. Avatar ID is the ID of the Portable Avatar.
  2. Machine Text is synthesised as Speech using the Personal Status provided by PS-Speech using a Speech Model.
  3. Machine Speech and PS-Face are used to produce the Machine Face Descriptors.
  4. PS-Gesture and Text are used for Machine Body Descriptors using the Avatar Model.
  5. Portable Avatar Multiplexing produces the Portable Avatar.

3      I/O Data of Personal Status Display of Personal Status Display

Table 1 gives the Input/Output Data of Personal Status Display.

Table 1 – I/O Data of Personal Status Display

Input data From Description
Avatar ID Upstream AIM Portable Avatar’s ID
Avatar Model From upstream AIM or embedded Part of Portable Avatar
Text Object Keyboard or upstream AIM Texts accompanying Portable Avatar
NN Format From upstream AIM 0=NNEF
1=ONNX
Speech Model From upstream AIM Neural Network Speech Model
PS-Speech Personal Status Extraction or Machine To synthesise Speech
PS-Face Personal Status Extractor or Machine To describe Face
PS-Gesture Personal Status Extractor or Machine To describe Body
Output data To Description
Portable Avatar Downstream AIM or renderer As Portable Avatar

4      Functions of AI Modules of Personal Status Display

Table 2 gives the functions of the AIMs.

Table 2 – Functions of AI Modules of Personal Status Extraction

AIM Functions
Text To Speech Synthesises Text with Personal Status.
Input Face Description Produces the Machine Face Descriptors with Personal Status.
Input Body Description Produces the Machine Body Descriptors with Personal Status.
Portable Avatar Multiplexing Multiplexes Data into Portable Avatar with Personal Status.

5      I/O Data of AI Modules of Personal Status Display

Table 3 gives the list of AIMs with their functions.

Table 3 – AI Modules of Personal Status Extraction

AIM Receives Produces
Text To Speech Text
NNFormat
Speech Model
PS-Speech
Machine Speech
Input Face Description Avatar Model
Machine Speech and PS-Face
Machine Face Descriptors
Gesture Description Avatar Model
Text and Machine PS-Gesture
Machine Body Descriptors
Portable Avatar Multiplexing Avatar ID
Machine Text
Machine Speech
Machine Body Descriptors
Machine Face Descriptors
Machine Avatar
Avatar Model
NNFormat
Speech Model
Portable Avatar

6      Specification and JSON Metadata of Composite AIM and its AIMs

The AIMs in italic are not fimal.

Table 6 – Acronyms and URs of JSON Metadata

AIMs Name and Specification JSON
PAF-PSD Personal Status Display X
MMC-TTS Text-to-Speech X
PAF-IFD Input Face Description X
PAF-IBD Input Body Description X
PAF-PMX Portable Avatar Multiplexing X