MPAI-PAF V1.3 Introduction

(Informative)

There is a long history of computer-created objects called “digital humans”, i.e., digital objects having a human appearance when rendered. In most cases the underlying assumption of these objects has been that creation, animation, and rendering is done in a closed environment. Such digital humans had little or no need for standards.

In a communication and more so in a metaverse context, there are many cases where a digital human is not constrained within a closed environment thus requiring forms of standardisation. Technical Specification: Portable Avatar Format (MPAI-PAF) V1.4 – in the following also called MPAI-PAF V1.4 or MPAI-PAF – is a response to the requirements of new usage contexts. MPAI-PAF specifies a standard for Portable Avatar Format (PAF) enabling a receiving party to render a digital human as intended by the sending party.

MPAI-PAF V1.4 specifies the Avatar-Based Videoconference (PAF-ABV) AI Workflow where:

Client Transmitters send PAFs containing:
- At the beginning: Avatar Models, Language Selector, and Speech Object and Face Object for participant authentication.
- Continuously: Avatar Descriptors, and Speech Objects to a Server.

Avatar Videoconference Server:
- At the beginning:
  - Selects an Environment, i.e., a meeting room and equips it with objects, i.e., meeting table and chairs.
  - Places Avatar Models around the table.
  - Distributes for each participant a PAF containing Environment, Avatar Models, and their positions to all receiving clients.
- Continuously sends to receiving clients:
  - Translated Speech from participants according to Language Selectors.
  - Sends PAFs containing Avatar Descriptors and translated Speech.

Client Receivers:
- At the beginning: receive Environment and PAFs containing Avatar Models and Language Selectors from the server.
- Continuously from the server:
  - Receive PAFs containing Avatar Descriptors and translated Speech.
  - Create Audio and Visual Scene Descriptors.
  - Render the Audio-Visual Scene as seen from the human participant-selected Point of View.

In all Chapters and Sections, Terms beginning with a capital letter are defined in Table 1 if they are specific to this Technical Specification. All MPAI-defined Terms are accessible online. All Chapters, and Sections are Normative unless they are labelled as Informative.

<-Foreword Go to ToC Scope->

Cookie	Duration	Description
cookielawinfo-checkbox-necessary	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Technical".
CookieLawInfoConsent	1 year	The cookie is set by the GDPR Cookie Consent plug-in and is used to store whether the user has consented to the use of cookies or not. It does not store any personal data.
viewed_cookie_policy	1 year	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
_pk_id.6.08a8	13 months	Used to store a few details about the user such as the unique visitor ID
_pk_ses.6.08a8	30 minutes	Short lived cookies used to temporarily store data for the visit

MPAI-PAF V1.3 Introduction

(Informative)

Notice