Multimodal Question Answering (MMC-MQA)

V2.1

Multimodal Question Answering (MMC-MQA):

Receives
1. Selector – communicating use of Text or Speech.
2. Input Text – replacing Speech, where appropriate.
3. Input Visual – the object for which a question is asked
4. Input Speech – the question asked.
Produces Text or Speech conveying the answer.

Figure 1 depicts the Reference Module of the Multimodal Question Answering AIW.

Figure 1 – The Multimodal Question Answering AIW

Table 1 specifies the Input and Output Data of the Multimodal Question Answering AIW.

Table 1 – I/O Data of Multimodal Question Answering

Input	Descriptions
Input Text	Text typed by the human as a replacement for Input Speech.
Input Selector	Data determining the use of Speech or Text.
Input Visual	Video of the human showing an object held in hand.
Input Speech	Speech of the human asking a question the Machine.
Output	Descriptions
Machine Text	The Text generated by Machine in response to human input.
Machine Speech	The Speech generated by Machine in response to human input.

Cookie	Duration	Description
cookielawinfo-checkbox-necessary	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Technical".
CookieLawInfoConsent	1 year	The cookie is set by the GDPR Cookie Consent plug-in and is used to store whether the user has consented to the use of cookies or not. It does not store any personal data.
viewed_cookie_policy	1 year	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
_pk_id.6.08a8	13 months	Used to store a few details about the user such as the unique visitor ID
_pk_ses.6.08a8	30 minutes	Short lived cookies used to temporarily store data for the visit