AWS Certified AI Practitioner (AIF-C01) — Question 198

An education company waftion. The application will give users the ability to enter text or provide a picture of a question. The application will respond with a written answer and an explanation of the written answer.

Which model type meets these requirements?

Answer options

Correct answer: B

Explanation

The correct answer is B, the Large multi-modal language model, as it can process both text and images to generate comprehensive responses. Option A, the Computer vision model, focuses solely on image processing and does not handle text input well. Option C, the Diffusion model, is not relevant to question answering, and Option D, the Text-to-speech model, is designed for converting text to spoken words rather than generating answers.