05 IBM Build Multimodal Generative AI Applications
Private Course
Please sign in to contact responsible
| Responsible | Administrator |
|---|---|
| Last Update | 06/28/2026 |
| Completion Time | 1 hour 2 minutes |
| Members | 1 |
Advanced (RAG and Agentic AI)
-
Module 0112Lessons · 28 min
-
01 Course Introduction.mp4New
-
02 course overview.pdfNew
-
03 helpful tips.pngNew
-
04 Introduction to Multimodal AI.mp4New
-
05 What Is Multimodal Generative AI.pdfNew
-
06 What is Computer Vision.pdfNew
-
07 Text-to-Speech Technologies.mp4New
-
08 Speech-to-Text Technologies.mp4New
-
09 ext Processing, Speech Processing.pdfNew
-
10 Challenges in Multimodal AI Integration.pdfNew
-
11 cheatsheet.pdfNew
-
11 SummaryNew
-
-
Module 025Lessons · 15 min
-
01 Understanding Image Captioning with Meta's Llama.mp4New
-
02 Text-to-Video and Image-to-Video.pdfNew
-
03 Text-to-Video Generation with OpenAI's Sora.mp4New
-
04 Applications of Multimodal Vision Models in Real World.pdfNew
-
05 cheatsheet.pdfNew
-
-
Module 035Lessons · 19 min
-
01 Introduction to Multimodal Retrieval-Augmented Generation (MM-RAG).mp4New
-
02 Multimodal Chatbots and QA Systems.mp4New
-
03 cheatsheet.pdfNew
-
04 Course Wrap-up.mp4New
-
05 congratulation.pdfNew
-