Phase 066: Multimodal AI

Phase 066 of the AI Encyclopedia — Multimodal AI. Topics 1301–1320.

Part of the AI Encyclopedia · Phase 066 of 130 · Topics 1301–1320

This phase covers Multimodal AI. Below are the 20 concepts grouped under this phase — each is a future article in the Insightful AI World encyclopedia.

1301 Multimodal Learning
1302 Cross-modal Representation
1303 Image-text Alignment
1304 Audio-text Alignment
1305 Video-text Alignment
1306 Multimodal Embeddings
1307 Contrastive Multimodal Learning
1308 CLIP-style Models
1309 Vision-Language Models
1310 Audio-Language Models
1311 Video-Language Models
1312 Multimodal Transformers
1313 Multimodal Instruction Tuning
1314 Visual Question Answering
1315 Image-grounded Dialogue
1316 Multimodal Retrieval
1317 Multimodal Reasoning
1318 Multimodal Agents
1319 Multimodal Evaluation
1320 Multimodal Safety