🍊 Latent Atlas 🍉

❯

❯

Multimodal

2026年2月15日1分钟阅读

多模态架构模块负责整理视觉-语言模型和多模态对齐结构，包括 CLIP、LLaVA、Qwen-VL 等模型路线。

Notes

Vision-Language Model
CLIP
LLaVA
Qwen-VL
Multimodal Projector

此文件夹下有5条笔记。

2026年2月21日
LLaVA
- multimodal
- llava
2026年2月21日
Qwen-VL
- multimodal
- qwen-vl
2026年2月15日
CLIP
- multimodal
- clip
2026年2月15日
Multimodal Projector
- multimodal
- projector
2026年2月15日
Vision-Language Model
- multimodal
- vlm

🍊 Latent Atlas 🍉 · An AI knowledge atlas built with Quartz © 2026