🍊 Latent Atlas 🍉

标签: post-training

此标签下有15条笔记。

2026年3月01日
Post-training
- training
- post-training
2026年3月08日
Knowledge Distillation
- post-training
- distillation
2026年3月08日
Logits Distillation
2026年3月08日
Offline KD
2026年3月08日
On-policy KD
2026年3月08日
Sequence-level Distillation
2026年3月07日
DPO
2026年3月07日
GRPO
2026年3月07日
PPO
2026年3月07日
Rejection Sampling
2026年3月07日
RLHF
2026年3月01日
Chat Template
- post-training
- chat-template
2026年3月01日
Instruction Tuning
- post-training
- instruction-tuning
2026年3月01日
Reward Model
2026年3月01日
SFT

🍊 Latent Atlas 🍉 · An AI knowledge atlas built with Quartz © 2026