🍊 Latent Atlas 🍉

Tag: chain-of-thought

1 note with this tag.

  • May 28, 2026

    RLP: Reinforcement as a Pretraining Objective

    • source
    • paper
    • pretraining
    • reinforcement-learning
    • reasoning
    • chain-of-thought

🍊 Latent Atlas 🍉 · An AI knowledge atlas built with Quartz © 2026