🍊 Latent Atlas 🍉

Papers

May 28, 2026 · 2 min read

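Paper reading analyses, recording each paper's research question, core method, experimental evidence, limitations, and the stable knowledge that can be folded back into the topic notes.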

Notes

Deep Residual Learning for Image Recognition
Sequence-Level Knowledge Distillation
Neural Machine Translation of Rare Words with Subword Units
Training Deep Nets with Sublinear Memory Cost
Layer Normalization
Deep Reinforcement Learning from Human Preferences
Proximal Policy Optimization Algorithms
Attention Is All You Need
Outrageously Large Neural Networks
GPipe
SentencePiece
Megatron-LM
Fast Transformer Decoding
Root Mean Square Layer Normalization
T5
ZeRO
Big Bird
GShard
GLU Variants Improve Transformer
Language Models are Few-Shot Learners
Longformer
Learning to summarize from human feedback
Scaling Laws for Neural Language Models
ALiBi
Deduplicating Training Data Makes Language Models Better
Documenting Large Webtext Corpora
The Pile
LoRA
RoFormer
Switch Transformer
ZeRO-Infinity
Finetuned Language Models Are Zero-Shot Learners
Multitask Prompted Training Enables Zero-Shot Task Generalization
Constitutional AI
Training language models to follow instructions with human feedback
Self-Instruct
Training Compute-Optimal Large Language Models
A Pretrainer’s Guide to Training Data
Distilling Step-by-Step
Direct Preference Optimization
Grouped-Query Attention
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation
LongLoRA
Position Interpolation
QLoRA
RefinedWeb
ROOTS
YaRN
DataComp-LM
DeepSeekMath
DeepSeek-V2
DeepSeek-V3
DeepSeekMoE
Dolma
FineWeb
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression
LongRoPE
Repoformer: Selective Retrieval for Repository-Level Code Completion
CodeRAG: Finding Relevant and Necessary Knowledge for Retrieval-Augmented Repository-Level Code Completion
LongCodeZip: Compress Long Context for Code Language Models
SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents
Squeez: Task-Conditioned Tool-Output Pruning for Coding Agents
CodePromptZip: Code-specific Prompt Compression for Retrieval-Augmented Generation in Coding Tasks with LMs
RLP: Reinforcement as a Pretraining Objective

64 notes in this folder.

June 2, 2026
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation
June 2, 2026
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression
June 2, 2026
Repoformer: Selective Retrieval for Repository-Level Code Completion
June 2, 2026
CodeRAG: Finding Relevant and Necessary Knowledge for Retrieval-Augmented Repository-Level Code Completion
June 2, 2026
CodePromptZip: Code-specific Prompt Compression for Retrieval-Augmented Generation in Coding Tasks with LMs
June 2, 2026
Squeez: Task-Conditioned Tool-Output Pruning for Coding Agents
June 1, 2026
Deep Residual Learning for Image Recognition
June 1, 2026
Layer Normalization
June 1, 2026
Attention Is All You Need
June 1, 2026
Outrageously Large Neural Networks
June 1, 2026
Fast Transformer Decoding
June 1, 2026
Root Mean Square Layer Normalization
June 1, 2026
Big Bird
June 1, 2026
GLU Variants Improve Transformer
June 1, 2026
GShard
June 1, 2026
Language Models are Few-Shot Learners
June 1, 2026
Longformer
June 1, 2026
ALiBi
June 1, 2026
RoFormer
June 1, 2026
Switch Transformer
June 1, 2026
Grouped-Query Attention
June 1, 2026
DeepSeek-V2
June 1, 2026
DeepSeek-V3
June 1, 2026
DeepSeekMoE
May 31, 2026
Neural Machine Translation of Rare Words with Subword Units
May 31, 2026
Training Deep Nets with Sublinear Memory Cost
May 31, 2026
GPipe
May 31, 2026
SentencePiece
May 31, 2026
Megatron-LM
May 31, 2026
T5
May 31, 2026
ZeRO
May 31, 2026
Scaling Laws for Neural Language Models
May 31, 2026
Deduplicating Training Data Makes Language Models Better
May 31, 2026
Documenting Large Webtext Corpora
May 31, 2026
LoRA
May 31, 2026
The Pile
May 31, 2026
ZeRO-Infinity
May 31, 2026
Training Compute-Optimal Large Language Models
May 31, 2026
A Pretrainer's Guide to Training Data
May 31, 2026
LongLoRA
May 31, 2026
Position Interpolation
May 31, 2026
QLoRA
May 31, 2026
RefinedWeb
May 31, 2026
ROOTS
May 31, 2026
YaRN
May 31, 2026
DataComp-LM
May 31, 2026
Dolma
May 31, 2026
FineWeb
May 31, 2026
LongRoPE
May 29, 2026
Sequence-Level Knowledge Distillation
May 29, 2026
Deep Reinforcement Learning from Human Preferences
May 29, 2026
Proximal Policy Optimization Algorithms
May 29, 2026
Learning to summarize from human feedback
May 29, 2026
Finetuned Language Models Are Zero-Shot Learners
May 29, 2026
Multitask Prompted Training Enables Zero-Shot Task Generalization
May 29, 2026
Constitutional AI
May 29, 2026
Training language models to follow instructions with human feedback
May 29, 2026
Self-Instruct
May 29, 2026
Distilling Step-by-Step
May 29, 2026
Direct Preference Optimization
May 29, 2026
DeepSeekMath
May 29, 2026
SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents
May 28, 2026
LongCodeZip: Compress Long Context for Code Language Models
May 28, 2026
RLP: Reinforcement as a Pretraining Objective

🍊 Latent Atlas 🍉 · An AI knowledge atlas built with Quartz © 2026