🍊 Latent Atlas 🍉

标签: source

此标签下有66条笔记。

2026年6月02日
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation
2026年6月02日
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression
2026年6月02日
Repoformer: Selective Retrieval for Repository-Level Code Completion
2026年6月02日
CodeRAG: Finding Relevant and Necessary Knowledge for Retrieval-Augmented Repository-Level Code Completion
2026年6月02日
CodePromptZip: Code-specific Prompt Compression for Retrieval-Augmented Generation in Coding Tasks with LMs
2026年6月02日
Squeez: Task-Conditioned Tool-Output Pruning for Coding Agents
2026年6月01日
Deep Residual Learning for Image Recognition
2026年6月01日
Layer Normalization
2026年6月01日
Attention Is All You Need
2026年6月01日
Outrageously Large Neural Networks
2026年6月01日
Fast Transformer Decoding
2026年6月01日
Root Mean Square Layer Normalization
2026年6月01日
Big Bird
2026年6月01日
GLU Variants Improve Transformer
2026年6月01日
GShard
2026年6月01日
Language Models are Few-Shot Learners
2026年6月01日
Longformer
2026年6月01日
ALiBi
2026年6月01日
RoFormer
2026年6月01日
Switch Transformer
2026年6月01日
Grouped-Query Attention
2026年6月01日
DeepSeek-V2
2026年6月01日
DeepSeek-V3
2026年6月01日
DeepSeekMoE
2026年5月31日
Neural Machine Translation of Rare Words with Subword Units
2026年5月31日
Training Deep Nets with Sublinear Memory Cost
2026年5月31日
GPipe
2026年5月31日
SentencePiece
2026年5月31日
Megatron-LM
2026年5月31日
T5
2026年5月31日
ZeRO
2026年5月31日
Scaling Laws for Neural Language Models
2026年5月31日
Deduplicating Training Data Makes Language Models Better
2026年5月31日
Documenting Large Webtext Corpora
2026年5月31日
LoRA
2026年5月31日
The Pile
2026年5月31日
ZeRO-Infinity
2026年5月31日
Training Compute-Optimal Large Language Models
2026年5月31日
A Pretrainer's Guide to Training Data
2026年5月31日
LongLoRA
2026年5月31日
Position Interpolation
2026年5月31日
QLoRA
2026年5月31日
RefinedWeb
2026年5月31日
ROOTS
2026年5月31日
YaRN
2026年5月31日
DataComp-LM
2026年5月31日
Dolma
2026年5月31日
FineWeb
2026年5月31日
LongRoPE
2026年5月29日
Sequence-Level Knowledge Distillation
2026年5月29日
Deep Reinforcement Learning from Human Preferences
2026年5月29日
Proximal Policy Optimization Algorithms
2026年5月29日
Learning to summarize from human feedback
2026年5月29日
Finetuned Language Models Are Zero-Shot Learners
2026年5月29日
Multitask Prompted Training Enables Zero-Shot Task Generalization
2026年5月29日
Constitutional AI
2026年5月29日
Training language models to follow instructions with human feedback
2026年5月29日
Self-Instruct
2026年5月29日
Distilling Step-by-Step
2026年5月29日
Direct Preference Optimization
2026年5月29日
DeepSeekMath
2026年5月29日
SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents
2026年5月28日
Meta Llama 4 Multimodal Intelligence
2026年5月28日
LongCodeZip: Compress Long Context for Code Language Models
2026年5月28日
RLP: Reinforcement as a Pretraining Objective
2026年5月28日
DeepSeek V4 Technical Documentation

🍊 Latent Atlas 🍉 · An AI knowledge atlas built with Quartz © 2026