🍊 Latent Atlas 🍉

compact Context Pipeline

Thu, 04 Jun 2026 00:00:00 GMT

这篇笔记整理 Claude Code /compact 的源码阅读结果。它关注的不是“怎么使用 /compact”，而是产品级 coding agent 如何在上下文接近饱和时，用一组预算、缓存、历史折叠、状态恢复和 LLM summary 机制维持会话可继续性。输入与来源参考入口：小红书 Claude Code compact 相关笔记。本地读码材料：agent-compression-related 中的 Claude Code compact 读码内容。相关主题页：Agent Compression。来源边界本文的机制描述以本地源码阅读材料为准，外部链接作为选题参考入口。核...

Claude Code

Thu, 04 Jun 2026 00:00:00 GMT

Claude Code 目录收集 Claude Code 的产品机制、源码阅读和工程行为分析，重点放在 code agent 的上下文、工具、记忆和执行链路。 Notes compact Context Pipeline .

Agent Compression

Tue, 02 Jun 2026 00:00:00 GMT

Agent Compression 指在 agentic system 中，对上下文、工具输出、检索结果、会话历史和中间状态进行选择、裁剪、摘要或结构化保留的机制。它的目标不是简单减少 token，而是在有限 context window、有限预算和长程任务中，让 agent 持续看到对下一步行动真正有用的信息。在 code agent 场景中，这个问题尤其突出。Agent 会反复读取文件、搜索仓库、运行测试、查看日志、修改代码，再把所有 observation 写回对话历史。如果不加控制，history 很快会被大段源码、grep 输出、traceback、diff 和重复日志填满。上下文...

RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation

Tue, 02 Jun 2026 00:00:00 GMT

RepoCoder RepoCoder 这篇论文的核心想法是：repository-level code completion 不能只看当前文件，也不能只做一次普通 RAG；更有效的做法是先让模型生成一个草稿 completion，再用这个草稿反过来改进检索 query，最后重新生成。如果只记一句话：RepoCoder 是一种 iterative retrieval-generation 框架。它把代码模型第一次生成出来的“未必正确但长得像目标代码”的预测，当作下一轮检索的 query expansion，用来从仓库里找更接近目标 completion 的代码片段。这篇论文对 coding...

LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression

Tue, 02 Jun 2026 00:00:00 GMT

LongLLMLingua LongLLMLingua 的 motivation 是：长上下文不只是贵，还会因为噪声和位置偏置让模型表现变差；所以 prompt compression 的目标不应只是少 token，而是让 key information density 更高，并把关键信息放到模型更容易利用的位置。如果只记一句话：LongLLMLingua 是一个面向长上下文的 question-aware coarse-to-fine prompt compression 方法。它用一个小语言模型先判断哪些文档/段落更可能回答当前 question，再在这些内容内部做 token-leve...

Repoformer: Selective Retrieval for Repository-Level Code Completion

Tue, 02 Jun 2026 00:00:00 GMT

Repoformer Repoformer 这篇论文的 motivation 是：不是每次 repository-level code completion 都该检索，所以要让模型自己判断要不要检索。如果只记一句话：Repoformer 是一种 self-selective RAG 框架。它把“是否需要跨文件上下文”做成代码模型自己的一个预测行为：模型看到当前文件的 left/right context 后，先预测一个特殊 token ；如果概率足够高，就触发 cross-file retrieval，否则直接做 fill-in-the-middl...

CodeRAG: Finding Relevant and Necessary Knowledge for Retrieval-Augmented Repository-Level Code Completion

Tue, 02 Jun 2026 00:00:00 GMT

CodeRAG CodeRAG 这篇论文的 motivation 很直观：repository-level code completion 里的 RAG 不能只问“哪些代码相关”，还要问“哪些代码对当前 code LLM 真正好用”。它把问题拆成三问：怎么去问？当前 unfinished code 不一定是好 query，最后几行也不一定包含真正线索。怎么去搜？单一路径 retrieval 只能覆盖一种相关性，代码任务同时需要 lexical、semantic 和 dependency 线索。相关是否等于好用？ retriever 觉得相关的代码，不一定能帮助最终 code LLM ...

CodePromptZip: Code-specific Prompt Compression for Retrieval-Augmented Generation in Coding Tasks with LMs

Tue, 02 Jun 2026 00:00:00 GMT

CodePromptZip CodePromptZip 这篇论文的核心想法是：RAG-based coding task 里的 retrieved code examples 很长，但不是所有 code token 都同等重要；与其用通用 prompt compression 方法压自然语言 prompt，不如训练一个 code-specific compressor，专门压缩检索回来的代码示例。如果只记一句话：CodePromptZip 是一种 type-aware + task-aware + copy-enhanced 的代码示例压缩框架。它先用程序分析工具把代码 token 分成不同...

Squeez: Task-Conditioned Tool-Output Pruning for Coding Agents

Tue, 02 Jun 2026 00:00:00 GMT

Squeez 论文定位 Squeez 这篇论文讨论的是 coding agent 中一个很具体的压缩问题：当 agent 调用工具后拿到一段很长的 tool observation，如何根据当前 query 抽出下一步最值得看的最小证据块。我理解这篇论文的主要贡献是：它把 coding agent 的复杂多轮上下文压缩问题，简化成一个单轮、可监督、可评测的 evidence extraction 任务。基本信息字段内容来源arXiv:2604.04979v1作者/机构Adam Kovacs, KR Labs日期2026-04-04链接/代码arxiv.org/abs/2604.04979...

Deep Residual Learning for Image Recognition

Mon, 01 Jun 2026 00:00:00 GMT

基本信息 Title: Deep Residual Learning for Image Recognition Source type: paper Related topic notes: Residual in Transformer, Residual Connection TODO 阅读论文原文，整理 residual connection 如何缓解深层网络优化困难。回填 identity shortcut、residual mapping 和深层可训练性的基本思想。对照 Transformer 中 Pre-Norm residual path，区分视觉 ResNet 结构与 T...