Repository: pat-jj/Awesome-Adaptation-of-Agentic-AI Branch: main Commit: 8c37381e71e1 Files: 3 Total size: 39.1 KB Directory structure: gitextract_2we7iq65/ ├── .gitignore ├── LICENSE └── README.md ================================================ FILE CONTENTS ================================================ ================================================ FILE: .gitignore ================================================ .history ================================================ FILE: LICENSE ================================================ © 2025 The Authors. All rights reserved. This repository contains original research work by the listed authors. Unauthorized submission of this work (in whole or in part) to academic journals or conferences is strictly prohibited and will be considered academic misconduct. Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International Public License By exercising the Licensed Rights (defined below), You accept and agree to be bound by the terms and conditions of this Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International Public License ("Public License"). You are granted the rights to: 1. Share — copy and redistribute the material in any medium or format for noncommercial purposes only, and only if unchanged and used in whole. Under the following terms: 2. Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use. 3. NonCommercial — You may not use the material for commercial purposes. 4. NoDerivatives — If you remix, transform, or build upon the material, you may not distribute the modified material. 5. No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits. Notices: You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation. No warranties are given. The license may not give you all of the permissions necessary for your intended use. Other rights such as publicity, privacy, or moral rights may limit how you use the material. Full license text: https://creativecommons.org/licenses/by-nc-nd/4.0/legalcode ================================================ FILE: README.md ================================================ # Awesome Adaptation of Agentic AI [![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome) [![Stars](https://img.shields.io/github/stars/pat-jj/Awesome-Adaptation-of-Agentic-AI?style=social)](https://img.shields.io/github/stars/pat-jj/Awesome-Adaptation-of-Agentic-AI?style=social) [![License: CC BY-NC-ND 4.0](https://img.shields.io/badge/License-CC--BY--NC--ND%204.0-blue.svg)](https://creativecommons.org/licenses/by-nc-nd/4.0/) [![PRWelcome](https://img.shields.io/badge/PRs-Welcome-red)](https://img.shields.io/badge/PRs-Welcome-red) [![arXiv](https://img.shields.io/badge/arXiv-2512.16301-b31b1b.svg)](https://arxiv.org/abs/2512.16301)

A curated list of papers on adaptation strategies of agentic AI systems. This repository accompanies the paper "Adaptation of Agentic AI" (Ongoing Work). **Cite this paper:** ``` @article{jiang2025adaptation, title={Adaptation of Agentic AI}, author={Jiang, Pengcheng and Lin, Jiacheng and Shi, Zhiyi and Wang, Zifeng and He, Luxi and Wu, Yichen and Zhong, Ming and Song, Peiyang and Zhang, Qizheng and Wang, Heng and others}, journal={arXiv preprint arXiv:2512.16301}, year={2025} } ``` ## Table of Contents - [Agent Adaptation](#agent-adaptation) - [A1: Tool Execution Signaled](#a1-tool-execution-signaled) - [A2: Agent Output Signaled](#a2-agent-output-signaled) - [Tool Adaptation](#tool-adaptation) - [T1: Agent-Agnostic Tool Adaptation](#t1-agent-agnostic-tool-adaptation) - [T2: Agent-Supervised Tool Adaptation](#t2-agent-supervised-tool-adaptation) --- ## Agent Adaptation ### A1: Tool Execution Signaled Agent Adaptation

Development Timeline:

#### RL-based Methods | Time | Method | Venue | Task(s) | Tool(s) | Agent Backbone | Tuning | |------|--------|-------|---------|---------|----------------|--------| | 2025.11 | Orion | arXiv

[Paper](https://arxiv.org/abs/2510.19817)

[Paper](https://arxiv.org/abs/2509.06493)

[Paper](https://arxiv.org/abs/2509.22644)

[Paper](https://arxiv.org/abs/2509.12867)

[Paper](https://arxiv.org/abs/2508.08791)

[Paper](https://arxiv.org/abs/2508.03613)

[Paper](https://arxiv.org/abs/2507.08649)

[Paper](https://arxiv.org/abs/2506.09033)

[Paper](https://arxiv.org/abs/2505.21668)

[Paper](https://arxiv.org/abs/2505.00024)

[Paper](https://arxiv.org/abs/2504.21801)

[Paper](https://arxiv.org/abs/2504.11354)

[Paper](https://arxiv.org/abs/2504.08600)

[Paper](https://openreview.net/forum?id=YBRU9MV2vE)

[Paper](https://arxiv.org/abs/2504.11001)

[Code](https://github.com/janhq/ReZero) | Web Search, IR | Web Search Engine | LLaMA3.2 | GRPO | | 2025.03 | Code-R1 | ---

[Paper](https://arxiv.org/abs/2503.00223)

[Paper](https://arxiv.org/abs/2408.08152)

[Paper](https://arxiv.org/abs/2405.18649) | Coding | Code Executor | StarCoder & CodeLlaMA | SFT, PPO | #### SFT & DPO Methods | Time | Method | Venue | Task(s) | Tool(s) | Agent Backbone | Tuning | |------|--------|-------|---------|---------|----------------|--------| | 2024.12 | AWL | ICML'25

[Paper](https://arxiv.org/abs/2411.00412)

[Code]([YOUR_GITHUB_LINK](https://github.com/Rose-STL-Lab/Adapting-While-Learning)) | Scientific Reasoning,
Adaptive Tool Usage | Scientific Simulators | Llama-3.1-8B,
Qwen-2.5-{14/32}B | SFT, DPO | | 2024.10 | LeReT | ICLR'25

[Paper](https://arxiv.org/abs/2410.23214)

[Paper](https://arxiv.org/abs/2405.16533)

[Paper](https://arxiv.org/abs/2402.11827)

[Paper](https://arxiv.org/abs/2402.01030)

[Paper](https://arxiv.org/abs/2307.16789)

[Paper](https://arxiv.org/abs/2306.05301)

[Paper](https://arxiv.org/abs/2305.15334)

[Paper](https://arxiv.org/abs/2305.13068)

[Paper](https://arxiv.org/abs/2302.04761)

[Code](https://github.com/conceptofmind/toolformer) | QA, Math | Calculator, QA system, Search Engine, Translation System, Calendar | GPT-J | SFT | ---

### A2: Agent Output Signaled Agent Adaptation

Development Timeline:

#### Methods with Tools | Time | Method | Venue | Task(s) | Tool(s) | Agent Backbone | Tuning | |------|--------|-------|---------|---------|----------------|--------| | 2025.10 | TT-SI | arXiv

[Paper](https://arxiv.org/abs/2510.12838)

[Paper](https://arxiv.org/abs/2509.01055)

[Paper](https://arxiv.org/abs/2508.14880)

[Paper](https://arxiv.org/abs/2508.03680)

[Paper](https://arxiv.org/abs/2507.17365)

[Paper](https://arxiv.org/abs/2506.20670)

[Paper](https://arxiv.org/abs/2505.15107)

[Paper](https://arxiv.org/abs/2505.04588)

[Paper](https://arxiv.org/abs/2505.11277)

[Paper](https://arxiv.org/abs/2504.11536)

[Paper](https://arxiv.org/abs/2504.13958)

[Paper](https://arxiv.org/abs/2504.03160)

[Paper](https://arxiv.org/abs/2503.19470)

[Paper](https://arxiv.org/abs/2503.09516)

[Paper](https://arxiv.org/abs/2503.05592)

[Paper](https://arxiv.org/abs/2502.10996)

[Paper](https://arxiv.org/abs/2501.11425)

[Paper](https://arxiv.org/abs/2406.01495)

[Paper](https://arxiv.org/abs/2406.14979)

[Paper](https://arxiv.org/abs/2310.11511)

[Paper](https://arxiv.org/abs/2310.05915)

[Code](https://fireact-agent.github.io) | QA | Search API | GPT3.5, LLaMA2, CodeLLaMA | SFT | #### Methods without Tools | Time | Method | Venue | Task(s) | Tool(s) | Agent Backbone | Tuning | |------|--------|-------|---------|---------|----------------|--------| | 2025.10 | Empower | arXiv

[Paper](https://arxiv.org/abs/2510.13709)

[Code](https://github.com/festusev/codegen_empowerment/tree/main) | Coding | --- | Gemma3 | SFT | | 2025.10 | KnowRL | arXiv

[Paper](https://arxiv.org/abs/2510.11407)

[Code](https://anonymous.4open.science/r/KnowRL-5BF0) | Knowledge calibration | --- | LLaMA3.1, Qwen2.5 | REINFORCE++ | | 2025.10 | GRACE | arXiv

[Paper](https://arxiv.org/abs/2510.04506)

[Code](https://github.com/GasolSun36/GRACE) | Embedding Tasks | --- | Qwen2.5, Qwen3, LLaMA3.2 | GRPO | | 2025.06 | Magistral | arXiv

[Paper](https://arxiv.org/abs/2506.10910) | Math, Coding | --- | Magistral | PPO, GRPO | | 2025.05 | EHRMind | arXiv

[Paper](https://arxiv.org/abs/2505.24105)

[Code](https://github.com/linjc16/EHRMind) | EHR-based Reasoning | --- | LLaMA3 | SFT, GRPO | | 2025.01 | Kimi k1.5 | arXiv

[Paper](https://arxiv.org/abs/2501.12948)

[Code](https://github.com/MoonshotAI/Kimi-k1.5) | Math, Coding | --- | Kimi k1.5 | GRPO | | 2025.01 | DeepSeek-R1-Zero (Math) | Nature

[Paper](https://arxiv.org/abs/2501.12948) | Math | --- | DeepSeek-V3 | GRPO | | 2024.09 | SCoRe | ICLR'25

[Paper](https://arxiv.org/abs/2409.12917)

[Code](https://github.com/BY571/SCoRe) | Math, Coding, QA | --- | Gemini1.0 Pro, Gemini1.5 Flash | REINFORCE | | 2024.07 | RISE | NeurIPS'24

[Paper](https://arxiv.org/abs/2407.18219)

[Code](https://github.com/cmu-mind/RISE) | Math | --- | LLaMA2, LLaMA3, Mistral | SFT | | 2024.06 | TextGrad | Nature

[Paper](https://arxiv.org/abs/2406.07496)

[Code](https://github.com/zou-group/textgrad) | Various Tasks | --- | GPT3.5, GPT4o | Prompt Tuning | | 2023.03 | Self-Refine | NeurIPS'23

[Paper](https://arxiv.org/abs/2303.17651)

[Code](https://github.com/madaan/self-refine) | Dialogue, Math, Coding | --- | GPT3.5, GPT4, CODEX | Test-Time Prompting | --- ## Tool Adaptation ### T1: Agent-Agnostic Tool Adaptation

#### Foundational Systems and Architectures | Year.Month | Method Name | Venue | Paper Name | |:-----------:|:-----------:|:-----------|:-----------| | 2021.08 | Neural Operators | JMLR'23

[Paper](https://arxiv.org/abs/2303.17580)

[Paper](https://arxiv.org/abs/2303.08128)

[Paper](https://arxiv.org/abs/2507.20280) | SciToolAgent: A Knowledge-Graph-Driven Scientific Agent for Multitool Integration | #### Categories and Training Methods | Year.Month | Method Name | Venue | Paper Name | |:-----------:|:-----------:|:-----------|:-----------| | 2021.01 | CLIP | ICML'21

[Paper](https://arxiv.org/abs/2103.00020)

[Paper](https://arxiv.org/abs/2304.02643)

[Paper](https://arxiv.org/abs/2212.04356)

[Paper](https://arxiv.org/abs/2402.01030)

[Paper](https://arxiv.org/abs/2004.04906)

[Paper](https://arxiv.org/abs/2004.12832)

[Paper](https://arxiv.org/abs/2112.09118)

[Paper](https://arxiv.org/abs/2212.03533)

[Paper](https://www.nature.com/articles/s41586-021-03819-2)

[Paper](https://www.science.org/doi/10.1126/science.ade2574) | Evolutionary-Scale Prediction of Atomic-Level Protein Structure with a Language Model | ---

### T2: Agent-Supervised Tool Adaptation

Development Timeline:

| Time | Method | Venue | Task(s) | Tool Backbone | Agent Backbone | Tuning | |------|--------|-------|---------|---------------|----------------|--------| | 2025.10 | QAgent | arXiv

[Paper](https://arxiv.org/abs/2510.08383)

[Paper](https://arxiv.org/abs/2510.05592)

[Paper](https://arxiv.org/abs/2510.02453)

[Paper](https://arxiv.org/abs/2510.15339)

[Paper](https://arxiv.org/abs/2510.23595)

[Paper](https://arxiv.org/abs/2509.25911)

[Paper](https://arxiv.org/abs/2508.16153)

[Paper](https://arxiv.org/abs/2508.05004)

[Paper](https://arxiv.org/abs/2505.14146)

[Paper](https://arxiv.org/abs/2410.20749)

[Paper](https://arxiv.org/abs/2406.18695)

[Paper](https://arxiv.org/abs/2405.03000)

[Paper](https://arxiv.org/abs/2403.18365)

[Paper](https://arxiv.org/abs/2402.13542)

[Paper](https://arxiv.org/abs/2402.12317)

[Paper](https://arxiv.org/abs/2402.08219)

[Paper](https://arxiv.org/abs/2401.08565)

[Paper](https://arxiv.org/abs/2307.07164)

[Paper](https://arxiv.org/abs/2305.17331)

[Paper](https://arxiv.org/abs/2305.11554)

[Paper](https://aclanthology.org/2023.emnlp-main.758/)

[Paper](https://aclanthology.org/2024.naacl-long.463.pdf)

[Code](https://github.com/swj0419/REPLUG) | QA | Contriever | GPT3-175B, PaLM, Codex, LLaMA-13B | Proxy-Tuning, LSR | --- ## Citation If you find this repository useful, please consider citing our survey: ``` @article{jiang2025adaptation, title={Adaptation of Agentic AI}, author={Jiang, Pengcheng and Lin, Jiacheng and Shi, Zhiyi and Wang, Zifeng and He, Luxi and Wu, Yichen and Zhong, Ming and Song, Peiyang and Zhang, Qizheng and Wang, Heng and others}, journal={arXiv preprint arXiv:2512.16301}, year={2025} } ``` ## Contributing We welcome contributions! Please feel free to submit a Pull Request to add new papers or update existing entries. ---

_{(ﾉ◕ヮ◕)ﾉ*:･ﾟ✧ Keep exploring the awesome world of agentic AI! ✧ﾟ･: *ヽ(◕ヮ◕ヽ)}