My first blog: From CV to Agentic RL
Published:
This is my first blog. I’m trying to use this way to study latest reseach and industrial work as much as I can. My current research interests include agentic RL. Since most of my time has been devoted to CV in the last two years, it is a challenge. But I will take it.
First, I want to explore ‘what’. What is agent? What is RL? what is Agentic RL? What is their relationship?



To put it simple, an agent is an entity which can use tools and take actions to achieve its goals in an environment. A more formal definition can be seen in [3]. RL is the methodology to train agents. Agentic RL focuses on how RL empowers LLM-based agents in dynamic environments[2]. As shown in Figure 4, I draw a diagram to describe their relationships in an intuitive way.

In the next blog, I will dive into their components.
References
- Weng, Lilian. (Jun 2023). “LLM-powered Autonomous Agents”. Lil’Log. https://lilianweng.github.io/posts/2023-06-23-agent/.
- Zhang, Guibin, et al. "The Landscape of Agentic Reinforcement Learning for LLMs: A Survey." arXiv preprint arXiv:2509.02547 (2025).
- Wang, Hongru, et al. "Toward a Theory of Agents as Tool-Use Decision-Makers." arXiv preprint arXiv:2506.00886 (2025).
