My first blog: From CV to Agentic RL

1 minute read

Published: September 19, 2025

This is my first blog. I’m trying to use this way to study latest reseach and industrial work as much as I can. My current research interests include agentic RL. Since most of my time has been devoted to CV in the last two years, it is a challenge. But I will take it.

First, I want to explore ‘what’. What is agent? What is RL? what is Agentic RL? What is their relationship?

Agent system — **Figure 1.** What is agent? (from [1])

**Figure 2.** What is RL? (from ChatGPT)

Agentic RL Framework — **Figure 3.** What is Agentic RL? (from [2])

To put it simple, an agent is an entity which can use tools and take actions to achieve its goals in an environment. A more formal definition can be seen in [3]. RL is the methodology to train agents. Agentic RL focuses on how RL empowers LLM-based agents in dynamic environments[2]. As shown in Figure 4, I draw a diagram to describe their relationships in an intuitive way.

Relationship — **Figure 4.** What is their relationship?

In the next blog, I will dive into their components.

References

Weng, Lilian. (Jun 2023). “LLM-powered Autonomous Agents”. Lil’Log. https://lilianweng.github.io/posts/2023-06-23-agent/.
Zhang, Guibin, et al. "The Landscape of Agentic Reinforcement Learning for LLMs: A Survey." arXiv preprint arXiv:2509.02547 (2025).
Wang, Hongru, et al. "Toward a Theory of Agents as Tool-Use Decision-Makers." arXiv preprint arXiv:2506.00886 (2025).

Share on

Bluesky Facebook LinkedIn X (formerly Twitter)

Jialong Sun

Share on