Mock Worlds, Real Skills: Building Small Agentic Language Models with Synthetic Tasks, Simulated Environments, and Rubric-Based Rewards | DocHero AI - 专业免费润色翻译工具,助您快速准确翻译英文学术论文