米哈游Agent 算法工程师 - Varsapura
社招全职2年以上程序&技术类地点:上海 | 北京状态:招聘
任职要求
1)硕士及以上学历,计算机科学、人工智能、机器学习、自然语言处理或相关专业 2)2 年以上大模型应用、Agent 系统、NLP 算法或强化学习相关经验,有完整项目研发和落地经验 3)熟悉 LLM / VLM / 多模态模型的基础原理及应用方式,理解 Agent 系统中的规划、记忆、工具调用、上下文管理与多步推理等关键问题 4)熟练使用 PyTorch 及主流大模型训练/推理框架与工具链,如 Transformers、DeepSpeed、Megatron-LM、VeRL、vLLM、SGLang 等,具备较强的工程实现能力 5)具备 Agent 方向的实际研发经验,熟悉 ReAct、Function Calling、RAG、Memory、Reflection、Planning、Multi-Agent 等常见范式,能够独立设计并实现复杂 Agent 工作流 6)具备扎实的强化学习或对齐基础,理解 SFT、DPO、RLHF、RLAIF、PPO、GRPO 等方法原理,有将相关方法应用于大模型行为优化或 Agent 系统优化的实践经验 7)具备良好的系统设计与问题分…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1)Agent 能力研发:面向 AI Native 游戏场景,负责智能体(Agent)核心能力研发与优化,覆盖智能 AI NPC、AI 叙事、AI 玩法等方向,构建具备多轮对话、任务规划、工具调用、环境交互、长期记忆与自主决策能力的 Agent 系统 2)Agent 架构设计:设计并实现游戏场景下的 Agent 核心架构,包括 Planning、Memory、Tool Use、Action、Reflection、Persona、State Tracking 等模块,提升智能体在复杂动态环境中的稳定性、一致性与可控性 3)训练与对齐优化:结合业务需求,参与 Agent 相关模型与策略优化,包括 SFT、DPO、RLHF/RLAIF、PPO/GRPO 等方法,提升智能体在角色一致性、任务完成率、对话连贯性、行为合理性和安全性等维度的表现 4)记忆与数据体系建设:构建适用于游戏场景的 Agent Memory 与数据闭环体系,支持 NPC 对玩家历史行为、剧情进展、任务状态、角色关系和世界知识的长期记忆与高效调用,并持续优化训练数据与交互数据质量 5)工具调用与环境交互:建设 Agent 的工具与动作能力,使其能够可靠调用游戏内外部系统能力,如任务系统、剧情系统、检索系统、脚本/代码执行、UI/Browser 自动化等,提升 Agent 在真实业务场景中的执行能力 6)评测体系与系统优化:建立面向 Agent 的评测体系,围绕任务完成、角色设定一致性、叙事合理性、工具调用成功率、长期记忆效果、安全性等维度设计 Eval、自动化测试与分析机制,推动模型与系统持续迭代 7)多 Agent 与前沿探索:探索 Multi-Agent、GUI Agent、Browser Agent、World Model、MCP 等前沿方向在游戏中的应用,与产品、策划、工程团队协同推进 Agent 能力的落地与创新
包括英文材料
学历+
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
vLLM+
https://www.newline.co/@zaoyang/ultimate-guide-to-vllm--aad8b65d
vLLM is a framework designed to make large language models faster, more efficient, and better suited for production environments.
https://www.youtube.com/watch?v=Ju2FrqIrdx0
vLLM is a cutting-edge serving engine designed for large language models (LLMs), offering unparalleled performance and efficiency for AI-driven applications.
SGLang+
[英文] Install SGLang
https://docs.sglang.ai/get_started/install.html
SGLang is a fast serving framework for large language models and vision language models.
https://github.com/sgl-project/sgl-learning-materials
还有更多 •••