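NetEase LLM Algorithm Engineer (Agent Track)
Experienced hire · Full-time · Under 3 years of experience · NetEase Cloud Music · Location: Hangzhou · Status: Hiring
Requirements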
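1. Master's degree or above in computer science, artificial intelligence, electronic engineering, or a related field.
2. Solid theoretical grounding in machine learning and deep learning; familiar with Transformer and LLM training paradigms (SFT, RLHF, DPO, RAG, LoRA, etc.).
3. Proficient in Python and familiar with at least one mainstream framework (PyTorch/TensorFlow); experience using or developing Agent frameworks (LangChain, LlamaIndex, AutoGPT, etc.).
4. Familiar with system optimization and deployment workflows for large language models, including inference acceleration, distributed parallelism (Data/Model/Pipeline Parallel), Prompt Engineering, etc.
5. Strong engineering and problem-solving skills; able to quickly close the loop on agent task execution in real scenarios.
6. Good cross-team communication skills and an innovative mindset; able to carry work from algorithm to product along a complete path to deployment.
[Preferred qualifications]
1. Qwen (Tongyi…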
Responsibilities
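1. Research and implement core algorithms for AI Agents, building an autonomous agent system capable of multimodal understanding, planning, memory, tool calling, and task execution.
2. Contribute to enhancing and optimizing large language model (LLM) capabilities in multi-turn dialogue, task decomposition, knowledge retrieval, and action execution.
3. Design and implement the agent's Reasoning Framework, including context memory, fusion of short-term and long-term memory (STM/LTM), a Reflection Loop, and feedback-based self-iteration strategies.
4. Build the tool ecosystem for the Agent system, including Function Calling, Plugin Integration, and knowledge base retrieval (RAG), to improve the model's operability and interpretability.
5. Track cutting-edge LLM and Agent technology in the industry (such as OpenAI o1, Anthropic Claude, MCP, LangChain, AutoGPT, OpenDevin) and explore how to deploy it in social, content, recommendation, AIGC, and monetization scenarios.
6. Analyze and resolve technical bottlenecks in multi-turn reasoning, long-context memory, and tool-calling strategy optimization.
7. Work closely with product, platform, and algorithm teams to embed Agent capabilities into real business workflows and evolve them continuously through data feedback.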
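As an illustration of what items 3 and 4 refer to, the following is a minimal sketch of an agent loop that combines function calling, a short-term memory, and a reflection check. It is not taken from the posting or from any NetEase system: the tool names, `stub_policy`, `reflect`, and `run_agent` are all hypothetical, and the LLM call is replaced by a hard-coded stub so the example runs with the standard library only.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical tool registry: tool name -> plain Python function.
TOOLS: Dict[str, Callable[[str], str]] = {
    "search_tracks": lambda query: f"3 tracks found for '{query}'",
    "calculator": lambda expr: str(sum(int(x) for x in expr.split("+"))),
}


def stub_policy(task: str, memory: List[str]) -> Tuple[str, str]:
    """Stand-in for an LLM call: choose (tool_name, tool_argument)."""
    if any(ch.isdigit() for ch in task):
        return "calculator", task
    return "search_tracks", task


def reflect(observation: str) -> bool:
    """Toy reflection check: accept any non-empty observation."""
    return bool(observation.strip())


def run_agent(task: str, max_steps: int = 3) -> List[str]:
    """Run the loop and return the short-term memory (a step transcript)."""
    memory: List[str] = []
    for _ in range(max_steps):
        tool_name, arg = stub_policy(task, memory)
        try:
            observation = TOOLS[tool_name](arg)
        except Exception as exc:
            # Record the failure so a real policy could adapt on the next step.
            memory.append(f"error:{tool_name}:{exc}")
            continue
        memory.append(f"{tool_name}({arg!r}) -> {observation}")
        if reflect(observation):
            break  # reflection accepted the result; stop iterating
    return memory


if __name__ == "__main__":
    print(run_agent("2+3"))
    print(run_agent("jazz"))
```

In a real system the stub policy would be a model call that emits a structured tool request, and the reflection step would typically be a second model call that critiques the observation; the control flow above only shows where those pieces would sit.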
Includes English-language materials
Education
Machine learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Deep learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
Transformer
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
Large language models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
SFT
https://cameronrwolfe.substack.com/p/understanding-and-using-supervised
Understanding how SFT works from the idea to a working implementation...
RLHF
[English] What is RLHF?
https://aws.amazon.com/what-is/reinforcement-learning-from-human-feedback/
Reinforcement learning from human feedback (RLHF) is a machine learning (ML) technique that uses human feedback to optimize ML models to self-learn more efficiently.
https://www.ibm.com/think/topics/rlhf
Reinforcement learning from human feedback (RLHF) is a machine learning technique in which a “reward model” is trained with direct human feedback, then used to optimize the performance of an artificial intelligence agent through reinforcement learning.
RAG
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, for complete beginners, with full examples, based on the latest Python 3 release.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
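PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.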
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
AI agent
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
LangChain
https://python.langchain.com/docs/tutorials/
New to LangChain or LLM app development in general? Read this material to quickly get up and running building your first applications.
https://www.freecodecamp.org/news/beginners-guide-to-langchain/
LangChain is a popular framework for creating LLM-powered apps.
LlamaIndex
https://developers.llamaindex.ai/python/framework/getting_started/starter_example/
This tutorial will show you how to get started building agents with LlamaIndex.
https://www.ibm.com/think/tutorials/llamaindex-rag
LlamaIndex is a powerful open source framework that simplifies the process of building RAG pipelines.
AutoGPT
[English] What is AutoGPT?
https://www.ibm.com/think/topics/autogpt
https://www.youtube.com/watch?v=v-5AWQlTFw8
Someone has created a version of ChatGPT called AutoGPT and it’s a lot more powerful.
Related positions
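Internship · Amap (Gaode) · Research Intern
Team introduction: Our team is responsible for analysis and application development on Amap's mobility big data. We explore machine learning, deep learning, and LLM agents to solve business problems and identify business opportunities, building an intelligent engine for user travel data. We encourage innovation, publishing papers, and filing patents, and we are glad to try new technology in real business deployments. We look forward to having you join us!
Responsibilities include but are not limited to:
1. Carry out data mining R&D in machine learning and deep learning, using Amap's people, place, and spatiotemporal big data to build user profiles and solve user pain points.
2. Work on applications that combine spatiotemporal big data with AI Agents, and explore the deployment timeline for spatiotemporal big-data Agents.
3. Fine-tune and optimize foundation LLMs for specific business scenarios, upgrading the capabilities and improving the performance of specialized skill models.
Updated 2025-03-27 · Beijing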

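Experienced hire · 3+ years of experience · AI Project Group
1. Take part in R&D of the Agent system for the Hupu content ecosystem, building LLM-based intelligent decision-making and task execution capabilities.
2. Explore cutting-edge Agent technology paths, combining NLP, multimodal, and reinforcement learning methods to build agents that perceive, understand, and act efficiently.
3. Drive the deployment of Agent solutions in real business scenarios, supporting applications such as intelligent operations, content creation, and personalized recommendation.
4. Collaborate with product and engineering teams to keep optimizing the Agent system's performance and user experience, raising the level of interaction intelligence and business value.
Updated 2025-09-29 · Shanghai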
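Campus hire · J1003
1. Build Agent systems on Kuaishou's in-house foundation LLM, and create native LLM applications such as Deep Research.
2. Contribute to areas including but not limited to agentic dataset construction, SFT cold-start training, end-to-end RL training of agentic reasoning models, and prompt optimization.
Updated 2025-08-04 · Beijing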