美团大模型应用算法工程师(Agent 方向)
社招全职3年以上核心本地商业-美团平台地点:北京状态:招聘
任职要求
1、本科及以上学历,计算机、人工智能、自动化、数学、物理等相关专业; 2、在强化学习、语言模型、机器学习等一个或多个领域有较深入的研究者; 3、好奇心驱动,具有出色的分析、解决问题的能力,有自主探索解决方案的能力者; 4、具有良好的沟通协作能力,对追求纯粹的技术有强烈热情,工作积极主动,能够与团队融洽合作,一起探索新技术并快速试验想法,推进技术进步。 具备以下条件优先 1、具有优秀的基础算法、扎实的机器学习基础,熟悉 NLP、RL、ML 等领域的技术,在 NeurIPS、ICLR、ICML 等顶级会议/期刊上发表论文者优先; 2、具有优秀的代码能力,熟练掌握 C/C++ 或 Python 编程语言,ACM/ICPC、NOI/IOl、Top Coder、Kaggle 等比赛获奖者优先; 3、在大语言模型、基础模型、世界模型、RL,主导过大影响力项目者优先。
工作职责
1、探索模型通过 RL Scaling 等方式使用成套工具解决复杂问题的行动和规划能力,包括 Human in the Loop 多轮交互下 Agent 基础建模的新方案、以及与复杂环境的交互学习能力; 2、探索模型在 Non-Rule Based Outcome 场景下利用复杂信息进行有效 Reasoning 推理的范式,包括 Proactive Agent 的建模方案; 3、探索研究更多内在奖励的机制,从而激发模型主动学习和自我更新的能力; 4、探索构建长期记忆机制,为下一代高效的推理模型、长序列推理及建模提供基础
包括英文材料
学历+
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
NeurIPS+
https://neurips.cc/
ICLR+
https://iclr.cc/
ICML+
https://icml.cc/
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Kaggle+
[英文] Kaggle Learn
https://www.kaggle.com/learn
Gain the skills you need to do independent data science projects.
相关职位
实习高德地图2026
团队介绍: 高德地图为您导航,前方路口请“左转”,我们是高德地图交通&行中智能团队。 我们的使命是基于高德海量高质的数据,最前沿的AI算法,最可靠的工程架构,打造有温度、有惊喜、科技感十足的智能出行体验; 在这里,我们一起建设应对超大业务规模,超高业务复杂度的高效、可靠、鲁棒的技术架构;一起用最前沿的机器学习、深度学习、AI算法探索导航领域最具挑战性的行业难题;一起用最尖端的AIGC、LLM/LVM、多模态理解与生成、Agent等技术,打造全新的出行交互体验; 团队简单直接、有情有义、温暖有爱,欢迎加入,一起用技术驱动创新,为海量用户护航! 职位职责包括但不限于: 基于前沿的AIGC、LLM/LVM、MLLM多模态理解与生成、AI Agent等技术,实现高德地图导航过程全场景、全时空、多模态的内容理解/生成以及智能交互,不断提升用户的出行质量和体验。
更新于 2025-03-06
校招J1003
1、基于快手自研基础大模型,构建Agent系统,并打造Deep Research等原生大模型应用; 2、参与包括但不限于agentic数据集构造、SFT冷启动训练、RL端到端训练agentic reasoning model、prompt优化等方向。
更新于 2025-08-04