Alibaba AI Agent Optimization Engineer - Training/Data/Evaluation
Internship / Part-time | Alibaba Class of 2027 Interns | Location: Beijing, Hangzhou | Status: Hiring
Requirements
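Basic requirements
1. Academic background: master's or PhD candidates in computer science, mathematics, statistics, or related fields are preferred; outstanding undergraduates are also welcome.
2. Model understanding and optimization: a deep understanding of the Transformer and of how mainstream LLM architectures have evolved; hands-on experience with post-training algorithms and a solid grasp of how they work; hands-on Agentic RL training experience is a plus.
3. AI application building: command of mainstream AI protocols (MCP, Skills, etc.), memory systems (Memory), and knowledge bases (RAG); candidates who have independently built an AI application with some impact are preferred.
4. Coding and engineering: strong Python programming skills, proficiency in PyTorch, familiarity with LLM training and inference frameworks (Megatron-LM, vLLM, DeepSpeed, etc.), and the ability to resolve engineering problems in distributed environments efficiently.
5. Data construction: a strong Data-centric AI mindset and expertise in mining the high-quality data needed for post-training…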
Responsibilities
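This is a broad technical role spanning AI inference and optimization. It suits candidates who want to work on:
● Building AI applications and optimizing models
● Building data for AI applications and automated evaluation
● Building multimodal AI applications and optimizing algorithms

Working on real, core business scenarios, you will take part in the systematic building and optimization of AI applications and help turn AI into an engine of business growth. Responsibilities include one or more of the following directions:
1. Full-lifecycle evolution of AI applications: deep involvement in business problem modeling, application architecture design, context engineering, training data construction, automated evaluation systems, model post-training optimization, and related work;
2. Data flywheel construction: build a high-quality data production pipeline, explore synthetic data (Synthetic Data) and efficient distillation approaches, and close the "business - model - feedback" iteration loop;
3. Evaluation system construction: design a complete system for evaluating AI application quality against business goals, build an automated evaluation framework, and establish quantitative evaluation that links offline evaluation to online business metrics;
4. Reinforcement learning and reward design: build an engineerable Reward system and RL training environments to improve the model's controllability and generalization in vertical business scenarios;
5. AI external capability stack: implement the knowledge base (RAG), long- and short-term memory system (Memory), tool calling, multi-Agent collaboration framework, and other components that AI applications need;
6. Multimodal AI application development: build multimodal perception and reasoning capabilities for AI applications, and solve deployment problems in scenarios such as UI automation, visual understanding and moderation, and multimodal conversation.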
Includes English-language materials
Transformer
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
Large Language Models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
MCP
https://www.youtube.com/watch?v=eur8dUO9mvE
Unlock the secrets of MCP! 🚀 Dive into the world of Model Context Protocol and learn how to seamlessly connect AI agents to databases, APIs, and more. Roy Derks breaks down its components, from hosts to servers, and showcases real-world applications. Gain the knowledge to revolutionize your AI projects!
https://www.youtube.com/watch?v=L94WBLL0KjY
Let's talk about MCP or the Model Context Protocol.
RAG
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, beginner-friendly, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
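PyTorch is an important tool for data science research using deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.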
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Megatron
https://www.youtube.com/watch?v=hc0u4avAkuM