滴滴用户平台部-算法实习生(1)
实习兼职算法类地点:北京状态:招聘
任职要求
- 计算机/AI相关专业本科或硕士在读 - 至少可实习 4个月、每周至少 3 天。 - 熟练Linux 系统常用命令,Python 、PyTorch、SQL - 具备机器学习、深度学习、强化学习基础 - 熟悉至少一种Agent开发框架/系统,如:Langchain、LangGraph、AutoGen,Dify、扣子。 加分项: - 熟悉知名Agent…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
Agent开发工程师 工作内容: 智能客服场景的Agent设计、开发,提升机器人服务质量、效率。 具体内容: 1.针对单轮、多轮任务设计合适的Agent架构(如ReACT/Reflexion/Multi-Agent) 2.根据反馈数据,优化Agent设计模式,持续进化。 3.基于Bad Case训练模型,提升Agent的理解、思考、决策、工具调用能力
包括英文材料
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
还有更多 •••
相关职位
实习算法类
大模型训练工程师 工作内容: 深入参与垂类场景Agent大模型的预训练和后训练,构建领域对话大模型。 具体内容: 1. 支持预训练和后训练任务,并在导师指导下完成 PPO/DPO/GRPO 等算法应用及改进,支持大模型分布式训练实验。 2. 协助构建奖励模型:偏好数据清洗、特征工程、鲁棒性测试,跟踪实验指标及可视化分析。 3. 驱动效果闭环:搭建离线多维度指标与在线 A/B 体系,打造自迭代数据飞轮。
更新于 2025-08-04北京
实习技术类
工作内容: 深入参与构建大模型 Agent 高质量数据合成飞轮,构建多维度自动评估体系链路闭环。 具体工作: 1.针对业务场景设计基于LLM大模型的多轮对话数据合成链路,构建高质量数据合成pipeline以及线上自进化数据飞轮。 2.基于业务场景抽象用户画像,设计可插拔的画像策略,实现支持多轮对话策略的自动交互,兼顾可控性与多样性。 3.参与设计建设自动化评估,实时监控数据质量与模型效果。
更新于 2025-08-04北京
实习技术类
【大模型训练实习生】 工作内容: 深入参与垂类场景Agent大模型的预训练和后训练,构建领域对话大模型。 具体工作: 1.支持预训练和后训练任务,并在导师指导下完成 PPO/DPO/GRPO 等算法应用及改进,支持大模型分布式训练实验。 2.协助构建奖励模型:偏好数据清洗、特征工程、鲁棒性测试,跟踪实验指标及可视化分析。 3.驱动效果闭环:搭建离线多维度指标与在线 A/B 体系,打造自迭代数据飞轮。
更新于 2025-08-05北京