
商汤26届AI领航员-智慧零售具身智能算法开发工程师
校招全职算法研究类地点:上海状态:招聘
任职要求
1. 扎实的机器学习基础: 熟悉深度学习、强化学习、计算机视觉和自然语言处理等领域的经典算法,并具备丰富的实践经验。 2. VLA或相关经验: 具有 VLA、视觉-语言模型 (VLM) 或视觉-动作模型 (VAM) 的研发经验,有利用大规模多模态数据训练模型的实际项目经验者优先。 3. 具身智能经验: 熟悉机器人操作系统 (ROS),具备机器人硬件相关的开发经验,如控制算法、传感器数据处理(RGB-D相机、IMU等)以及机械臂控制等。 数据闭环经验: 具有构建具身智能数据闭环系统的相关经验,了解如何高效地采集、处理和利用机器人交互数据。 4. 编程能力: 精通 Python,熟悉 PyTorch 或 TensorFlow 等主流深度学习框架,具备良好的代码风格和工程化能力。 5. 创新与学习能力: 对具身智能领域充满热情,具备强大的独立思考和解决问题的能力,能够快速学习和掌握新知识。 加分项 1. 在机器人学、计算机视觉、自然语言处理等顶级会议或期刊(如 ICLR, NeurIPS, ICML, CVPR, ICCV, ECCV, CoRL 等)上发表过相关论文。 2. 有开源项目贡献者。 3. 有具身智能相关的实际产品开发经验。
工作职责
1. VLA模型研发: 参与或主导 VLA 模型的架构设计、训练和优化,提升模型在多模态理解和具身任务执行中的性能。 2. 数据闭环建设: 负责具身智能所需的数据采集、标注和处理流程,构建高效的数据闭环系统,以持续优化模型。你将探索新的数据获取方式,包括但不限于利用机器人自身进行自动化数据采集。 3. 具身技能开发: 将 VLA 模型部署到实际机器人平台上,解决模型与机器人硬件之间的集成和适配问题。开发和调试机器人技能,使其能够完成抓取、放置、操作工具等复杂任务。 4. 算法优化与落地: 持续关注具身智能领域的最新研究成果,并将前沿算法应用到实际产品中,解决技术挑战,推动产品性能的迭代升级。
包括英文材料
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
OpenCV+
https://learnopencv.com/getting-started-with-opencv/
At LearnOpenCV we are on a mission to educate the global workforce in computer vision and AI.
https://opencv.org/university/free-opencv-course/
This free OpenCV course will teach you how to manipulate images and videos, and detect objects and faces, among other exciting topics in just about 3 hours.
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
ROS+
https://www.youtube.com/watch?v=92Zz5nnd41c&list=PLk51HrKSBQ8-jTgD0qgRp1vmQeVSJ5SQC
https://www.youtube.com/watch?v=HJAE5Pk8Nyw
Ready to learn ROS2 and take your robotics skills to the next level?
https://www.youtube.com/watch?v=MWKnMPX0Yjg&list=PLU9tksFlQRircAdEplrH9NMm4WtSA8yzi
Do you want to know more about ROS the Robot Operating System?
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
ICLR+
https://iclr.cc/
NeurIPS+
https://neurips.cc/
ICML+
https://icml.cc/
CVPR+
https://cvpr.thecvf.com/
ICCV+
https://iccv.thecvf.com/
ICCV is the premier international computer vision event comprising the main conference and several co-located workshops and tutorials.
ECCV+
https://eccv.ecva.net/
ECCV is the official event under the European Computer Vision Association and is biannual on even numbered years.
相关职位

校招算法研究类
1. 面向智慧城市大规模图像/视频内容分析场景,参与VLM、MLLM等多模态算法的研究工作,探索城市视觉智能更好的解决方案,帮助下游产品线在行业内建立技术优势; 2. 构建和维护相关研究方向的代码框架、数据基础,紧跟学术前沿,输出创新研究成果。
更新于 2025-08-21

校招算法研究类
1. 设计和开发先进的深度学习算法,用于图像和视频数据的分析和理解,特别是在智慧城市安全、交通和公共服务等领域。 2. 研究和实现最新的机器学习和深度学习技术,尤其是多模态相关的技术,以提高算法的准确性和效率。 3. 与产品团队紧密合作,了解市场需求,将算法研究成果转化为产品特性。 4. 进行算法测试和优化,确保它们在不同的环境和条件下具。 5. 编写技术文档和算法评估报告,支持技术团队和非技术团队成员的理解和应用。
更新于 2025-08-21

校招交付运维
1.参与公司软件系统的实施交付,包括安装部署、系统配置、客户培训与现场支持; 2.协助客户完成系统上线前的测试、数据准备、问题处理等任务; 3.与研发团队协作,推动问题闭环,确保项目按时交付; 4.支持售前技术交流和客户沟通,为项目交付提供技术支撑; 5.根据项目需求,配合出差至客户现场,提供技术服务。
更新于 2025-07-28