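Anker Innovations: Robotics Multimodal Large Model Algorithm Engineer (PhD)
Campus recruitment | Full-time | Location: Shenzhen | Beijing | Shanghai | Status: Hiring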
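Qualifications
Job requirements
1. Master's degree or above in computer science, artificial intelligence, robotics, or a related field.
2. Expert knowledge of the Transformer architecture and the large-model technology stack (fine-tuning, deployment), plus command of reinforcement learning (PPO/SAC) or imitation learning (BC/GAIL) frameworks.
3. Proficient with PyTorch/TensorFlow, expert in Python/C++, and familiar with Linux/ROS development environments.
Preferred qualifications
1. Publications on embodied intelligence or robot learning at CVPR/NeurIPS/ICML/CoRL.
2. Experience with mainstream open-source embodied-intelligence projects (e.g., RT-1/RT-2, OpenVLA, the π series such as π0/π0.5).
3. Familiarity with real-world deployment of robotic grasp planning or human-robot collaboration scenarios.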
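Responsibilities
Job responsibilities
1. Develop embodied-intelligence cognitive architectures (VLM/VLA/VLN) that deliver multimodal instruction understanding, long-horizon task planning, and autonomous navigation systems.
2. Design reinforcement learning (RL) / imitation learning (IL) decision-making frameworks that address sparse rewards in open-world scenarios.
3. Optimize model architectures and improve computational efficiency (model pruning/quantization) to solve on-device deployment challenges.
4. Lead Sim2Real transfer from simulation (Isaac Gym/MuJoCo) to real robots (humanoid robots/robotic arms).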
Includes English-language materials
Education
Transformer
https://huggingface.co/learn/llm-course/en/chapter1/4
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
Large models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Reinforcement learning
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
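PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.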
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, starts from zero, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction to all of the core concepts in Python.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Linux
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
ROS
https://www.youtube.com/watch?v=92Zz5nnd41c&list=PLk51HrKSBQ8-jTgD0qgRp1vmQeVSJ5SQC
https://www.youtube.com/watch?v=HJAE5Pk8Nyw
Ready to learn ROS2 and take your robotics skills to the next level?
https://www.youtube.com/watch?v=MWKnMPX0Yjg&list=PLU9tksFlQRircAdEplrH9NMm4WtSA8yzi
Do you want to know more about ROS the Robot Operating System?
CVPR
https://cvpr.thecvf.com/
NeurIPS
https://neurips.cc/
ICML
https://icml.cc/
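Related positions
Campus recruitment
1. Drive the engineering deployment of robot multimodal large models (VLM/VLA) and 3D perception algorithms, covering pre-training, fine-tuning, training acceleration, and performance tuning.
2. Build simulation environments on Isaac Sim to validate manipulation models, and design a real2sim2real transfer framework to accelerate algorithm validation and deployment.
3. Research and develop embodied-intelligence algorithms, including different data mixtures, network architectures, and robot embodiments, to accomplish long-sequence tasks and skill generalization in consumer (toC) scenarios.
4. Develop automated annotation algorithms (2D/3D/VLA, etc.) to reduce annotation cost and improve annotation quality.
5. Design generation algorithms for multimodal data (images, video, point clouds, etc.) to increase data diversity.
Updated 2025-05-16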
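Experienced hire | 3+ years of experience
1. Research and develop multimodal large-model algorithms suited to robots, covering modalities including but not limited to language, images, video, and point clouds, for use in robot environment perception, decision-making, planning, and control.
2. Design, develop, and validate multimodal large-model algorithms, using simulation and closed-loop data pipelines to control and quantify the effect of each algorithm iteration.
3. Develop world models and generative models and build closed-loop rendering systems to support the training of end-to-end models.
4. Survey cutting-edge algorithms in depth and explore whether they can be deployed in concrete scenarios.
Updated 2025-03-06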
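Experienced hire
1. Develop general-purpose embodied algorithms and apply them to humanoid-robot tasks, with generalization across objects, tasks, and scenes.
2. Research multimodal embodied large models with visual, tactile, and language perception and decision-making capabilities that control robots to carry out physical interaction in the open world.
Updated 2025-04-28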