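DJI Senior Computer Vision Algorithm Engineer (Multimodal Large Models)
Experienced hire, full-time, 3+ years of experience, algorithms. Location: Shenzhen | Shanghai. Status: hiring
Requirements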
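1. Master's degree or above, with a background in computer science, information engineering, electronic engineering, robotics, or a related field;
2. In-depth understanding of deep learning, robotics, computer graphics, and geometric computer vision, including the applicable conditions and bottlenecks of each algorithm;
3. Development experience with C++/Python/PyTorch/ROS;
4. At least 3 years of research or development experience in VLM, reinforcement learning, or similar areas; familiarity with object segmentation, object detection, and pose estimation is preferred;
5. Preference for candidates who have taken full ownership of solution design, model optimization, automatic annotation, and model evaluation for one or more SOTA algorithms and have shipped them in hardware products; preference also for candidates who have led the development and delivery of intelligent-system hardware products built from multiple NN modules;
6. Publications at mainstream conferences or journals in related fields (CVPR/ICCV/ECCV/NIPS/ICML/ICLR/IROS/ICRA) are preferred;
7. Strong passion for bringing algorithms into products, and skill at using technology to solve product problems.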
Responsibilities
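1. Research and develop multimodal fusion algorithms; design and develop algorithm systems for environment understanding and interactive control on drones and handheld imaging devices;
2. Design and build key parts of the VLM/VLA capability stack, for example the design and algorithm implementation of one or more modules among data quality, data mining, data generation, automatic annotation, and automatic iteration;
3. Communicate across modules and lead the design, development, and delivery of system solutions for the related intelligent features.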
Includes English-language materials
Education
Deep learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
OpenCV
https://learnopencv.com/getting-started-with-opencv/
At LearnOpenCV we are on a mission to educate the global workforce in computer vision and AI.
https://opencv.org/university/free-opencv-course/
This free OpenCV course will teach you how to manipulate images and videos, and detect objects and faces, among other exciting topics in just about 3 hours.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, for complete beginners, with full examples, based on the latest Python 3 release.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
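PyTorch is an important tool for data science research with deep learning. It has considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.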
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
ROS
https://www.youtube.com/watch?v=92Zz5nnd41c&list=PLk51HrKSBQ8-jTgD0qgRp1vmQeVSJ5SQC
https://www.youtube.com/watch?v=HJAE5Pk8Nyw
Ready to learn ROS2 and take your robotics skills to the next level?
https://www.youtube.com/watch?v=MWKnMPX0Yjg&list=PLU9tksFlQRircAdEplrH9NMm4WtSA8yzi
Do you want to know more about ROS the Robot Operating System?
Reinforcement learning
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
CVPR
https://cvpr.thecvf.com/
ICCV
https://iccv.thecvf.com/
ICCV is the premier international computer vision event comprising the main conference and several co-located workshops and tutorials.
ECCV
https://eccv.ecva.net/
ECCV is the official event under the European Computer Vision Association and is biannual on even numbered years.
ICML
https://icml.cc/
ICLR
https://iclr.cc/
Related positions
Experienced hire, software
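1. Work with product managers on requirements, communicate across modules, survey frontier technologies (such as multimodal large models), and lead the design, development, and delivery of system solutions for intelligent shooting features (such as intelligent following, MasterShots, and natural interaction through gestures and voice);
2. Building on platform component capabilities and product requirements, combine and innovate efficiently to design and develop the related algorithm systems, and work with embedded engineers to ship the features.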
Updated 2025-04-03
Experienced hire, algorithms
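1. Research and develop large-model algorithms for drone scenarios, achieving deep integration of perception, path planning, dynamic obstacle avoidance, and flight control;
2. Develop multimodal large models (vision/LiDAR/MU/geographic information, etc.) to improve drones' autonomous decision-making in complex environments (urban, wilderness, low-altitude);
3. Help build large-scale drone datasets, design data annotation strategies and simulation training systems, and improve drone system performance;
4. Continuously track frontier advances in general robotics and large models, and carry out technical benchmarking and prototype validation.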
Updated 2025-06-18
Experienced hire, 1+ years of experience, technical (algorithms)
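Apply computer vision, natural language processing, multimodal understanding, data mining, machine learning, and related techniques to the massive data of Alibaba International Digital Commerce Group. Build a multimodal pre-trained large-model foundation, bring frontier research into production, and combine technical theory with business innovation, providing foundational algorithm capabilities for international e-commerce scenarios such as product understanding and structuring, image search and identical-product matching, search and recommendation, and data analysis and decision-making.
1. Research and develop the multimodal pre-trained foundation model for e-commerce; abstract and solve the basic problems of product understanding so that the model acquires general business knowledge; and build systematic solutions to key problems such as large-model hallucination, reasoning ability, and model acceleration, raising both the iteration efficiency and the performance ceiling of downstream businesses.
2. Based on the multimodal pre-trained large model, deliver key product-understanding tasks such as product category/attribute/tag prediction, identical-product matching, and product image search, and improve business metrics.
3. Study frontier papers and follow technical trends, understand the underlying algorithm principles in depth, explore and experiment with forward-looking core technologies, achieve core technical breakthroughs and innovation, and publish related papers.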
Updated 2025-09-02