DJI Multimedia Algorithm Engineer (Beijing)
Campus recruitment · Full-time · Algorithms · Location: Beijing · Status: Hiring
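Requirements
1. Bachelor's degree or above in computer science, electronic engineering, automation, image processing, computational photography, pattern recognition, communications/signal processing, or a related field;
2. Solid foundation in image processing algorithms and strong mathematical skills;
3. Strong programming fundamentals: proficient in Python/C/C++ and other programming technologies and in common data structures and algorithms, and skilled with the model training tools commonly used in industry;
4. Project experience that counts as a plus:
① Experience developing and shipping video processing algorithms on mobile and PC platforms, including Mac, iOS, Android, and Windows;
② Parallel optimization experience, with familiarity with parallel-processor programming languages such as OpenGL, Metal, and CUDA;
③ Project experience in computational photography and imaging, such as image enhancement, portrait enhancement, multi-view measurement, projective geometry, matching and stitching, deblurring, dehazing, and super-resolution;
④ Development experience in stabilization, SLAM, or VIO, for example familiarity with feature extraction, tracking, optimization, filtering, loop-closure detection, and IMU attitude estimation;
⑤ Experience training and applying large models (VLM, LLM), for example familiarity with LLaVA, Qwen-VL, and CLIP;
5. Publications at leading conferences or journals in related fields are preferred (CVPR/ICCV/ECCV/NeurIPS/PAMI/ICML/ICLR/ICRA);
6. Strong learning and problem-analysis abilities; willing to think boldly and act on it, committed to excellence, steady and persistent, self-reflective, and a good team player who wants to grow together with the team.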
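Responsibilities
Join us and reshape the future of imaging creation!
Our unique battlefield: two product lines, twin engines of creativity.
As the "smart partner" of our flagship hardware, you will give drones and handheld devices intelligence across the entire creative workflow: from intelligent organization of footage after shooting and AI-assisted editing to a seamless one-tap sharing experience.
Challenge: rebuild creative efficiency, freeing users from tedious operations so they can focus on the creative work itself.
As a software-driven "disruptor", you will use image quality enhancement algorithms, video stitching algorithms, AI scene recognition, and a visual storytelling engine as core weapons to build a competitive moat of capabilities that no one else has.
Breakthrough: let ordinary devices produce professional-grade imagery, using algorithms to push past the physical limits of hardware and redefine the ceiling for image quality.
1. Develop and optimize algorithms for intelligent video auto-editing, video quality enhancement, and image matching and alignment for DJI products;
2. Take part in bringing the above features to production in DJI products;
3. Continuously track advances in video understanding and processing technology in China and abroad, and innovate and ship based on business needs.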
Includes English-language materials
Education
Image processing
https://opencv.org/blog/computer-vision-and-image-processing/
This fascinating journey involves two key fields: Computer Vision and Image Processing.
https://www.geeksforgeeks.org/python/image-processing-in-python/
Image processing involves analyzing and modifying digital images using computer algorithms.
https://www.youtube.com/watch?v=kSqxn6zGE0c
In this Introduction to Image Processing with Python, Kaggle grandmaster Rob Mulla shows how to work with image data in Python!
Pattern recognition
https://www.mathworks.com/discovery/pattern-recognition.html
Pattern recognition is the process of classifying input data into objects, classes, or categories using computer algorithms based on key features or regularities.
https://www.microsoft.com/en-us/research/wp-content/uploads/2006/01/Bishop-Pattern-Recognition-and-Machine-Learning-2006.pdf
Pattern recognition has its origins in engineering, whereas machine learning grew out of computer science.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese: free, beginner-friendly, with complete examples, based on the latest version of Python 3.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction to all of the core concepts in Python.
C
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
This handbook follows the 80/20 rule: you will learn 80% of the C programming language in 20% of the time.
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Data structures
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial (Java)
iOS
https://www.youtube.com/watch?v=UNH0bE4zPtY&list=PLSzsOkUDsvdu5Mm67aBYs2YPu2OM4mFzt
Windows
[English] Windows 10 Tutorial
https://www.tutorialspoint.com/windows10/index.htm
This tutorial gives you all the in-depth information on this new operating system and its procedures.
CUDA
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Learn how to program with NVIDIA CUDA and leverage GPUs for high-performance computing and deep learning.
SLAM
https://docs.mrpt.org/reference/latest/tutorial-slam-for-beginners-the-basics.html
[English] SLAM for Dummies
https://dspace.mit.edu/bitstream/handle/1721.1/119149/16-412j-spring-2005/contents/projects/1aslam_blas_repo.pdf
A Tutorial Approach to Simultaneous Localization and Mapping
https://ouster.com/insights/blog/introduction-to-slam-simultaneous-localization-and-mapping
SLAM is an essential piece in robotics that helps robots to estimate their pose – the position and orientation – on the map while creating the map of the environment to carry out autonomous activities.
[English] What Is SLAM?
https://www.mathworks.com/discovery/slam.html
How it works, types of SLAM algorithms, and getting started
Large models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
CVPR
https://cvpr.thecvf.com/
ICCV
https://iccv.thecvf.com/
ICCV is the premier international computer vision event comprising the main conference and several co-located workshops and tutorials.
ECCV
https://eccv.ecva.net/
ECCV is the official event under the European Computer Vision Association and is biennial, held in even-numbered years.
NeurIPS
https://neurips.cc/
ICML
https://icml.cc/
ICLR
https://iclr.cc/
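Related positions
Campus recruitment · Multimedia algorithms
1. Participate in R&D of audio algorithms, engines, and strategies for video-on-demand and live-streaming scenarios, ensuring a high-quality audio experience;
2. Participate in developing the live-streaming audio engine, including the audio capture, rendering, and mixing modules, and complete integration and performance tuning across multiple platforms;
3. Participate in research on audio strategy algorithms, including but not limited to:
① 3A algorithms such as noise suppression (NS) and acoustic echo cancellation (AEC);
② AI algorithms such as speech synthesis and restoration;
③ Codec-related algorithms such as packet-loss resilience and weak-network countermeasures (FEC, PLC);
4. Track cutting-edge audio technology in the industry, and take part in research and product rollout for 3D sound effects and spatial audio;
5. Participate in building the audio quality evaluation system, and support subjective/objective audio quality testing and issue diagnosis.
Updated 2025-09-10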
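Campus recruitment · Multimedia algorithms
1. Conduct cutting-edge research in audio and video technology, artificial intelligence, and video/image processing and generation, keeping the algorithms at the forefront of both industry and academia;
2. Explore how cutting-edge techniques can be applied in areas such as video/image quality assessment, video/image analysis and processing, intelligent encoding, and intelligent frame extraction.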
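Experienced hire · 3-5 years · Multimedia algorithms
1. Support the rollout of speech recognition and audio understanding across Xiaohongshu's wide range of business scenarios, and continuously improve the performance of large-model speech recognition;
2. Keep up with the most advanced audio understanding technology, including but not limited to proposing new audio understanding frameworks, improving existing algorithms, and continuously raising the relevant technical and business metrics; writing papers and filing patents is encouraged.
Updated 2025-09-09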