小米DMS算法工程师
社招全职A154925地点:北京状态:招聘
包括英文材料
图像处理+
https://opencv.org/blog/computer-vision-and-image-processing/
This fascinating journey involves two key fields: Computer Vision and Image Processing.
https://www.geeksforgeeks.org/python/image-processing-in-python/
Image processing involves analyzing and modifying digital images using computer algorithms.
https://www.youtube.com/watch?v=kSqxn6zGE0c
In this Introduction to Image Processing with Python, kaggle grandmaster Rob Mulla shows how to work with image data in python!
OpenCV+
https://learnopencv.com/getting-started-with-opencv/
At LearnOpenCV we are on a mission to educate the global workforce in computer vision and AI.
https://opencv.org/university/free-opencv-course/
This free OpenCV course will teach you how to manipulate images and videos, and detect objects and faces, among other exciting topics in just about 3 hours.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
相关职位
社招2年以上技术
1.负责自动驾驶乘客行为监测算法开发与落地,如异常触碰、物品遗失、违规乘车等行为,工作包括不局限于:图像分类、多任务学习、多模态大模型等 2.负责DMS司机行为监测算法开发落地,包括司机疲劳分心驾驶监测,工作包括不局限于:图像分类、多任务学习、人脸属性识别、人脸跟踪等。 3.研究与分享大模型前沿技术,落地视觉多模态理解和图像生成大模型
更新于 2025-08-11

社招3年以上
- 与业务、产品团队沟通,共同确定DMS/OMS系统功能研发计划,保障产品功能相对竞品的领先性; - 带领团队,优化研发流程体系,实现DMS/OMS功能开发交付,保障开发进度与交付质量; - 跟进前沿技术进展,组织技术攻关,保障产品量产交付中的问题高效解决;
更新于 2022-04-22

社招3年以上软件序列
1. 负责行车辅助、自动泊车、视觉语言模型(VLM)、驾驶员监测系统(DMS)、端到端模型等数据的接入与后处理工作,为智能驾驶人机交互体验的持续优化提供高质量数据支持 ; 2. 制定人机交互功能所需的数据解决方案,并对感知模型提出明确的数据需求,参与模型输出结果的验收评估 ; 3. 研究端到端学习、视觉语言行动(VLA)、舱驾一体化等新兴技术在人机交互场景中的应用 ; 4. 维护人机交互引擎的稳定性与高性能表现,确保全链路帧率流畅及数据协议的兼容性与合理性;
更新于 2025-08-19