Xiaomi Multimodal Algorithm Engineer Intern
Internship / Part-time
Location: Beijing
Status: Hiring
Requirements
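1. A degree in a computer-related field; computer vision, machine learning, or artificial intelligence majors preferred.
2. A solid theoretical foundation in computer vision and machine learning, familiarity with deep learning networks, and in-depth research in at least one area of computer vision, including but not limited to image segmentation, object detection, tracking, large vision models, and multimodal large models. Project experience in industrial vision scenarios (for example, defect detection) is preferred.
3. Strong hands-on skills, with the ability to design and optimize networks; good English reading ability, able to read papers from top conferences and journals directly and implement the algorithms they describe.
4. Research and explore the latest image algorithms and techniques, continuously optimize visual inspection algorithms to improve detection accuracy and efficiency, and improve the internal algorithm platform.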
Responsibilities
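1. Research, develop, and implement algorithms for large vision models, multimodal large models, defect detection, object detection, image segmentation, and related areas, and deploy them in industrial scenarios.
2. Build up core image-recognition capabilities and turn them into products, coordinating across teams for rapid deployment; apply new computer vision and AI techniques to improve product performance.
3. Optimize and integrate algorithms for both edge devices and the cloud.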
English-language study materials
OpenCV
https://learnopencv.com/getting-started-with-opencv/
At LearnOpenCV we are on a mission to educate the global workforce in computer vision and AI.
https://opencv.org/university/free-opencv-course/
This free OpenCV course will teach you how to manipulate images and videos, and detect objects and faces, among other exciting topics in just about 3 hours.
Machine learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Deep learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
Large models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/