Xiaomi Vision and Multimodal Algorithm Engineer Intern
Internship / part-time · Location: Beijing · Status: Hiring
Requirements
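1. Master's degree or above in computer science, artificial intelligence, machine learning, electronic information, automation, mathematics, or a related major, with a focus on multimodal large models, computer vision, or a related area.
2. Some hands-on experience with multimodal algorithms or computer vision, and a deep understanding of computer vision and deep learning algorithms.
3. Strong programming skills: proficiency in at least one deep learning framework such as PyTorch, and in at least one programming language such as Python or C++.
4. Strong interest in multimodal large models, computer vision, deep learning, and related fields; able to intern at least four days per week for at least three months.
5. Preferred: high-quality publications in multimodal large models, computer vision, deep learning, or related fields, or strong results in mainstream algorithm competitions in these fields.
6. Preferred: strong results in programming and algorithm contests such as ACM/ICPC, CodeForces, or IOI/NOI/NOIP/CSP.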
Responsibilities
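1. Research frontier techniques in multimodal large models, computer vision, large-model reasoning, reinforcement learning, and related areas in depth, and optimize algorithms for the products so that their performance reaches an industry-leading level.
2. Bring multimodal large models into Xiaomi's products: based on product requirements, take part in algorithm design, development, validation, integration, optimization, and maintenance, and resolve the technical problems that arise during productization so that launch requirements are met.
3. Take part in academic research in related fields and produce research results with industry impact.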
Includes English-language materials
Machine learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Large models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
OpenCV
https://learnopencv.com/getting-started-with-opencv/
At LearnOpenCV we are on a mission to educate the global workforce in computer vision and AI.
https://opencv.org/university/free-opencv-course/
This free OpenCV course will teach you how to manipulate images and videos, and detect objects and faces, among other exciting topics in just about 3 hours.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Deep learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
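PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.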
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, starts from zero, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction to all of the core concepts in Python.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Related positions
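Internship
1. Research frontier techniques in multimodal large models, computer vision, large-model reasoning, reinforcement learning, and related areas in depth, and optimize algorithms for the products so that their performance reaches an industry-leading level; 2. Bring multimodal large models into Xiaomi's products: based on product requirements, take part in algorithm design, development, validation, integration, optimization, and maintenance, and resolve the technical problems that arise during productization so that launch requirements are met; 3. Take part in academic research in related fields and produce research results with industry impact.
Updated 2025-08-04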
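Internship
1. Research frontier techniques in multimodal large models, computer vision, large-model reasoning, reinforcement learning, and related areas in depth, and optimize algorithms for the products so that their performance reaches an industry-leading level; 2. Bring multimodal large models into Xiaomi's products: based on product requirements, take part in algorithm design, development, validation, integration, optimization, and maintenance, and resolve the technical problems that arise during productization so that launch requirements are met; 3. Take part in academic research in related fields and produce research results with industry impact.
Updated 2025-09-10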
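Internship
1. Research and implement algorithms for vision large models, multimodal large models, defect detection, object detection, image segmentation, and related tasks, and deploy them in industrial scenarios; 2. Consolidate core image recognition capabilities and build them into products, coordinating across teams for fast delivery; apply new computer vision and AI techniques to improve product performance; 3. Optimize and integrate algorithms on both edge devices and the cloud.
Updated 2025-06-12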