小米多模态视觉感知算法工程师实习生
实习兼职地点:北京状态:招聘
任职要求
1、熟练掌握深度学习基础知识,对视觉感知算法/多模态大模型等方向有相关研究背景; 2、较好的python代码能力,能够熟练使用tensorflow/pytorch中的一种或多种深度学习框架; 3、较好的动手能力,能够快速搭建并评测前沿算法模型; 4、有视觉大模型等相关领域经验或者在CV领域全球顶会发表过相关论文者优先; 5、有较好的Python/C++编码能力和良好的编码习惯。
工作职责
1、调研多模态大模型等领域的前沿算法,并进行评测,给出研究报告和知识体系建设; 2、辅助完成数据采集/数据(自动)标注/模型训练评测等相关工作和流程搭建; 3、完成多模态大模型相关领域的论文,并在计算机视觉类的会议投递发表。
包括英文材料
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
编程规范+
[英文] Google Style Guides
https://google.github.io/styleguide/
Every major open-source project has its own style guide: a set of conventions (sometimes arbitrary) about how to write code for that project. It is much easier to understand a large codebase when all the code in it is in a consistent style.
相关职位
实习
1、调研多模态大模型等领域的前沿算法,并进行评测,给出研究报告和知识体系建设; 2、辅助完成数据采集/数据(自动)标注/模型训练评测等相关工作和流程搭建; 3、完成多模态大模型相关领域的论文,并在计算机视觉类的会议投递发表。
更新于 2025-06-27
实习
1、调研多模态大模型等领域的前沿算法,并进行评测,给出研究报告和知识体系建设; 2、辅助完成数据采集/数据(自动)标注/模型训练评测等相关工作和流程搭建; 3、完成多模态大模型相关领域的论文,并在计算机视觉类的会议投递发表。
更新于 2025-07-29
实习
1、调研多模态大模型等领域的前沿算法,并进行评测,给出研究报告和知识体系建设; 2、辅助完成数据采集/数据(自动)标注/模型训练评测等相关工作和流程搭建; 3、完成多模态大模型相关领域的论文,并在计算机视觉类的会议投递发表。
更新于 2025-09-29