OPPO高级多模态算法工程师(端侧模型方向)-博士
校招全职AI/算法类地点:北京状态:招聘
任职要求
1. 计算机科学,人工智能,统计学,数据挖掘,数学,物理等相关专业,在预训练模型,生成算法,计算机视觉,自然语言处理,模型压缩及端侧部署,信息检索,多模态学习等中的一个或多个领域中具备实践经验; 2. 扎实的编程基础,至少熟悉一种常见的深度学习框架(Pytorch, TensorFlow等); 3. 熟悉计算机视觉,自然语言处理,机器学习等方面的基础技术,有预训练相关实践经验者或在人工智能顶会/期刊上发表研究论文者优先(例如:CVPR/ICCV/ECCV/NIPS/AAAI/ICML/ICLR/TPAMI/TIP/IJCV等); 4. 良好的数学基础与英文阅读能力,具备良好的团队协作意识与优秀的自驱力。
工作职责
1. 负责多模态大模型压缩算法研发,探索并实现极低bit量化技术和稀疏化技术,保持推理精度的同时减少模型存储和计算资源消耗; 2. 负责多模态大模型编解码加速算法研发,降低端侧模型推理成本; 3. 参与端侧大模型技术体系建设,探索和实现高效大模型架构,开发和优化内部模型端侧化部署工具链。 4. 跟进大模型前沿技术发展趋势,探索相关算法的创新优化,发表高质量研究论文。
包括英文材料
数据挖掘+
https://www.youtube.com/watch?v=-bSkREem8dM
Database vs Data Warehouse vs Data Lake
https://www.youtube.com/watch?v=7rs0i-9nOjo
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
OpenCV+
https://learnopencv.com/getting-started-with-opencv/
At LearnOpenCV we are on a mission to educate the global workforce in computer vision and AI.
https://opencv.org/university/free-opencv-course/
This free OpenCV course will teach you how to manipulate images and videos, and detect objects and faces, among other exciting topics in just about 3 hours.
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
信息检索+
https://nlp.stanford.edu/IR-book/information-retrieval-book.html
Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
CVPR+
https://cvpr.thecvf.com/
ICCV+
https://iccv.thecvf.com/
ICCV is the premier international computer vision event comprising the main conference and several co-located workshops and tutorials.
ECCV+
https://eccv.ecva.net/
ECCV is the official event under the European Computer Vision Association and is biannual on even numbered years.
NeurIPS+
https://neurips.cc/
ICML+
https://icml.cc/
ICLR+
https://iclr.cc/
相关职位
校招AI/算法类
1. 负责多模态大模型的构建,包括数据处理和构造、预训练和后训练算法优化,评测benchmark的构建等 2. 负责多模态大模型的下游算法及应用研究,包括图文问答、视频问答、搜索增强、统一理解和生成模型、多模态创作、多模态总结、多模态检索等; 3. 负责多模态大模型在移动端场景的算法优化和应用落地,包括视觉文本处理、UI界面理解和操作、function call等能力的提升 4. 跟进业界前沿技术,持续提升团队在多模态算法方面的技术积累。
更新于 2025-07-14
校招AI/算法类
负责对话相关的前沿算法研究与应用落地,确保对话效果处于行业前列 产出预期: 1、钻研对话 Agent 多模态算法,主导方案效果验证、迭代优化与前沿创新,聚焦多模态特征对齐、多轮对话决策、工具调用、任务编排等关键领域,探索多模态联合建模策略的最大收益,推动技术突破与行业领先。 2、负责与高校合作对话相关的产学研项目,推进学术前沿研究成果转化落地 3、 产出对话算法创新方案的论文与专利"
更新于 2025-07-14
社招4-6年SOFTWARE
1. 负责多模态大模型的基础能力提升,如长窗口拓展,逻辑推理提升,指令遵循提升、自动化生成prompt等基础能力构建。 2. 负责图文多模态模型的下游算法研究,例如:SFT微调(包括full和lora),强化学习,图搜增强,知识蒸馏等技术方向; 3. 负责图文多模态技术在AI搜索、图片结构化提取、图片内容理解等应用领域云侧和端侧的研究和落地工作; 4. 跟进业界前沿技术,持续提升团队在图文多模态算法方面的技术积累。
更新于 2025-09-16