Xiaomi Large Model Algorithm Engineer Intern
Internship | Part-time | Location: Beijing | Status: Hiring
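Requirements
1. Bachelor's degree or above, graduate students preferred; majors in computer science, pattern recognition, artificial intelligence, or related fields preferred.
2. Hands-on experience in natural language processing, knowledge graphs, or intelligent question answering preferred.
3. Familiarity with at least one programming language, including but not limited to C/C++/Java/Python; familiarity with common deep learning frameworks (such as PyTorch and TensorFlow); familiarity with large-model techniques such as pretraining, SFT, and RLHF.
4. Rigorous and dependable working style, strong sense of responsibility, good communication skills, and team spirit.
5. Candidates who have published high-quality papers at international conferences such as ACL, EMNLP, ICML, and NeurIPS (formerly NIPS) are preferred; those who have achieved good results in well-known international competitions are preferred; those who have released or contributed to popular open-source projects on GitHub are preferred.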
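Responsibilities
Model training and optimization
1. Design and optimize algorithms for question-answering and dialogue systems.
2. Research cutting-edge natural language processing algorithms and explore their practical application, including large-model pretraining and post-training.
3. Improve the question-answering and dialogue experience of Xiao AI (小爱同学), design solutions to real-world problems, and improve product quality.
Algorithm research and innovation
Track cutting-edge research in the large-model field, apply the resulting techniques to Xiao AI, and write up the work as academic papers for publication at conferences and in journals.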
Includes English-language materials
Education
Pattern recognition
https://www.mathworks.com/discovery/pattern-recognition.html
Pattern recognition is the process of classifying input data into objects, classes, or categories using computer algorithms based on key features or regularities.
https://www.microsoft.com/en-us/research/wp-content/uploads/2006/01/Bishop-Pattern-Recognition-and-Machine-Learning-2006.pdf
Pattern recognition has its origins in engineering, whereas machine learning grew out of computer science.
NLP
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
C
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
This handbook follows the 80/20 rule: you will learn 80% of the C programming language in 20% of the time.
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Java
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, beginner-friendly, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction to all of the core concepts in Python.
Deep learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
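PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.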
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
Large models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
SFT
https://cameronrwolfe.substack.com/p/understanding-and-using-supervised
Understanding how SFT works from the idea to a working implementation...
ICML
https://icml.cc/
GitHub
[English] GitHub Learn
https://learn.github.com/
Discover a wide range of beginner-friendly tutorials, hands-on learning, and expert-led lessons.
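Related positions
Internship
Develop and optimize large models and other machine learning models for Xiaomi's automotive business and its production processes; drive the adoption of these algorithms in real business scenarios; ensure the models' efficiency and scalability; take part in the team's algorithm innovation and technical problem-solving.
Updated 2025-05-20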
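Internship
1. Take part in the NLP team's R&D work, supporting the day-to-day operation and performance optimization of the Xiao AI dialogue system; 2. Implement and optimize NLP algorithms to improve the product's performance and accuracy in natural language processing; 3. Design and develop tasks in natural language processing, text mining, large language models, and related areas; 4. Follow the latest academic progress and keep up with cutting-edge NLP techniques.
Updated 2025-06-17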
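Internship
1. Take part in the NLP team's R&D work, supporting the day-to-day operation and performance optimization of the Xiao AI dialogue system; 2. Implement and optimize NLP algorithms to improve the product's performance and accuracy in natural language processing; 3. Design and develop tasks in natural language processing, text mining, large language models, and related areas; 4. Follow the latest academic progress and keep up with cutting-edge NLP techniques.
Updated 2025-07-23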