小米AI算法工程师实习生
实习兼职地点:上海状态:招聘
任职要求
1. 掌握NLP主流大模型,如GPT3/T5/PaLM/LLaMA/GLM等的原理,并对差异有深入的理解; 2. 熟练掌握主流深度学习框架pytorch/tensorflow,大模型训练框架Megatron-LM/Deepspeed等多机多卡方案,有大模型训练和项目经验优先; 3. 有良好的代码开发能力,有开源项目开发经验优先。 4、熟练掌握Python/Shell编程语言,熟悉Linux、容器环境的搭建和维护,Nginx、ES、K8S的运维经验为加分项; 5、善于沟通,有良好的团队协作精神和责任心; 6、对技术有追求,有研究精神,能独立分析并解决问题。 7、需长期实习,六个月以上
工作职责
工作内容:语言大模型技术的研究和应用 负责大模型方向技术的研究和应用,构建汽车垂域智能系统,工作内容包括以下三个方向: a. 负责追踪大模型学术进展,预研前沿技术问题,打造行业领先技术影响力 b. 负责根据业务场景,抽象技术问题,预研和实现大模型技术,赋能整车业务 技术方向包括但不限于: 1. 模型架构、预训练方法、高效微调、模型评估及迭代 2. 知识增强、工具增强、AI Agent、RAG 3. 安全、时效性等问题解决 4. 训练和推理加速、模型量化、端测落地 5. 大模型核心能力提升,包括理解能力、知识能力、推理能力、生成能力和语言能力等 6、制定相关规范并形成项目管理文档。
包括英文材料
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Llama+
https://github.com/LlamaFamily/Llama-Chinese
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用。
https://www.llama.com/docs/overview/
This guide provides information and resources to help you set up Llama including how to access the model, hosting, how-to and integration guides.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
Megatron+
https://www.youtube.com/watch?v=hc0u4avAkuM
DeepSpeed+
https://www.youtube.com/watch?v=pDGI668pNg0
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Bash+
[英文] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Nginx+
[英文] Beginner’s Guide
https://nginx.org/en/docs/beginners_guide.html
This guide gives a basic introduction to nginx and describes some simple tasks that can be done with it.
https://www.youtube.com/watch?v=9t9Mp0BGnyI
NGINX is open-source web server software used for reverse proxy, load balancing, and caching. It's important to understand, especially if you are a backend developer.
ElasticSearch+
https://www.youtube.com/watch?v=a4HBKEda_F8
Learn about Elasticsearch with this comprehensive course designed for beginners, featuring both theoretical concepts and hands-on applications using Python (though applicable to any programming language). The course is structured in two parts: first covering essential Elasticsearch fundamentals including index management, document storage, text analysis, pipeline creation, search functionality, and advanced features like semantic search and embeddings; followed by a practical section where you'll build a real-world website using Elasticsearch as a search engine, working with the Astronomy Picture of the Day (APOD) dataset to implement features such as data cleaning pipelines, tokenization, pagination, and aggregations.
Kubernetes+
https://kubernetes.io/docs/tutorials/kubernetes-basics/
This tutorial provides a walkthrough of the basics of the Kubernetes cluster orchestration system.
https://kubernetes.io/zh-cn/docs/tutorials/kubernetes-basics/
本教程介绍 Kubernetes 集群编排系统的基础知识。每个模块包含关于 Kubernetes 主要特性和概念的一些背景信息,还包括一个在线教程供你学习。
https://www.youtube.com/watch?v=s_o8dwzRlu4
Hands-On Kubernetes Tutorial | Learn Kubernetes in 1 Hour - Kubernetes Course for Beginners
https://www.youtube.com/watch?v=X48VuDVv0do
Full Kubernetes Tutorial | Kubernetes Course | Hands-on course with a lot of demos
相关职位
实习信息技术类
岗位职责: 1.在芯片设计、验证、制造、测试环节中定义 AI 算法能解决的业务边界,为业务赋能 ; 2.负责CV/多模态/AIGC相关模型优化和迭代,包含图像理解、生成等前沿技术; 3.评估先进的AI模型,包括模型的训练,技术指标的评测和算法的调优工作; 4.跟踪算法发展的新趋势和新技术,并能不断应用于算法开发过程中,优化算法效果。
更新于 2025-10-17
实习
1. 汽车业务:协助算法工程师完成汽车能耗、运动控制相关的算法研究与开发工作,将行业内相关的前沿技术和最新研究成果应用到实际业务中 2. 大家电业务:协助算法工程师完成空调、冰箱、洗衣机相关的AI算法研究和开发工作,将行业内前沿智能控制算法和大模型技术应用到实际业务中 3. 参与能耗预测、温度预测等时序预测问题的解决,合理构建特征,选择模型,实现精准预测 4. 参与智能控制算法、大模型的研究与验证,协助优化算法模型 5. 协助算法工程师完成上述相关算法的端侧部署、测试,支持量产交付
更新于 2025-07-01
实习
1. 负责执行扫地机图像、声学等数据的采集、标注、质量验证等工作; 2. 负责执行针对传感器数据(Camera/LiDAR/Radar)、融合数据、算法输出等维度的测试用例,并编写自动化验证工具,对研发的相关成果进行快速、全面验证; 3. 负责执行研发自测工作
更新于 2025-10-14