
商汤大模型算法实习生
社招全职算法工程地点:深圳状态:招聘
任职要求
1.本科以上学历; 2.掌握常见的深度学习框架,掌握pytorch,python编程基础,熟悉并行计算加速,了解神经网络模型的工程部署; 3.熟悉各类语言大模型,了解大模型的训练、sft、RLHF的技巧,熟悉训练数据情况,了解知识库和rag,掌握agent训练、agent搭建,熟悉langchain的各种服务调用; 4.了解多模态大模型的工作,对语言和视觉模态对齐有相关经验; 5.熟悉深度学习数据处理流程和训练流程,有模型训练经验; 6.具有良好的沟通能力和团队合作精神,对计算机视觉、大语言模型有浓厚兴趣。
工作职责
1.预训练、微调语言大模型,follow前沿的相关算法,开展高水平和创新性的研究,保持算法在工业界和学术界的领先,参与顶会论文投稿及专利申请; 2.进行业务落地的语言大模型算法研究,特别是在垂直领域的应用; 3.负责开发语言大模型、agent等算法所需要的工具以及基础设施,实现算法部署与工程化、文档输出; 4.负责知识库框架搭建,RAG服务的维护; 5.负责后续算法性能优化等技术细节。
包括英文材料
学历+
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
SFT+
https://cameronrwolfe.substack.com/p/understanding-and-using-supervised
Understanding how SFT works from the idea to a working implementation...
RAG+
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
LangChain+
https://python.langchain.com/docs/tutorials/
New to LangChain or LLM app development in general? Read this material to quickly get up and running building your first applications.
https://www.freecodecamp.org/news/beginners-guide-to-langchain/
LangChain is a popular framework for creating LLM-powered apps.
OpenCV+
https://learnopencv.com/getting-started-with-opencv/
At LearnOpenCV we are on a mission to educate the global workforce in computer vision and AI.
https://opencv.org/university/free-opencv-course/
This free OpenCV course will teach you how to manipulate images and videos, and detect objects and faces, among other exciting topics in just about 3 hours.
相关职位
实习网易有道
参与前沿大模型算法的研发与落地应用,方向包括但不限于:智能 Agent、Deep Research、多模态大模型、检索增强生成 (RAG) 等; 紧跟领域最新技术动态,探索创新算法方法,并积极推动科研成果的产出; 参与技术方案讨论、算法设计与实现、模型训练与优化等研发工作,保证项目进度和研发质量; 持续学习和掌握最新的大模型相关技术,并应用于实际产品和项目中,解决实际问题。
更新于 2025-06-18
实习内容理解
工作职责: 1. 真实业界数据的处理分析:定性分析、定量评估数据质量、对数据采集和处理方案不断优化改进; 2. 模型开发:参与Qwen、Llama等开源LLM的训练微调、量化和部署实践,追踪业内前沿,达到领先的性能指标; 3. 结合小红书丰富的工业场景,根据实际业务需求进行技术落地和创新。
更新于 2025-09-23
实习大模型
1、探索下一代AI搜索范式,从底层模型架构和训练方式角度出发,研发AI搜索大模型,在推理速度,幻觉,回答准确性等方向进行突破; 2、探索新一代大语言模型基座架构,以高效推理模式为核心优化目标,探索全新模型结构和scaling law。 3、在工作中能快速成长,积极探索前沿技术,解决好业务中遇到的实际问题,完成数据处理、建模和工程上线,对AI技术始终保持热爱,实习期间可发表论文。
更新于 2025-08-19