百度大模型算法实习生(J92492)
实习兼职MEG地点:北京状态:招聘
任职要求
- 具有一个或多个领域的理论背景和实践经验:机器学习/数据挖掘/深度学习/信息检索/自然语言处理 - 熟悉C/C++语言编程,熟练掌握linux平台、shell或python脚本编程。 熟练使用常用算法和数据结构 - 热爱互联网,有强烈的上进心和积极的工作态度,良好的自我驱动能力,较好的团队合作精神,善于学习和运用新知识 - 有算法类竞赛奖或发表过学术论文者优先
工作职责
- 模型能力优化:面向百度搜索业务进行基础模型优化,包括但不限于 MoE 稀疏化策略、预训练任务设计、Post-Training 任务优化,基于强化学习的推理能力增强等工作 - 模型成本优化:通过优化 MoE 路由策略、Latent Attention 结构优化等手段,提升推理效率,降低部署成本 - 算法研究与创新:组内工作发表于 ACL/NeurIPS/ICLR 等会议,积极跟踪学术界和工业界技术进展,设计并实现相关算法创新,推动搜索基础模型的持续进步 - 跨团队协作:与数据科学家、架构工程师等跨职能团队紧密合作,确保算法方案能够有效落地
包括英文材料
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
数据挖掘+
https://www.youtube.com/watch?v=-bSkREem8dM
Database vs Data Warehouse vs Data Lake
https://www.youtube.com/watch?v=7rs0i-9nOjo
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
信息检索+
https://nlp.stanford.edu/IR-book/information-retrieval-book.html
Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008.
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Bash+
[英文] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
脚本+
[英文] Scripting language
https://en.wikipedia.org/wiki/Scripting_language
https://zhuanlan.zhihu.com/p/571097954
一个脚本通常是解释执行而非编译。脚本语言通常都有简单、易学、易用的特性,目的就是希望能让程序员快速完成程序的编写工作。
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
相关职位
实习网易有道
参与前沿大模型算法的研发与落地应用,方向包括但不限于:智能 Agent、Deep Research、多模态大模型、检索增强生成 (RAG) 等; 紧跟领域最新技术动态,探索创新算法方法,并积极推动科研成果的产出; 参与技术方案讨论、算法设计与实现、模型训练与优化等研发工作,保证项目进度和研发质量; 持续学习和掌握最新的大模型相关技术,并应用于实际产品和项目中,解决实际问题。
更新于 2025-06-18
实习内容理解
工作职责: 1. 真实业界数据的处理分析:定性分析、定量评估数据质量、对数据采集和处理方案不断优化改进; 2. 模型开发:参与Qwen、Llama等开源LLM的训练微调、量化和部署实践,追踪业内前沿,达到领先的性能指标; 3. 结合小红书丰富的工业场景,根据实际业务需求进行技术落地和创新。
更新于 2025-09-23
实习大模型
1、探索下一代AI搜索范式,从底层模型架构和训练方式角度出发,研发AI搜索大模型,在推理速度,幻觉,回答准确性等方向进行突破; 2、探索新一代大语言模型基座架构,以高效推理模式为核心优化目标,探索全新模型结构和scaling law。 3、在工作中能快速成长,积极探索前沿技术,解决好业务中遇到的实际问题,完成数据处理、建模和工程上线,对AI技术始终保持热爱,实习期间可发表论文。
更新于 2025-08-19