小鹏汽车大模型算法资深工程师/专家
社招全职地点:北京状态:招聘
任职要求
1. 计算机相关专业硕士及以上学历,熟练掌握自然语言处理、深度学习、强化学习的基础理论和方法。 2. 具有扎实的的编程能力,熟练掌握至少一门编程语言(C/C++/Python/Java),熟悉TensorFlow/Pytorch/Keras等深度学习框架。 3. 熟悉Transformer/GPT系列/LLaMA/GLM等预训练模型,对模型训练和应用有一定理解。 4. 了解DeepSpeed、Megatron等分布式训练框架,有一定多机多卡分布式训练与debug经验。 5. 较强的技术攻关能力,能够跟进领域内最新的技术研究成果,结合实际应用场景快速实验和落地。 6. 有对话、多模态领域比赛或者ACL、EMNLP、AAAI等相关顶会论文者优先。
工作职责
1. 参与团队预训练基座大模型的研发,包括预训练,后训练,指令微调,对齐等方向; 2. 负责以大语言模型为核心的对话感知与交互,根据业务需求优化模型,提升业务效果; 3. 负责跟踪和探索大语言模型的前沿问题,结合实际场景,参与前沿算法和应用的研究和专利、论文撰写。
包括英文材料
学历+
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Keras+
https://keras.io/getting_started/intro_to_keras_for_engineers/
Keras 3 is a deep learning framework works with TensorFlow, JAX, and PyTorch interchangeably.
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
GPT+
https://www.youtube.com/watch?v=kCc8FmEb1nY
We build a Generatively Pretrained Transformer (GPT), following the paper "Attention is All You Need" and OpenAI's GPT-2 / GPT-3.
Llama+
https://github.com/LlamaFamily/Llama-Chinese
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用。
https://www.llama.com/docs/overview/
This guide provides information and resources to help you set up Llama including how to access the model, hosting, how-to and integration guides.
相关职位
社招
1,算法开发与优化: 负责自动驾驶模型算法的研发设计,包括但不限于行为决策、轨迹生成、运动规划等模块的深度学习/强化学习模型设计 探索基于Transformer、模仿学习(Imitation Learning)、强化学习(RL)等前沿技术的模型算法设计、应用方案 优化自动驾驶算法的实时性、安全性和舒适性,解决复杂场景(如拥堵、交互博弈、长尾问题)下的规划挑战 2,数据驱动迭代: 构建和利用大规模驾驶数据集(仿真+真实数据),设计数据闭环 pipeline 提升规划性能 参与数据标注、场景挖掘、仿真测试等环节,推动算法迭代 3,系统集成与部署: 与感知、控制等模块团队协作,实现模型算法在车载计算平台的部署 支持实车测试,分析问题并提出改进方案。 4,前沿技术跟踪: 跟进学术界(如CVPR、ICRA、CoRL)和工业界最新进展,将创新技术落地到量产或研发项目中
更新于 2025-06-30
社招A227584A
1、负责大模型算法在边缘计算场景的落地; 2、参与项目建设中的数据建设、指令微调、偏好对齐、模型优化; 3、跟踪调研大模型以及相关方向(包括但不限于NLP/CV/多模态/具身智能)的前沿技术; 4、深入研究模型在未来生活中的更多使用场景,探索边缘计算与大模型的结合点,拓展模型在边缘计算的应用范围。
更新于 2024-02-23
社招A68130
1、负责大模型算法在边缘计算场景的落地; 2、参与项目建设中的数据建设、指令微调、偏好对齐、模型优化; 3、跟踪调研大模型以及相关方向(包括但不限于NLP/CV/多模态/具身智能)的前沿技术; 4、深入研究模型在未来生活中的更多使用场景,探索边缘计算与大模型的结合点,拓展模型在边缘计算的应用范围。
更新于 2024-02-01