美团机器学习引擎工程师
社招全职核心本地商业-业务研发平台地点:北京状态:招聘
任职要求
1. 计算机、自动化、电子信息或相关专业学士及以上学历; 2. 具备扎实的架构设计和代码开发能力; 3. 熟悉 C++/Java/Python 编程,熟练掌握多线程编程、性能调优、RPC 框架等技术; 4. 对计算机技术有较高的热情,能够积极主动的学习,有深度钻研和实践精神,对交付的code 高标准、严要求,做事严谨细心; 5. 较好的主动性和求知欲,良好的沟通协作能力 具备以下条件优先 1. 有深度学习框架开发经验的优先; 2. 有 Deep Models 在大规模线上生产环境落地经验者优先; 3. 对 Tensorflow/Pytorch/DeepSpeed/Megatron/vLLM/TVM/TensorRT 源码熟悉者优先; 4. 熟悉 GPU 编程者(如 CUDA C)优先
工作职责
1. 负责到家搜索推荐机器学习引擎的工程架构工作,包括 CTR/LLM 模型训练/推理优化、 用户特征平台建设等; 2. 建设面向多场景、高性能、可拓展的机器学习引擎,支撑外卖/闪购/医药/营销等场景的搜索推荐业务需求; 3. 持续优化工程架构,提升系统性能表现、算力规模以及迭代效率; 4. 调研业界前沿技术发展动态,结合业务实际情况,实现在业务的落地
包括英文材料
学历+
系统设计+
https://roadmap.sh/system-design
Everything you need to know about designing large scale systems.
https://www.youtube.com/watch?v=F2FmTdLtb_4
This complete system design tutorial covers scalability, reliability, data handling, and high-level architecture with clear explanations, real-world examples, and practical strategies.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
多线程+
https://liaoxuefeng.com/books/java/threading/basic/index.html
和单线程相比,多线程编程的特点在于:多线程经常需要读写共享数据,并且需要同步。
https://www.youtube.com/watch?v=_uQgGS_VIXM&list=PLsc-VaxfZl4do3Etp_xQ0aQBoC-x5BIgJ
https://www.youtube.com/watch?v=IEEhzQoKtQU
https://www.youtube.com/watch?v=mTGdtC9f4EU&list=PLL8woMHwr36EDxjUoCzboZjedsnhLP1j4
https://www.youtube.com/watch?v=TPVH_coGAQs&list=PLk6CEY9XxSIAeK-EAh3hB4fgNvYkYmghp
https://www.youtube.com/watch?v=xPqnoB2hjjA
This video is an introduction to multithreading in modern C++.
https://www.youtube.com/watch?v=YKBwKy5PrpQ
Rust threading is easy to implement and improves the efficiency of your applications on multi-core systems!
性能调优+
https://goperf.dev/
The Go App Optimization Guide is a series of in-depth, technical articles for developers who want to get more performance out of their Go code without relying on guesswork or cargo cult patterns.
https://web.dev/learn/performance
This course is designed for those new to web performance, a vital aspect of the user experience.
https://www.ibm.com/think/insights/application-performance-optimization
Application performance is not just a simple concern for most organizations; it’s a critical factor in their business’s success.
https://www.oreilly.com/library/view/optimizing-java/9781492039259/
Performance tuning is an experimental science, but that doesn’t mean engineers should resort to guesswork and folklore to get the job done.
RPC+
https://javaguide.cn/distributed-system/rpc/rpc-intro.html
为什么要 RPC ? 因为,两个不同的服务器上的服务提供的方法不在一个内存空间,所以,需要通过网络编程才能传递方法调用所需要的参数。并且,方法调用的结果也需要通过网络编程来接收。
https://www.youtube.com/watch?v=S2osKiqQG9s
This video is part of an 8-lecture series on distributed systems, given as part of the undergraduate computer science course at the University of Cambridge.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
DeepSpeed+
https://www.youtube.com/watch?v=pDGI668pNg0
Megatron+
https://www.youtube.com/watch?v=hc0u4avAkuM
vLLM+
https://www.newline.co/@zaoyang/ultimate-guide-to-vllm--aad8b65d
vLLM is a framework designed to make large language models faster, more efficient, and better suited for production environments.
https://www.youtube.com/watch?v=Ju2FrqIrdx0
vLLM is a cutting-edge serving engine designed for large language models (LLMs), offering unparalleled performance and efficiency for AI-driven applications.
TensorRT+
https://docs.nvidia.com/deeplearning/tensorrt/latest/getting-started/quick-start-guide.html
This TensorRT Quick Start Guide is a starting point for developers who want to try out the TensorRT SDK; specifically, it demonstrates how to quickly construct an application to run inference on a TensorRT engine.
CUDA+
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Lean how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
相关职位
社招2年以上核心本地商业-业
1.参与超大规模机器学习异构引擎架构设计与深度优化; 2.参与机器学习训练引擎的吞吐提升、推理引擎的深度优化。 3.参与生成式、大模型基础能力建设。
更新于 2025-06-05
社招核心本地商业-业
1. 负责到家搜索推荐机器学习引擎的工程架构工作,包括 CTR/LLM 模型训练/推理优化、 用户特征平台建设等; 2. 建设面向多场景、高性能、可拓展的机器学习引擎,支撑外卖/闪购/医药/营销等场景的搜索推荐业务需求; 3. 持续优化工程架构,提升系统性能表现、算力规模以及迭代效率; 4. 调研业界前沿技术发展动态,结合业务实际情况,实现在业务的落地
更新于 2025-04-17
校招自动车配送部
负责全公司机器学习基础设施的研发,包括大规模分布式训练架构、高性能推理服务、深度学习编译优化、GPU硬件加速、RDMA网络优化等,支撑公司所有机器学习相关业务,如:搜索、推荐、广告、图像、NLP、语音、自动驾驶等。
更新于 2025-02-26