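Pinduoduo (拼多多) Search/Recommendation Algorithm Engineer (Recall Track)
Experienced hire · Full-time · Technical · Location: Shanghai · Status: Hiring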
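Qualifications
1. Bachelor's degree or above in computer science, pattern recognition, mathematics, statistics, or a related field.
2. Familiar with the Linux development environment; proficient in at least one programming language such as Java, Python, or C++; proficient in at least one deep learning framework such as TensorFlow, PyTorch, Caffe, or MXNet. Experience with big data computing platforms such as Hadoop or Spark is a plus.
3. Familiar with common algorithms in machine learning, deep learning, recommendation, vector retrieval, computer vision, and natural language processing.
4. Research or hands-on experience in large-scale data mining, machine learning, reinforcement learning, natural language processing, recommendation and search systems, or computational advertising is a plus.
5. Excellent analytical and problem-solving skills, and good communication and collaboration skills.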
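Responsibilities
1. Optimize search/recommendation recall algorithms, and drive scenario-specific technical solutions through to production. Algorithms include but are not limited to: vector retrieval; tree-, graph-, quantization-, and clustering-based retrieval schemes; graph neural networks; deep representation learning; contrastive learning; sample augmentation; ensemble learning; learning to rank; multi-task learning; reinforcement learning; transfer learning; and simulators.
2. Analyze massive user behavior sequences, build a deep understanding and representation of users' long- and short-term interests, and recall products precisely to improve platform efficiency and user experience.
3. Be familiar with common vector retrieval techniques, and design and optimize index types to fit business characteristics.
4. Design and optimize the fusion of recall channels and the multi-stage funnel, as well as the coarse-ranking CTR/CVR/LTR models.
5. Track frontier algorithms and, in light of Pinduoduo's business characteristics, explore applying them to search/recommendation recall in production.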
Includes English-language materials
Pattern recognition
https://www.mathworks.com/discovery/pattern-recognition.html
Pattern recognition is the process of classifying input data into objects, classes, or categories using computer algorithms based on key features or regularities.
https://www.microsoft.com/en-us/research/wp-content/uploads/2006/01/Bishop-Pattern-Recognition-and-Machine-Learning-2006.pdf
Pattern recognition has its origins in engineering, whereas machine learning grew out of computer science.
Linux
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Java
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese; free; starts from zero; complete examples; based on the latest Python 3 release.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
TensorFlow
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
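PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.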
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Deep learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
Hadoop
https://www.runoob.com/w3cnote/hadoop-tutorial.html
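Hadoop provides reliable, scalable application-layer computing and storage support for large computer clusters. It allows large datasets to be processed in a distributed manner across clusters of computers using simple programming models, and it scales from a single machine to several thousand machines.
[English] Hadoop Tutorial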
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Spark
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Big data
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Machine learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
OpenCV
https://learnopencv.com/getting-started-with-opencv/
At LearnOpenCV we are on a mission to educate the global workforce in computer vision and AI.
https://opencv.org/university/free-opencv-course/
This free OpenCV course will teach you how to manipulate images and videos, and detect objects and faces, among other exciting topics in just about 3 hours.
NLP
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
Data mining
https://www.youtube.com/watch?v=-bSkREem8dM
Database vs Data Warehouse vs Data Lake
https://www.youtube.com/watch?v=7rs0i-9nOjo
Reinforcement learning
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
MXNet
https://www.tutorialspoint.com/apache_mxnet/index.htm
Apache MXNet is a powerful deep learning framework that supports both symbolic and imperative programming.
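Related positions
Experienced hire · Technical
1. Own algorithm design and scenario-specific technical planning for search/recommendation recall, and land the solutions in production business systems. Algorithms include but are not limited to: vector retrieval; tree-, graph-, quantization-, and clustering-based retrieval schemes; graph neural networks; deep representation learning; contrastive learning; sample augmentation; ensemble learning; learning to rank; multi-task learning; reinforcement learning; transfer learning; and simulators.
2. Analyze massive user behavior sequences, build a deep understanding and representation of users' long- and short-term interests, and recall products precisely to improve platform efficiency and user experience.
3. Be familiar with common vector retrieval techniques, and design and optimize index types to fit business characteristics.
4. Design and optimize multi-channel recall fusion and the multi-stage funnel, as well as the coarse-ranking CTR/CVR/LTR models.
5. Have strong business sense: analyze data independently to uncover problems, propose innovative solutions grounded in the actual business situation, and land them to deliver business gains.
6. Track frontier recall algorithms and, in light of Pinduoduo's business characteristics, design practical plans for bringing them into production and deliver business gains.
Updated 2025-09-15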
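Experienced hire · 2+ years · Channels (视频号) short video · Technical
1. Optimize recommendation technology for the Channels (视频号) recommendation business, focusing on coarse ranking and recall.
2. Optimize business strategies for Channels content distribution, including but not limited to red-dot notifications, local content, following, trending topics, low-activity users, cold start, and creators.
Updated 2025-06-23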
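Experienced hire · 5+ years · Core Local Commerce-业
Restaurant search recall track, covering food delivery and dine-in businesses. Specific responsibilities:
1. Contribute to algorithm and strategy R&D for recall models across search, recommendation, and advertising.
2. Iterate on recall models for search advertising scenarios, using massive user and merchant data from both the Meituan and Dianping platforms.
3. Take part in exploring and researching LLM-based generative recall.
4. Optimize the full system pipeline end to end.
Updated 2025-04-23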