滴滴高级算法工程师(J250609023)
社招全职技术地点:北京状态:招聘
任职要求
1、数学、计算机、通信等相关专业,本科以上学历; 2、熟练运用python/Java/C++至少一种语言、掌握Linux shell常用操作;有spark/hadoop/hive相关分布式大数据处理经验 3、有大模型PEFT相关算法经验,有Full Fine-Tuning 经验优先 4、有LBS相关领域研究和工作经验优先
工作职责
业务及团队介绍:POI(Point of Interest),即“兴趣点”,是地理信息系统中的重要概念,表示物理世界中的一处地方,可以是一家美食店、一个小区、一栋大楼等。POI是最重要的地图数据之一。 POI之间不是孤立的,存在各类关联关系;POI也不是一成不变的,随着时间POI也面临新增、下线、搬迁等状态变化;丰富的POI属性、多元的内容也可以辅助用户决策。 POI是用户出行起点和终点表达的最基础的数据,广泛应用到检索,上下车点推荐,周边推荐等。对于准确率,覆盖率和现势性等都有较高的要求。滴滴地图POI团队负责滴滴平台POI的建设,通过完善的数据生态,采用多元的技术方案,实时发现物理时间的变化,帮助用户更美好的出行。 岗位职责: 1、参与POI智能情报挖掘,包括POI时空语义及挖掘,多模态学习,用户行为建模,图表示学习,知识推理等 2、熟悉NLP算法,熟悉PPO、GRPO等强化学习算法 3、通过大模型、司乘反馈,提升已有情报挖掘能力,更快、更准的发现现实世界变化 4、追踪LLM/Agent的前沿技术,通过作业反馈优化、提升自动化作业准确率,提升自动化作业比例
包括英文材料
学历+
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Bash+
[英文] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
相关职位
社招5年以上A118837
1. 通过对海量车辆运行日志的深度解析,提取关键信息,包括车辆故障码、传感器数据、驾驶行为数据等,为故障诊断提供数据支持。 2. 运用数据挖掘技术,如聚类分析、关联规则挖掘等,发现车辆日志中的潜在模式和异常行为,提前预警潜在故障风险,为预防性维护提供依据。 3. 构建车辆故障诊断方案检索系统,基于车辆故障特征和历史维修记录,快速检索出与当前故障相似的诊断方案和维修案例,为诊断人员提供参考。 4. 运用大语言模型、机器学习算法,优化存量远程诊断案例方案推荐,针对存量方案库生成新的方案,提高诊断效率和准确性。
更新于 2025-05-26
社招5年以上A35523
1、负责端侧CV算法的研发和落地,包括但不限于目标检测、识别、跟踪等算法; 2、负责算法工程化,包括模型工程化和优化等工作; 3、负责端侧算法框架设计开发; 4、可能也会参与一部分多模态大模型相关的工作;
更新于 2025-04-02