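DiDi Senior Data R&D Engineer (J250911023)
Experienced hire · Full-time · 5-7 years · Technology · Location: Beijing · Status: Hiring
Qualifications
1. Bachelor's degree or above in computer science or a related field; 5-7 years of experience building data systems at an internet company preferred.
2. Deep understanding of common data modeling theory, with the ability to design data architecture and build models for a data domain.
3. Familiar with the Hadoop ecosystem; proficient in HDFS, Spark, StarRocks, and Flink; Iceberg experience preferred.
4. Familiar with data governance work, with hands-on experience in data stability.
5. Strong problem analysis and problem-solving skills; experience building data for core business scenarios with large data volumes, such as traffic, marketing, and transactions, preferred.
6. Good communication and coordination skills and data project management skills; English listening and speaking ability preferred.
Responsibilities
1. Develop against data requirements for DiDi's international food delivery business and build the corresponding business data middle platform infrastructure.
2. Participate in the standardized rollout of the department's data governance framework and ensure data stability.
3. Participate in the department's exploration of new technologies and in the formulation and implementation of architecture upgrade plans.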
Includes English-language materials
Education+
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
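Hadoop provides reliable, scalable application-layer computing and storage support for large computer clusters. It allows large datasets to be processed in a distributed manner across clusters of computers using simple programming models, and it scales from a single machine to thousands of machines.
[English] Hadoop Tutorial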
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework for storing and processing big data in a distributed environment across clusters of computers using simple programming models.
HDFS+
https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html
The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware.
https://www.ibm.com/cn-zh/think/topics/hdfs
The Hadoop Distributed File System (HDFS) is a file system that manages large datasets and can run on commodity hardware.
Spark+
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
StarRocks+
https://docs.starrocks.io/docs/quick_start/
These Quick Start guides will help you get going with a small StarRocks environment.
https://itnext.io/introduction-to-starrocks-a-new-modern-analytical-database-1db2177d26e1
Recently, I had the opportunity to explore StarRocks, the new kid on the block among massive-scale databases that can handle petabytes of data.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Iceberg+
https://iceberg.apache.org/spark-quickstart/
This guide will get you up and running with Apache Iceberg™ using Apache Spark™, including sample code to highlight some powerful features.
https://www.baeldung.com/apache-iceberg-intro
This tutorial will discuss Apache Iceberg, a popular open table format in today’s big data landscape.
https://www.youtube.com/watch?v=TsmhRZElPvM
You’ve probably heard about Apache Iceberg™—after all, it’s been getting a lot of buzz.
Data Governance+
https://www.ibm.com/think/topics/data-governance
Data governance is the data management discipline that focuses on the quality, security and availability of an organization’s data.
https://www.youtube.com/watch?v=uPsUjKLHLAg
Building data fabric eliminates the technological complexities of data governance so users can connect to the right data at the right time, regardless of where it resides.
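Related Positions
Experienced hire · Technology
1. Participate in the systematic build-out of data security and compliance across the full workflow, including field data collection, in-office data production, and data application.
2. Participate in the design and development of map business, data, and platform services.
3. Participate in the design of map specifications, production processes, and related work.
Updated 2025-09-01
Experienced hire · 5-7 years · Technology
1. Build the business security data domain end to end and set up the layered data framework.
2. Develop offline and real-time security features, providing fast, stable data services for security risk-control strategies.
3. Plan, design, and implement the online and offline security data systems, providing efficient data support for security risk-control strategies.
Updated 2025-06-20
Experienced hire · 5-7 years · Technology
1. Build the data domain end to end for DiDi's international ride-hailing business.
2. Optimize the data warehouse ETL processes and resolve related technical issues.
3. Handle data modeling for DiDi's core businesses and cube data development.
Updated 2025-07-22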