滴滴高级数据研发工程师(J250624029)
社招全职3年以上技术地点:北京状态:招聘
任职要求
1. 计算机、数学等相关专业本科及以上学历; 2. 三年以上数据开发工作经验,深入理解常用的数据建模理论,可独立把控数据仓库各层级的设计; 3. 熟悉Hadoop生态,精通Hdfs、Hive、MR开发,熟悉Spark、Presto、Hbase,有任务调优经验、实时开发经验、数据治理经验; 4. 具备较强的编程能力和编程经验,至少熟悉Java/Python/cala一门编程语言; 5. 具备复杂业务的需求梳理能力,较强的结构化思维能力和问题分析能力,良好的沟通能力及团队协作精神 6.具有技术规划能力,较强自驱力和责任感,面对复杂问题能攻关拿结果;
工作职责
1. 负责滴滴核心业务的数据仓库搭建及开发, 进行完整的数仓建模并持续优化,包括数据生产、数据加工、数据应用及治理; 2. 负责抽象核心业务流程,沉淀业务通用分析框架,开发数仓中间层和数据应用产品; 3. 负责数据开发的流程与代码的规范性及优化,不断完善数据治理体系,持续提升数仓建设的质量和效率。
包括英文材料
学历+
数据仓库+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
HDFS+
https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html
The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware.
https://www.ibm.com/cn-zh/think/topics/hdfs
Hadoop 分布式文件系统 (HDFS) 是一种管理大型数据集的文件系统,可在商用硬件上运行。
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
MapReduce+
https://www.youtube.com/watch?v=bcjSe0xCHbE
https://www.youtube.com/watch?v=cHGaQz0E7AU
In this video I explain the basics of Map Reduce model, an important concept for any software engineer to be aware of.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Presto+
[英文] What is Presto?
https://prestodb.io/what-is-presto/
https://www.tutorialspoint.com/apache_presto/index.htm
HBase+
[英文] HBase Tutorial
https://www.tutorialspoint.com/hbase/index.htm
HBase is a data model that is similar to Google's big table designed to provide quick random access to huge amounts of structured data. This tutorial provides an introduction to HBase, the procedures to set up HBase on Hadoop File Systems, and ways to interact with HBase shell.
数据治理+
https://www.ibm.com/think/topics/data-governance
Data governance is the data management discipline that focuses on the quality, security and availability of an organization’s data.
https://www.youtube.com/watch?v=uPsUjKLHLAg
Building data fabric eliminates the technological complexities of data governance so users can connect to the right data at the right time, regardless of where it resides.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
相关职位
社招2年以上技术
1、负责滴滴国际化外卖订单收银、财税方向的需求开发,在充分理解订单收银、财税业务的基础上进行需求分析、设计、开发、上线等工作; 2、熟悉所负责的业务及系统,分析和发现系统的优化点,推动产品性能、稳定性、易用性提升,持续进行系统优化及架构升级; 3、技术文档撰写、维护、团队知识传承;
更新于 2025-09-08
社招3年以上技术
1.参与ERP数仓的开发、维护、优化及相关技术支持工作,以及风控&财务等数据体系建设; 2.负责数据仓库ETL流程的优化及解决相关技术问题; 3.参与数据产品设计和评审,保障数据平台架构稳定; 4.为日常项目中需求提供数据支持,并且在一定程度上给予评估和建议。
更新于 2025-05-27
社招5年以上技术
1. 负责滴滴核心业务的数据仓库搭建及开发, 进行完整的数仓建模并持续优化,包括数据生产、数据加工、数据应用及治理; 2. 负责抽象核心业务流程,沉淀业务通用分析框架,开发数仓中间层和数据应用产品; 3. 负责数据开发的流程与代码的规范性及优化,不断完善数据治理体系,持续提升数仓建设的质量和效率。
更新于 2025-06-05