DiDi Senior Data R&D Engineer (J250424022)
Experienced hire | Full-time | 3+ years | Technology | Location: Beijing | Status: Hiring
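Requirements
1. Deep understanding of common data modeling theory; able to independently own the design of every layer of the data warehouse.
2. Familiar with the big data ecosystem; proficient in HDFS and Hive development; familiar with Spark, Presto, ClickHouse, and StarRocks, with tuning experience.
3. Understanding of data governance, with hands-on experience in governance work and an appreciation of its importance.
4. Strong programming skills and experience; familiar with at least one of Java, Python, or Shell; familiar with Linux.
5. Three or more years of ETL development experience preferred.
6. Solid data analysis skills; able to quickly locate and resolve data-related issues; data-sensitive and curious, focused on discovering and realizing the value of data.
7. Able to sort out complex business requirements; strong product thinking and problem analysis skills; good communication, organization, and teamwork.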
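Responsibilities
1. Participate in the development, maintenance, and optimization of the ERP data warehouse and in related technical support, as well as in building data systems for risk control, finance, and related areas.
2. Own the optimization of data warehouse ETL processes and resolve related technical issues.
3. Participate in data product design and reviews, and keep the data platform architecture stable.
4. Provide data support for requirements in day-to-day projects, and offer assessments and recommendations where appropriate.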
Includes English-language materials
Data Warehouse+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
Big Data+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
HDFS+
https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html
The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware.
https://www.ibm.com/cn-zh/think/topics/hdfs
The Hadoop Distributed File System (HDFS) is a file system that manages large data sets and can run on commodity hardware.
Hive+
[English] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Presto+
[English] What is Presto?
https://prestodb.io/what-is-presto/
https://www.tutorialspoint.com/apache_presto/index.htm
ClickHouse+
[English] Advanced Tutorial
https://clickhouse.com/docs/tutorial
Learn how to ingest and query data in ClickHouse using the New York City taxi example dataset.
https://www.youtube.com/watch?v=FtoWGT7kS-c
ClickHouse is an open-source column-oriented DBMS for online analytical processing that allows users to generate analytical reports using SQL queries in real-time.
https://www.youtube.com/watch?v=Rhe-kUyrFUE&list=PL0Z2YDlm0b3gcY5R_MUo4fT5bPqUQ66ep
StarRocks+
https://docs.starrocks.io/docs/quick_start/
These Quick Start guides will help you get going with a small StarRocks environment.
https://itnext.io/introduction-to-starrocks-a-new-modern-analytical-database-1db2177d26e1
Recently, I had the opportunity to explore StarRocks which is the new kid in the block when talking about massive scale databases which are able to handle petabytes of data.
Data Governance+
https://www.ibm.com/think/topics/data-governance
Data governance is the data management discipline that focuses on the quality, security and availability of an organization’s data.
https://www.youtube.com/watch?v=uPsUjKLHLAg
Building data fabric eliminates the technological complexities of data governance so users can connect to the right data at the right time, regardless of where it resides.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, starts from zero, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction to all of the core concepts in Python.
Bash+
[English] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
ETL+
https://www.ibm.com/think/topics/etl
ETL—meaning extract, transform, load—is a data integration process that combines, cleans and organizes data from multiple sources into a single, consistent data set for storage in a data warehouse, data lake or other target system.
https://www.youtube.com/watch?v=OW5OgsLpDCQ
It explains what ETL is and what it can do for you to improve your data analysis and productivity.
Data Analysis+
[English] Data Analyst Roadmap
https://roadmap.sh/data-analyst
Step by step guide to becoming a Data Analyst in 2025
Related positions
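Experienced hire | 1+ years | Technology
1. Own the iteration and upgrading of user-facing systems for DiDi's international food delivery business.
2. Building on a thorough understanding of the business, carry out requirements analysis, design, development, and release, and drive the continuous evolution of the systems.
3. Participate in refactoring core business systems; study advanced industry technologies in the context of projects and keep improving technically.
Updated 2025-08-08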
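Experienced hire | Technology
1. Own the day-to-day development and maintenance of the core technical services for food delivery courier management, ensuring governance work is delivered efficiently and with high quality.
2. Thoroughly understand courier management; be able to derive the technical architecture from the business model, identify problems in existing systems, and propose optimization plans.
3. Learn and research new technologies and apply them in real projects to meet actual business needs.
Updated 2025-09-08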
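Experienced hire | 3+ years | Technology
1. Own the construction and development of the data warehouse for DiDi's core businesses; carry out complete data warehouse modeling and continuously optimize it, covering data production, data processing, data applications, and governance.
2. Abstract core business processes, build up reusable business analysis frameworks, and develop the data warehouse middle layer and data application products.
3. Own the standardization and optimization of data development processes and code, keep improving the data governance system, and continuously raise the quality and efficiency of data warehouse construction.
Updated 2025-06-27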