DiDi Data R&D Architect/Expert (J250821004)
Experienced hire · Full-time · Technology · Location: Beijing · Status: Hiring
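Requirements
1. Bachelor's degree or above in computer science or a related field; strong coding skills and solid technical fundamentals; proficient in at least one of Java, Go (golang), or C++.
2. Proficient with common online and offline big data components, including but not limited to mainstream compute frameworks such as Flink and Spark, and mainstream OLAP storage engines such as Doris and ClickHouse.
3. Solid data governance experience, with command of the methods and hands-on practice across the full data lifecycle: collection, construction, management, and use.
4. Experience with storage and computation for sample joining and feature extraction in high-TPS transactional scenarios; strong awareness of stability; some understanding of algorithm platforms and algorithm strategies.
5. Strong communication and collaboration skills across large teams, and a strong willingness to share knowledge.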
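Responsibilities
1. Build a unified strategy data middle layer that keeps online and offline strategy data pipelines consistent and highly reusable, improving the efficiency of strategy iteration and business analysis.
2. Continuously optimize and improve the data architecture for scenarios such as algorithm-strategy sample joining and feature extraction, strengthening the stability, scalability, and related capabilities of the data pipelines.
3. Work closely with the algorithm, strategy engineering, and data platform teams to define and deliver effective layered, domain-partitioned data management solutions and tools, keeping the data architecture sound over the long term and efficient to iterate on.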
Includes English-language materials
Education
Java
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Go
https://www.youtube.com/watch?v=8uiZC0l4Ajw
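A complete tutorial for learning Golang! Start to finish in under an hour, including a full demonstration of how to build an API in Go. No filler, only what you need to know.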
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Big Data
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Flink
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Spark
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Doris
https://doris.apache.org/docs/gettingStarted/what-is-apache-doris
ClickHouse
[English] Advanced Tutorial
https://clickhouse.com/docs/tutorial
Learn how to ingest and query data in ClickHouse using the New York City taxi example dataset.
https://www.youtube.com/watch?v=FtoWGT7kS-c
ClickHouse is an open-source column-oriented DBMS for online analytical processing that allows users to generate analytical reports using SQL queries in real-time.
https://www.youtube.com/watch?v=Rhe-kUyrFUE&list=PL0Z2YDlm0b3gcY5R_MUo4fT5bPqUQ66ep
OLAP
https://www.youtube.com/watch?v=iw-5kFzIdgY
OLAP (for online analytical processing) is software for performing multidimensional analysis at high speeds on large volumes of data from a data warehouse, data mart, or some other unified, centralized data store.
Data Governance
https://www.ibm.com/think/topics/data-governance
Data governance is the data management discipline that focuses on the quality, security and availability of an organization’s data.
https://www.youtube.com/watch?v=uPsUjKLHLAg
Building data fabric eliminates the technological complexities of data governance so users can connect to the right data at the right time, regardless of where it resides.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
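Related positions
Experienced hire D11746
1. Take part in building the next-generation data analytics engine for Kuaishou's data platform, supporting very large volumes of business data and providing a unified, high-performance solution.
2. Take on the design and implementation complexity of big data platform systems, analyze and identify optimization opportunities, and drive improvements in the system's soundness, reliability, and availability.
3. Stay in contact with the open-source community, bringing in features and systems that help the company's business scenarios, or contributing internally developed features back to the community.
Updated 2025-03-07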
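Experienced hire U1064
1. Develop Volcano Engine's platform architecture engineering systems, including requirements analysis, system design, implementation, and testing.
2. Develop the shared components and products that Volcano Engine's cloud services depend on, keeping the cloud services running efficiently and continuously iterating on and upgrading the technology.
3. Build Volcano Engine's stability platform, including the design and development of monitoring, alerting, troubleshooting, and recovery features.
4. Take part in Volcano Engine's technical design discussions and decisions, driving continuous optimization and improvement of the cloud service architecture.
Updated 2022-10-18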