高德地图高德-数据工程工程师/专家-信息工程团队
社招全职5年以上技术类-开发地点:北京状态:招聘
任职要求
- 5年以上大数据平台研发工作经验,具备扎实的计算机技术能力 - 熟悉大数据生态体系技术,掌握Hive、Spark、Flink等主流离线、实时存储/计算引擎及原理,具备良好的SQL性能调优能力; - 熟悉Java体系架构,熟悉常见数据结构、多线程并发等,精通Spring、Springboot、MyBatis等主流服务框架,熟悉分布式系统设计; - 有过图谱系统、元数据管理、数据标签管理、画像平台等研发经验优先 - 有数据湖的应用落地经验者优先; - 具备良好的沟通能力、数据分析和业务理解能力;
工作职责
- 负责设计高效、可扩展的大数据处理系统架构,确保能够支持海量数据存储与分析。 - 参与或主导重构知识图谱、画像平台的建设工作,利用大数据技术为不同业务场景提供高效准确的用户理解、供给理解能力。 - 负责数据管理等平台研发,包括数据管理、数据分发等,用工具化、平台化的能力协助业务提高数据流转效率;
包括英文材料
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
性能调优+
https://goperf.dev/
The Go App Optimization Guide is a series of in-depth, technical articles for developers who want to get more performance out of their Go code without relying on guesswork or cargo cult patterns.
https://web.dev/learn/performance
This course is designed for those new to web performance, a vital aspect of the user experience.
https://www.ibm.com/think/insights/application-performance-optimization
Application performance is not just a simple concern for most organizations; it’s a critical factor in their business’s success.
https://www.oreilly.com/library/view/optimizing-java/9781492039259/
Performance tuning is an experimental science, but that doesn’t mean engineers should resort to guesswork and folklore to get the job done.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
多线程+
https://liaoxuefeng.com/books/java/threading/basic/index.html
和单线程相比,多线程编程的特点在于:多线程经常需要读写共享数据,并且需要同步。
https://www.youtube.com/watch?v=_uQgGS_VIXM&list=PLsc-VaxfZl4do3Etp_xQ0aQBoC-x5BIgJ
https://www.youtube.com/watch?v=IEEhzQoKtQU
https://www.youtube.com/watch?v=mTGdtC9f4EU&list=PLL8woMHwr36EDxjUoCzboZjedsnhLP1j4
https://www.youtube.com/watch?v=TPVH_coGAQs&list=PLk6CEY9XxSIAeK-EAh3hB4fgNvYkYmghp
https://www.youtube.com/watch?v=xPqnoB2hjjA
This video is an introduction to multithreading in modern C++.
https://www.youtube.com/watch?v=YKBwKy5PrpQ
Rust threading is easy to implement and improves the efficiency of your applications on multi-core systems!
Spring+
https://liaoxuefeng.com/books/java/spring/index.html
Spring是一个支持快速开发Java EE应用程序的框架。它提供了一系列底层容器和基础设施,并可以和大量常用的开源框架无缝集成,可以说是开发Java EE应用程序的必备。
https://spring.io/guides/gs/rest-service
https://spring.io/quickstart
Level up your Java code and explore what Spring can do for you.
MyBatis+
https://mybatis.org/mybatis-3/getting-started.html
https://www.baeldung.com/mybatis
MyBatis is an open source persistence framework which simplifies the implementation of database access in Java applications.
分布式系统+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
数据分析+
[英文] Data Analyst Roadmap
https://roadmap.sh/data-analyst
Step by step guide to becoming an Data Analyst in 2025
相关职位
社招技术类-开发
1、负责搜索、展示广告系统的业务分析与实施,沉淀业务解决方案,打造简单、可依赖的商业产品基础设施 2、负责广告投放引擎的架构设计和优化,打造高可用、高性能、高迭代效率的广告在线投放系统,同时保证系统稳定性 3、负责广告检索、策略、召回、排序等技术方向的设计、实现和优化,用技术创新驱动业务发展 4、分析现有系统不足,找到目前系统的瓶颈,改进、提高系统性能
更新于 2025-07-16
社招3年以上技术类-开发
1.设计并构建高效、可扩展的大数据处理与管理系统,支撑海量数据的实时/离线存储、检索、流转需求; 2.构建或重构高性能分布式平台(如知识图谱平台、实时分发平台),通过平台化、工具化提升信息分发效率(解决用户/供给理解、及时/准确性等关键问题); 3.对接业务需求,提供数据驱动的技术解决方案和平台能力,抽象通用逻辑,降低数据生产与流转成本; 4.编写技术文档与规范,主导核心模块开发与代码评审,推动团队技术分享与提升。
更新于 2025-08-14
社招3年以上技术类-数据
1、负责高德广告业务的离线与实时数据仓库的构建和数据设计,包括数据模型设计、ETL研发、ETL性能优化。 2、负责高德广告业务流量、供给、客资、运营、结算等数据主题建设;能够深入理解业务并合理抽象,解决业务痛点问题,不断提升用数能力和数据分析能力,发挥数据价值。 3、负责高德全域数据的广告归因建设,持续优化归因策略,确保平台流量价值。 4、提升高德数仓团队数据架构能力和数据治理能力,持续提升需求支持效率、数据研发效率、提升数据质量、降低数据使用成本。
更新于 2025-07-23