高德地图高德-数据工程工程师/专家-信息工程团队
社招全职5年以上技术类-开发地点:北京状态:招聘
任职要求
- 5年以上大数据平台研发工作经验,具备扎实的计算机技术能力 - 熟悉大数据生态体系技术,掌握Hive、Spark、Flink等主流离线、实时存储/计算引擎及原理,具备良好的SQL性能调优能力; - 熟悉Java体系架构,熟悉常见数据结构、多线程并…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
- 负责设计高效、可扩展的大数据处理系统架构,确保能够支持海量数据存储与分析。 - 参与或主导重构知识图谱、画像平台的建设工作,利用大数据技术为不同业务场景提供高效准确的用户理解、供给理解能力。 - 负责数据管理等平台研发,包括数据管理、数据分发等,用工具化、平台化的能力协助业务提高数据流转效率;
包括英文材料
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
性能调优+
https://goperf.dev/
The Go App Optimization Guide is a series of in-depth, technical articles for developers who want to get more performance out of their Go code without relying on guesswork or cargo cult patterns.
https://web.dev/learn/performance
This course is designed for those new to web performance, a vital aspect of the user experience.
https://www.ibm.com/think/insights/application-performance-optimization
Application performance is not just a simple concern for most organizations; it’s a critical factor in their business’s success.
https://www.oreilly.com/library/view/optimizing-java/9781492039259/
Performance tuning is an experimental science, but that doesn’t mean engineers should resort to guesswork and folklore to get the job done.
还有更多 •••
相关职位
社招技术类-开发
1、负责搜索、展示广告系统的业务分析与实施,沉淀业务解决方案,打造简单、可依赖的商业产品基础设施 2、负责广告投放引擎的架构设计和优化,打造高可用、高性能、高迭代效率的广告在线投放系统,同时保证系统稳定性 3、负责广告检索、策略、召回、排序等技术方向的设计、实现和优化,用技术创新驱动业务发展 4、分析现有系统不足,找到目前系统的瓶颈,改进、提高系统性能
更新于 2025-07-16北京
社招3年以上技术类-开发
1.设计并构建高效、可扩展的大数据处理与管理系统,支撑海量数据的实时/离线存储、检索、流转需求; 2.构建或重构高性能分布式平台(如知识图谱平台、实时分发平台),通过平台化、工具化提升信息分发效率(解决用户/供给理解、及时/准确性等关键问题); 3.对接业务需求,提供数据驱动的技术解决方案和平台能力,抽象通用逻辑,降低数据生产与流转成本; 4.编写技术文档与规范,主导核心模块开发与代码评审,推动团队技术分享与提升。
更新于 2025-08-14北京
社招3年以上技术类-数据
1、负责高德广告业务的离线与实时数据仓库的构建和数据设计,包括数据模型设计、ETL研发、ETL性能优化。 2、负责高德广告业务流量、供给、客资、运营、结算等数据主题建设;能够深入理解业务并合理抽象,解决业务痛点问题,不断提升用数能力和数据分析能力,发挥数据价值。 3、负责高德全域数据的广告归因建设,持续优化归因策略,确保平台流量价值。 4、提升高德数仓团队数据架构能力和数据治理能力,持续提升需求支持效率、数据研发效率、提升数据质量、降低数据使用成本。
更新于 2025-07-23北京