京东数据开发工程师(广告)
社招全职2年以上数据开发岗地点:海南状态:招聘
任职要求
1.计算机或相关专业本科及以上学历,2年以上大数据研发经验;有广告/搜推数据体系建设、PB级数据处理与调优经验者优先; 2.精通Java/Python/Scala至少一门语言;深入理解Spark/Flink/Hive/Iceberg等大数据生态,具备线上调优与问题排查能力;掌握ClickHouse/Doris等OLAP引擎原理与复杂SQL性能优化;具备数据分层、维度建模、数仓架构设计及流批一体…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1.广告数据体系建设:负责京东广告实时与离线数据全链路建设,涵盖数据接入、流批处理及OLAP分析,构建高可用、低成本、易扩展的数仓体系,持续提升数据资产质量,支撑广告策略、算法模型与投放决策等多样化用数需求; 2.高性能数据链路优化:面向海量广告数据,持续突破计算效率、存储成本与查询延迟,通过架构升级、算力优化打造低延迟、高可用的数据链路,核心支撑效果分析、实时出价等关键业务场景; 3.AI驱动的数据智能:探索AI在智能ETL、自动化建模及数据质量监控与诊断体系等数据工程场景的落地,沉淀通用工具与平台能力,提升研发效能与数据智能化水平,为算法与业务决策提供智能数据支持。
包括英文材料
学历+
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Scala+
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
还有更多 •••