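JD.com Data Development Lead
Experienced hire · Full-time · Software development (team lead) · Location: Beijing · Status: Hiring
Requirements
1. Degree in computer science or a related field; able to understand the business accurately and comprehensively; extensive architecture design experience.
2. Solid computer science fundamentals; hands-on experience with common middleware (storage, caching, message queues, big data, etc.).
3. Solid experience in data architecture and data engineering, including real-time/offline data warehouse construction, data modeling, quality management, and performance tuning.
4. Experience building complex distributed systems, with extensive practical experience in service high availability and high-concurrency performance tuning.
5. Familiar with big data frameworks and tools such as Flink, Spark, Hive, HDFS, and Kafka.
6. Familiar with common back-end frameworks such as Spring Boot, MyBatis, Spring Cloud, and Dubbo.
7. Familiar with common data storage services such as MySQL, Redis, TiDB/OceanBase, and Doris/ClickHouse.
8. Experience managing a team of 20 or more people; good communication skills, team spirit, and collaboration skills; a strong sense of responsibility and strong execution.
Bonus points:
1. Data engineering experience in the commercial advertising industry.
2. Knowledge of data query frameworks such as Presto and Calcite.
3. Experience putting data-related AI agents into practice.
Alignment with JD.com values: customer first, innovation, striving, responsibility, gratitude, integrity.
Responsibilities
1. Lead product research and architecture design for commercial advertising data processing and data applications.
2. Tackle the key and difficult technical challenges in data-related business; lead development of core data processing frameworks and important components.
3. Analyze and uncover shortcomings in existing systems, locate bottlenecks, and improve system performance, stability, and business extensibility.
4. Continuously influence the team in product awareness, technical understanding, and ways of thinking.
5. Lead cross-department collaboration and the research, design, coordination, implementation, and delivery of complex features.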
Includes English-language materials
System design
https://roadmap.sh/system-design
Everything you need to know about designing large scale systems.
https://www.youtube.com/watch?v=F2FmTdLtb_4
This complete system design tutorial covers scalability, reliability, data handling, and high-level architecture with clear explanations, real-world examples, and practical strategies.
Middleware
https://www.youtube.com/watch?v=1oWPUpMheGk
Caching
https://hackernoon.com/the-system-design-cheat-sheet-cache
The cache is a layer that stores a subset of data, typically the most frequently accessed or essential information, in a location quicker to access than its primary storage location.
https://www.youtube.com/watch?v=bP4BeUjNkXc
Caching strategies, Distributed Caching, Eviction Policies, Write-Through Cache and Least Recently Used (LRU) cache are all important terms when it comes to designing an efficient system with a caching layer.
https://www.youtube.com/watch?v=dGAgxozNWFE
Message queues
https://www.youtube.com/watch?v=xErwDaOc-Gs
Big data
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Performance tuning
https://goperf.dev/
The Go App Optimization Guide is a series of in-depth, technical articles for developers who want to get more performance out of their Go code without relying on guesswork or cargo cult patterns.
https://web.dev/learn/performance
This course is designed for those new to web performance, a vital aspect of the user experience.
https://www.ibm.com/think/insights/application-performance-optimization
Application performance is not just a simple concern for most organizations; it’s a critical factor in their business’s success.
https://www.oreilly.com/library/view/optimizing-java/9781492039259/
Performance tuning is an experimental science, but that doesn’t mean engineers should resort to guesswork and folklore to get the job done.
Distributed systems
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
High availability
https://redis.io/blog/high-availability-architecture/
A high available architecture is when there are a number of different components, modules, or services that work together to maintain optimal performance, irrespective of peak-time loads.
https://www.ibm.com/think/topics/high-availability
High availability (HA) is a term that refers to a system’s ability to be accessible and reliable close to 100% of the time.
High concurrency
https://www.baeldung.com/concurrency-principles-patterns
In this tutorial, we’ll discuss some of the design principles and patterns that have been established over time to build highly concurrent applications.
https://www.baeldung.com/java-concurrency
Handling concurrency in an application can be a tricky process with many potential pitfalls. A solid grasp of the fundamentals will go a long way to help minimize these issues.
https://www.oreilly.com/library/view/concurrency-in-go/9781491941294/
You’ll understand how Go chooses to model concurrency, what issues arise from this model, and how you can compose primitives within this model to solve problems.
https://www.oreilly.com/library/view/modern-concurrency-in/9781098165406/
With this book, you'll explore the transformative world of Java 21's key feature: virtual threads.
https://www.youtube.com/watch?v=qyM8Pi1KiiM
https://www.youtube.com/watch?v=wEsPL50Uiyo
Flink
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Spark
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Hive
[English] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Kafka
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
Spring Boot
https://spring.io/guides/gs/spring-boot
This guide provides a sampling of how Spring Boot helps you accelerate application development.
https://www.youtube.com/watch?v=Nv2DERaMx-4&list=PLzUMQwCOrQTksiYqoumAQxuhPNa3HqasL
The author teaches you how to use Spring Boot from a complete beginner, to building a REST API with a real database, Dockerising it and deploying it to the cloud.
MyBatis
https://mybatis.org/mybatis-3/getting-started.html
https://www.baeldung.com/mybatis
MyBatis is an open source persistence framework which simplifies the implementation of database access in Java applications.
MySQL
https://juejin.cn/post/7190306988939542585
A hardcore, experience-based MySQL learning roadmap covering database fundamentals, SQL statement usage, database constraints, design, and more.
[English] MySQL Tutorial
https://www.mysqltutorial.org/
Your go-to resource for mastering MySQL in a fast, easy, and enjoyable way.
https://www.youtube.com/watch?v=5OdVJbNCSso
MySQL SQL tutorial for beginners
https://www.youtube.com/watch?v=7S_tz1z_5bA
This beginner-friendly course teaches you SQL from scratch.
Redis
[English] Developer Hub
https://redis.io/dev/
Get all the tutorials, learning paths, and more you need to start building—fast.
https://www.runoob.com/redis/redis-tutorial.html
REmote DIctionary Server (Redis) is a key-value storage system written by Salvatore Sanfilippo; it is a cross-platform, non-relational database.
https://www.youtube.com/watch?v=jgpVdJB2sKQ
In this video I will be covering Redis in depth from how to install it, what commands you can use, all the way to how to use it in a real world project.
TiDB
OceanBase
https://en.oceanbase.com/docs/common-oceanbase-database-10000000001028975
This topic describes how to deploy OceanBase Database in a demo environment, a cluster, or a container for quick hands-on experience.
Doris
https://doris.apache.org/docs/gettingStarted/what-is-apache-doris
ClickHouse
[English] Advanced Tutorial
https://clickhouse.com/docs/tutorial
Learn how to ingest and query data in ClickHouse using the New York City taxi example dataset.
https://www.youtube.com/watch?v=FtoWGT7kS-c
ClickHouse is an open-source column-oriented DBMS for online analytical processing that allows users to generate analytical reports using SQL queries in real-time.
https://www.youtube.com/watch?v=Rhe-kUyrFUE&list=PL0Z2YDlm0b3gcY5R_MUo4fT5bPqUQ66ep
Presto
[English] What is Presto?
https://prestodb.io/what-is-presto/
https://www.tutorialspoint.com/apache_presto/index.htm
Calcite
https://calcite.apache.org/docs/tutorial.html
This is a step-by-step tutorial that shows how to build and connect to Calcite.
https://www.baeldung.com/apache-calcite
It’s a powerful data management framework that can be used in various scenarios concerning data access.
AI agent
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
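Related positions
Experienced hire · 5+ years · Technical
1. Team management: Manage the data development team's day-to-day work, set tasks and goals, track members' project progress and business results, raise the team's overall skill level, and build the talent pipeline. 2. Project development and delivery: Lead the team in developing, implementing, and maintaining offline and real-time big data business requirements, and work with the PM to develop and deliver requirements quickly. 3. Data governance and quality assurance: Lead or participate in data governance and management, including designing data models, defining data standards, and building a data management system, to ensure data quality, including accuracy, completeness, and consistency. 4. Cross-department collaboration and communication: As the technical point of contact, collaborate with other departments and teams to ensure requirements are reasonable and offer a sound return on investment; work closely with product, data insights, operations, and other roles to turn business requirements into technical solutions and use data to empower business growth.
Updated 2025-09-28
Experienced hire · JNU21
1. Develop big data analytics and visualization products on the big data platform; 2. Own front-end technology selection and research for data products, and drive and optimize component abstraction in existing front-end projects; 3. Be responsible for the quality and efficiency of the front-end team's output.
Updated 2018-07-19
Experienced hire · 2+ years · WXG Public Technology
1. Own the architecture design and development of the WeChat data platform front end, contribute directly to building core front-end capabilities, and support data applications for several major WeChat products; 2. Optimize the WeChat data platform front-end systems, including performance, stability, and fault tolerance; 3. Assist the team with technology selection and solution design, and collaborate with product, testing, and other teams to ensure delivery quality.
Updated 2025-06-23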