字节跳动推荐架构工程师-Enterprise 大数据方向
社招全职A206708地点:上海状态:停招
任职要求
1、熟练掌握 Spark、Flink、Kafka 等大数据批流应用开发,熟悉原理与性能调优; 2、熟练掌握 Java、Scala、Python 等编程语言,具备扎实的计算机基础和强悍的编码、系统设计及解决问题的能力; 3、熟悉 Yarn、K8s、HD…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
字节跳动推荐架构团队企业服务方向,负责字节跳动旗下国内和海外的推荐系统 toB 产品架构设计、开发与演进,打造敏捷高效的推荐数据架构能力。 1、负责火山引擎智能推荐平台和海外推荐产品的数据架构持续演进; 2、建设领先的索引、特征和样本生产回溯与存储方案,支持算法高效迭代; 3、建设灵活稳固合规的大数据底座,满足云化、私有化、全球化的严苛挑战; 4、建设体系化的流程与工具,优化系统成本提升整体的稳定性与效率。
包括英文材料
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Kafka+
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
性能调优+
https://goperf.dev/
The Go App Optimization Guide is a series of in-depth, technical articles for developers who want to get more performance out of their Go code without relying on guesswork or cargo cult patterns.
https://web.dev/learn/performance
This course is designed for those new to web performance, a vital aspect of the user experience.
https://www.ibm.com/think/insights/application-performance-optimization
Application performance is not just a simple concern for most organizations; it’s a critical factor in their business’s success.
https://www.oreilly.com/library/view/optimizing-java/9781492039259/
Performance tuning is an experimental science, but that doesn’t mean engineers should resort to guesswork and folklore to get the job done.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Scala+
还有更多 •••
相关职位
社招软件开发岗
1.参与京东多个重要流量场景的推荐和增长体系的研发; 2.与产品经理、业务算法一起持续优化用户体验,提升推荐效果; 3.解决业务迭代过程中遇到的架构和性能问题。
更新于 2025-09-09北京
社招3-5年J0012
1、设计与搭建个性化推荐系统架构,提升系统性能和稳定性; 2、参与分布式推荐存储和特征中心的建设,支持在线存储服务和离线特征迭代的开发; 3、参与针对推荐业务的机器学习系统搭建,包括推荐模型的训练、图神经网络和在线预估和召回等等; 4、优化算法可扩展性,优化自动测试工具, 保障算法策略模块快速迭代。
更新于 2025-08-26北京
社招J0012
1、主导超大规模推荐系统的核心存储架构设计,构建支持千亿级特征的高性能数据管道,优化实时特征计算(毫秒级延迟)与离线特征处理(PB级吞吐)的混合负载调度; 2、研发新一代多模态特征存储引擎,设计支持稀疏矩阵、时序特征、图的高效存储格式,实现特征数据的版本化管理和跨DC级数据同步; 3、打造智能特征服务平台,集成特征血缘追踪、自动降级熔断、热点数据预取等核心能力,支撑日均千亿次特征查询的稳定服务; 4、探索存储技术前沿,落地向量化检索、持久化内存、异构存储池管理等创新技术,构建支持千卡级GPU集群的特征供给体系; 5、设计面向特征工程的开发框架,实现特征变换、特征注册、质量监控的完整工具链,提升算法团队迭代效率。
更新于 2026-03-18北京