小米高级数据服务开发工程师 - 商业化
社招全职5年以上A194338地点:北京状态:招聘
任职要求
1. 本科及以上学历,熟悉Java,5年以上生产项目开发经验,有计算广告经验者优先; 2. 对高性能在线服务、分布式计算、大规模存储中的一项或多项有深入理解; 3. 熟练掌握Hive、Spark/Flink、Doris等一种以上大数据开发组件,有应用和优化经验; 4. 工作积极主动,责任心强,善于沟通,具备良好的团队协作能力; 5. 了解Agent开发技术组件,如LangGraph、RAG、SSE、Spring Webflux等技术
工作职责
1. 深入理解数据分析需求,负责小米广告海量数据的高性能数据分析平台,帮助广告商业化提效 2. 深入理解广告业务,负责小米广告系统智能诊断与监控,保障系统稳定运行 3. 负责小米广告系统与广告主间投放策略、数据归因等能力交互,承载高并发、低时延的架构需求 4. 负责广告数据平台智能问数、智能诊断Agent相关开发
包括英文材料
学历+
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Doris+
https://doris.apache.org/docs/gettingStarted/what-is-apache-doris
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
RAG+
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
Spring+
https://liaoxuefeng.com/books/java/spring/index.html
Spring是一个支持快速开发Java EE应用程序的框架。它提供了一系列底层容器和基础设施,并可以和大量常用的开源框架无缝集成,可以说是开发Java EE应用程序的必备。
https://spring.io/guides/gs/rest-service
https://spring.io/quickstart
Level up your Java code and explore what Spring can do for you.
相关职位
社招2年以上JRM41
1、负责字节商业化核心业务的离线与在线数据服务平台建设与维护; 2、负责字节商业化核心业务BI工具,取数工具,核心数据看板的建设与维护工作; 3、负责字节商业化核心业务业绩,提成等模块的建设与维护工作;
更新于 2020-11-12
社招2年以上技术类
1. 负责公司内部商业化数据的开发和维护,为产品和营销团队提供数据支持和分析服务; 2. 设计和开发商业化数据仓库和数据集市,实现数据的采集、清洗、存储和分析; 3. 负责数据架构的设计和维护,确保数据准确性、完整性和安全性; 4. 参与业务需求分析和数据建模工作,编写SQL语句完成数据提取、转换和加载(ETL); 5. 能够独立完成数据问题的排查和处理,解决数据质量和性能问题; 6. 具有良好的沟通能力和团队协作能力,与不同部门的业务人员和技术人员合作,推进数据项目的进展。
更新于 2025-04-07
社招3-5年后端开发
岗位职责: - 参与小红书商业化数据产品开发工作,业务方向包括但不限于销售业绩、客户分析、代理商盯盘等 - 与产品、运营、后端、测试、运维等多角色协同工作,包括业务理解,需求评审,方案沟通,系统维护等 - 设计并实现高效、可扩展的数据架构,确保系统能够支持复杂的业务逻辑和大数据量处理,持续提升交付质量和效率 - 负责复杂数据链路架构、稳定性、成本、性能等方面的优化工作,保障线上服务运行稳定,资源使用合理
更新于 2025-10-16