美团数据平台研发工程师|AI + 数据 / OLAP / 实时计算方向
社招全职软硬件服务-软件研发部地点:北京状态:招聘
任职要求
1. 本科及以上学历,计算机、软件工程、数据科学、人工智能等相关专业优先。 2. 熟悉 Java / Go / Python / Scala / TypeScript 中至少一种语言,具备扎实的工程开发能力。 3. 熟悉 SQL、数据仓库、数据建模、指标体系、任务调度、数据治理等数据研发基础能力。 4. 熟悉 OLAP 查询系统或分析型数据库,有 StarRocks、ClickHouse、Doris、Trino、Presto、DuckDB 等使用或建设经验优先。 5. 熟悉 Flink 实时计算,有实时数仓、实时指标、实时宽表、实时数据服务、流批一体等实践经验优先。 6. 理解 LLM / AI Agent 应用开发范式,熟悉 Prompt Engineering、RAG、Function Calling / Tool Use、Agent Workflow、Text-to-SQL 等能力。 7. 具备较强的问题定位能力,能够处理 SQL …
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1. 负责 AI + 数据平台能力建设,基于 LLM / AI Agent 提升数据开发、实时计算、OLAP 查询、指标分析和数据资产使用效率。 2. 建设面向数据研发场景的 Data Agent,包括需求理解、指标口径查询、表资产推荐、SQL 生成、实时任务开发辅助、数据校验、问题诊断、上线运维辅助等能力。 3. 参与 OLAP 查询能力建设,围绕 StarRocks / ClickHouse / Doris / Trino / DuckDB 等引擎,支持报表分析、经营看板、实时查询、数据服务等场景。 4. 参与自研实时计算链路建设,包括实时数据接入、实时指标加工、实时宽表、窗口计算、状态管理、任务稳定性治理和延迟优化。 5. 结合 AI Agent 能力优化数据研发流程,将 SQL 开发、Flink 任务开发、指标口径校验、查询性能诊断、数据质量排查等高频工作自动化。 6. 建设数据资产 Skill 体系,将表结构、指标口径、血缘关系、数据质量规则、查询样例、实时任务 SOP、开发规范等沉淀为 Agent 可调用能力。 7. 打通数据平台、元数据平台、指标平台、调度系统、实时计算平台、OLAP 引擎、BI 系统和知识库,形成面向数据研发和分析的智能工作台。 8. 负责 AI 数据助手的工程化落地,包括 Prompt 管理、工具调用、权限控制、上下文管理、日志追踪、效果评估、异常兜底和成本优化。
包括英文材料
学历+
数据科学+
https://roadmap.sh/ai-data-scientist
Step by step roadmap guide to becoming an AI and Data Scientist
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Scala+
TypeScript+
https://www.youtube.com/watch?v=JHEB7RhJG1Y
Master TypeScript from basics to advanced concepts through hands-on tutorials covering type annotations, generics, data fetching, Zod library, and more, with practical challenges for effective real-world application.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
数据仓库+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
数据治理+
https://www.ibm.com/think/topics/data-governance
Data governance is the data management discipline that focuses on the quality, security and availability of an organization’s data.
https://www.youtube.com/watch?v=uPsUjKLHLAg
Building data fabric eliminates the technological complexities of data governance so users can connect to the right data at the right time, regardless of where it resides.
OLAP+
https://www.youtube.com/watch?v=iw-5kFzIdgY
OLAP (for online analytical processing) is software for performing multidimensional analysis at high speeds on large volumes of data from a data warehouse, data mart, or some other unified, centralized data store.
StarRocks+
https://docs.starrocks.io/docs/quick_start/
These Quick Start guides will help you get going with a small StarRocks environment.
https://itnext.io/introduction-to-starrocks-a-new-modern-analytical-database-1db2177d26e1
Recently, I had the opportunity to explore StarRocks which is the new kid in the block when talking about massive scale databases which are able to handle petabytes of data.
ClickHouse+
[英文] Advanced Tutorial
https://clickhouse.com/docs/tutorial
Learn how to ingest and query data in ClickHouse using the New York City taxi example dataset.
https://www.youtube.com/watch?v=FtoWGT7kS-c
ClickHouse is an open-source column-oriented DBMS for online analytical processing that allows users to generate analytical reports using SQL queries in real-time.
https://www.youtube.com/watch?v=Rhe-kUyrFUE&list=PL0Z2YDlm0b3gcY5R_MUo4fT5bPqUQ66ep
Doris+
https://doris.apache.org/docs/gettingStarted/what-is-apache-doris
Presto+
[英文] What is Presto?
https://prestodb.io/what-is-presto/
https://www.tutorialspoint.com/apache_presto/index.htm
还有更多 •••