美团医药健康数据开发工程师-实习生
实习兼职核心本地商业-基础研发平台地点:北京状态:招聘
任职要求
1.具有扎实的计算机专业知识,极强的问题解决能力; 2.掌握数据仓库的经典建模方法,熟悉不同建模方法的优劣,三年以上的数仓开发经验; 3.掌握大数据生态技术栈,具备较丰富的Hadoop、Hive、doris、Spark、flink、kafka等大数据工具应用和开发经验; 4.扎实的SQL功底,了解不同框架下SQL执行的原理,有过性能优化的实际经验; 5.优秀的业务理解能力和良好的沟通协调能力; 6.心态开放,保持好奇心,有自驱力。 具备以下条件优先 1.具备AI工具使用经验优先:使用过如DeepSeek、ChatGPT、Claude、Cursor、MCopilot等AI工具,能把握不同模型的优势边界,可运用AI解决实际业务问题; 2.有数据敏感度、能够从数据分析的视角看待问题或有一定数据分析经验; 3.了解或有一定系统开发经验,能够使用java、python等语言进行编程; 4.有数据一致性保障相关实践经验; 5.有体系化大数据治理相关实践经验。
工作职责
1.承担美团医药健康业务线的数仓设计和开发工作; 2.承担业务方应用层数据的搭建和开发工作; 3.承担医药健康业务数据质量、成本、安全等各方向数据治理工作; 4.业务方数据问题的统一接口人与综合解决方案提供方,对外提供一站式服务; 5.跨团队沟通、推动数据生产链路上的问题改进。
包括英文材料
数据仓库+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Doris+
https://doris.apache.org/docs/gettingStarted/what-is-apache-doris
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Kafka+
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
数据分析+
[英文] Data Analyst Roadmap
https://roadmap.sh/data-analyst
Step by step guide to becoming an Data Analyst in 2025
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
数据治理+
https://www.ibm.com/think/topics/data-governance
Data governance is the data management discipline that focuses on the quality, security and availability of an organization’s data.
https://www.youtube.com/watch?v=uPsUjKLHLAg
Building data fabric eliminates the technological complexities of data governance so users can connect to the right data at the right time, regardless of where it resides.
相关职位
社招核心本地商业-基
1.参与美团CLC和业务场景BI工具的建设,保障系统的稳定性,持续对性能进行优化和保质保量按时交付; 2.参与数据领域 AIOps运维体系建设,包括全链路数据可观测、治理工具等,提升运维和运营效率
更新于 2025-06-20
社招5年以上核心本地商业-医
1.负责跟进销售数据分析,及时发现问题并提出改进建议; 2.负责制定销售运营策略,优化销售流程,提升销售效率; 3.建立迭代销售业务流程机制,开发高效的销售数据工具,提升效率,对业务团队快速响应; 4.负责与其他部门沟通合作,推动销售业务的顺利进行。
更新于 2025-06-17
社招5年以上核心本地商业-医
1.负责跟进销售数据分析,及时发现问题并提出改进建议; 2.负责制定销售运营策略,优化销售流程,提升销售效率; 3.建立迭代销售业务流程机制,开发高效的销售数据工具,提升效率,对业务团队快速响应; 4.负责与其他部门沟通合作,推动销售业务的顺利进行。
更新于 2025-06-17