饿了么淘宝闪购-高级数据研发专家-上海
社招全职5年以上技术类-数据地点:上海状态:招聘
任职要求
1. 学历与背景 - 计算机科学、软件工程、数据科学或相关专业硕士及以上学历(优秀者可放宽); - 熟悉分布式系统原理、数据库原理及大数据生态技术栈。 2. 技术能力 - 精通至少一种编程语言(Java/Python/Scala),熟悉SQL及NoSQL数据库(如Hive、HBase、ClickHouse、MongoDB等); - 熟悉大数据处理框架(如Hadoop、Spark、Flink、Kafka、Storm等),具备实际项目开发经验;熟悉Alibaba MaxCompute更好。 - 熟悉数据仓库设计及数据可视化工具(如Tableau、Power BI、Qu…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1. 大数据平台架构设计与开发 - 负责构建、优化和维护企业级大数据平台,包括数据采集、存储、处理、分析及可视化系统; - 设计高可用、高并发、可扩展的大数据架构,支持海量数据的实时/离线处理与分析。 - 设计高质量的数据模型,确保模型规范易用 2. 数据处理与分析 - 基于阿里大数据开发规范,构建数据仓库和数据湖,开发离线和实时ETL任务。 - 利用统计分析/机器学习/深度学习算法挖掘数据洞察,支持运营和产品决策和行动 3. 问题排查与系统性能优化 - 及时诊断、定位、解决离线和实时等各类计算任务的问题; - 对长耗时计算任务进行性能优化 4. 技术研究与创新 - 善于技术钻研,跟踪大数据领域前沿技术,推动技术落地与应用; - 推动AI技术在数据研发域的效能提升和产品创新 5. 数据安全与合规 - 设计并实施数据安全策略,确保数据隐私与合规性。
包括英文材料
学历+
数据科学+
https://roadmap.sh/ai-data-scientist
Step by step roadmap guide to becoming an AI and Data Scientist
分布式系统+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Scala+
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
NoSQL+
https://nosql-database.org/
Everything about NoSQL Systems – Types, Benefits, and Real-World Uses
https://piaosanlang.gitbooks.io/mongodb/content/section1.1.html
NoSQL(NoSQL = Not Only SQL ),即"不仅仅是SQL",指的是非关系型的数据库。是对不同于传统的关系型数据库管理系统的统称。
https://www.youtube.com/watch?v=0buKQHokLK8
NoSQL databases can operate in multiple modes: as key-value store, document store or wide column store.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
HBase+
[英文] HBase Tutorial
https://www.tutorialspoint.com/hbase/index.htm
HBase is a data model that is similar to Google's big table designed to provide quick random access to huge amounts of structured data. This tutorial provides an introduction to HBase, the procedures to set up HBase on Hadoop File Systems, and ways to interact with HBase shell.
ClickHouse+
[英文] Advanced Tutorial
https://clickhouse.com/docs/tutorial
Learn how to ingest and query data in ClickHouse using the New York City taxi example dataset.
https://www.youtube.com/watch?v=FtoWGT7kS-c
ClickHouse is an open-source column-oriented DBMS for online analytical processing that allows users to generate analytical reports using SQL queries in real-time.
https://www.youtube.com/watch?v=Rhe-kUyrFUE&list=PL0Z2YDlm0b3gcY5R_MUo4fT5bPqUQ66ep
MongoDB+
https://learnxinyminutes.com/mongodb/
MongoDB is a NoSQL document database for high volume data storage.
https://studio3t.com/academy/#courses
The fastest way to learn MongoDB
https://www.youtube.com/watch?v=c2M-rlkkT5o
This video will give you and introduction to MongoDB in 1 Hour. Afterwards I recommend exploring aggregation, replication, and sharding.
https://www.youtube.com/watch?v=ExcRbA7fy_A&list=PL4cUxeGkcC9h77dJ-QJlwGlZlTd4ecZOA
You'll learn how to use MongoDB (a NoSQL database) from scratch. You'll also learn how to integrate it into a simple Node.js API.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
还有更多 •••
相关职位
社招3年以上技术类-算法
1. 负责淘宝闪购用户增长业务场景中的推荐算法的研发,包括图文多媒体推荐、消息PUSH推送、店铺菜品信息流、外投广告投放等,从用户体验、流量效果和商业目标等方向设计和迭代算法,促进商业发展 2. 负责淘宝闪购用户增长算法设计与调优,创新全生命周期的用户增长以及增值算法,促进拉新效率和留存效果,增加用户粘性,防止用户流失 3. 负责本地商业机制设计和改进,提升商家淘宝闪购运营体验和效率
更新于 2026-04-03杭州|上海

社招3年以上技术类-算法
1. 负责淘宝闪购用户增长业务场景中的推荐算法的研发,包括图文多媒体推荐、消息PUSH推送、店铺菜品信息流、外投广告投放等,从用户体验、流量效果和商业目标等方向设计和迭代算法,促进商业发展 2. 负责淘宝闪购用户增长算法设计与调优,创新全生命周期的用户增长以及增值算法,促进拉新效率和留存效果,增加用户粘性,防止用户流失 3. 负责本地商业机制设计和改进,提升商家淘宝闪购运营体验和效率
更新于 2026-04-09杭州|上海
社招3年以上技术类-算法
1. 负责淘宝闪购搜索推荐算法的基础模型研发工作,包括店铺和商品信息流推荐、搜索结果页排序等,覆盖千万级DAU; 2. 基于业务问题,设计并实现推荐全链路算法模型,包括召回、粗排、精排、重排及混排等模块,搜索全链路算法模型,包括Query理解、召回、精排、重排等模块,持续迭代提升业务效果; 3. 跟踪国内外搜索推荐领域的最新进展,结合业务特点进行技术创新,推动算法模型的优化和升级; 4. 协同业务进行跨团队合作,与产品、运营等部门紧密合作,确保算法模型的有效落地和业务目标的达成。
更新于 2026-03-31上海