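Ant Financial | Ant Group - Big Data Engineer - Alipay Technology
Experienced hire; full-time; 4+ years of experience; Technology - Data; Location: Shanghai | Hangzhou; Status: Hiring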
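Qualifications
1. Strong resilience under pressure, communication skills, and self-motivation; excellent planning and execution ability; a strong sense of responsibility; and an outstanding capacity to learn. Passionate about technology and willing to keep trying new technologies and business challenges.
2. Bachelor's degree or above, with 3+ years of big-data-related work experience; MapReduce (MR) development experience (required); accumulated experience in data warehouse construction and data architecture governance at massive data scale; tech stack includes but is not…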
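Responsibilities
1. Own data planning and construction for Alipay's engineering-efficiency domain, advancing the domain's digitalization and intelligent automation.
2. Provide end-to-end data solutions covering data collection, computation, storage, and productization, and take part in building them.
3. Own data architecture governance for the domain, keeping the domain data warehouse healthy and well organized. This includes building core data assets and assuring data quality.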
Includes English-language materials
Big Data
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Spark
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Hive
[English] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Flink
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Hadoop
https://www.runoob.com/w3cnote/hadoop-tutorial.html
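Hadoop provides reliable, scalable application-layer computing and storage support for large computer clusters. It allows distributed processing of large data sets across clusters of computers using simple programming models, and it scales from a single machine to several thousand machines.
[English] Hadoop Tutorial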
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows you to store and process big data in a distributed environment across clusters of computers using simple programming models.
HDFS
https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html
The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware.
https://www.ibm.com/cn-zh/think/topics/hdfs
The Hadoop Distributed File System (HDFS) is a file system that manages large data sets and can run on commodity hardware.
MapReduce
https://www.youtube.com/watch?v=bcjSe0xCHbE
https://www.youtube.com/watch?v=cHGaQz0E7AU
In this video I explain the basics of the MapReduce model, an important concept for any software engineer to be aware of.
Related positions
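Experienced hire; 3+ years of experience; Technology - Data
Own the design and construction of the data system for Alipay's digital business, including but not limited to building end-to-end processes and mechanisms, end-to-end data development, and a stable, scalable data system, as well as building business bills to drive ecosystem prosperity and business growth.
Updated 2025-08-12; Shanghai | Hangzhou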
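Experienced hire; 3+ years of experience; Technology - Data
1. Working from the massive data of the Alipay app, use data mining algorithms, large models, and similar techniques to mine Alipay's internal services and service flows in depth, and take a deep part in building Alipay's on-device intelligence.
2. Explore mining, understanding, and perceiving user behavior from massive user-behavior data, and explore new interaction models for the app.
Updated 2025-08-18; Beijing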
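Experienced hire; 4+ years of experience; Technology - Development
About the team: The User Platform team in Alipay's technology department is responsible for Alipay's user-information infrastructure and user privacy protection, and for unifying and understanding real-time user behavior across all clients and supply across the whole network, empowering Alipay in the AI era through deep user understanding.
Job description:
1. Analyze and process massive user data to produce user features and profiles.
2. Analyze and process massive supply data to produce supply features and supply understanding.
3. Serve online user features and supply features to Alipay's business scenarios.
4. Build AI engineering that deeply understands users and supply, serving Alipay's AI scenarios.
Updated 2025-08-28; Hangzhou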