蚂蚁金服蚂蚁集团-大数据工程师-支付宝技术
社招全职4年以上技术类-数据地点:上海 | 杭州状态:招聘
任职要求
1、有良好的抗压能力、沟通能力、自我驱动动力,具备出色的规划、执行力,强烈的责任感,以及优秀的学习能力,对技术有热情,愿意不断尝试新技术和业务挑战。 2、本科以上,3年以上大数据相关工作经验,mr研发经验(必须),在海量数据下的数仓建设,数据架构治理方面有经验沉淀,技术栈包括但不限于spark/hive/flink等使用经验,计算机/数学相关专业优先; 3、熟悉hadoop生态,包括hdfs/mapreduce/hive/spark/flink/hbase; 4、掌握Java、python、sql语言; 5、有数据挖掘分析、算法、大模型应用相关经验者优先;
工作职责
1、负责支付宝工程效能领域数据规划与建设,推进领域数字化、智能化的进程。 2、提供数据采集、计算、存储、产品化全链路数据解决方案,并参与方案建设。 3、负责领域数据架构治理工作,保障领域数仓健康有序发展。包括核心资产建设、数据质量保障等。
包括英文材料
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
HDFS+
https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html
The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware.
https://www.ibm.com/cn-zh/think/topics/hdfs
Hadoop 分布式文件系统 (HDFS) 是一种管理大型数据集的文件系统,可在商用硬件上运行。
MapReduce+
https://www.youtube.com/watch?v=bcjSe0xCHbE
https://www.youtube.com/watch?v=cHGaQz0E7AU
In this video I explain the basics of Map Reduce model, an important concept for any software engineer to be aware of.
HBase+
[英文] HBase Tutorial
https://www.tutorialspoint.com/hbase/index.htm
HBase is a data model that is similar to Google's big table designed to provide quick random access to huge amounts of structured data. This tutorial provides an introduction to HBase, the procedures to set up HBase on Hadoop File Systems, and ways to interact with HBase shell.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
数据挖掘+
https://www.youtube.com/watch?v=-bSkREem8dM
Database vs Data Warehouse vs Data Lake
https://www.youtube.com/watch?v=7rs0i-9nOjo
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
相关职位
社招3年以上技术类-数据
负责支付宝数字化业务的数据体系设计和建设,包括但不限于全链路流程和机制的构建、全链路数据研发工作、稳定可扩展的数据体系建设,构建业务账单,实现生态繁荣和业务增长。
更新于 2025-08-12
社招3年以上技术类-开发
在支付宝app中,“扫一扫”和“付款/收款”长期占据着首页“四大金刚”的两个入口,万物皆码,码作为链接,是移动支付和走向线下的最快捷手段之一。 - “语音助手”等新兴入口,加入到支付宝下拉二楼和悬浮球工具栏,用户在AI应用、服务理解、办结上,有丰富多彩的体验。 我们的优势? - 深入到支付宝最为核心的支付业务,全面掌握整个支付链路。 - 深度参与全新支付场景的探索,运用极具挑战的前沿技术,创造业务价值,享受突破重重困难后的成就感。 - 亿万量级的访问,高并发、高稳定、高可用是我们的日常,大促支付丝般顺滑离不开我们的付出。 你的机会? - 在这里,你将亲身参与“万物皆码”、“生物支付”、“IoT支付”,给人们生活带来的巨大的支付变革! - 在这里,你创新的小想法将有机会落地,进而能够改变亿万用户的支付习惯,你将收获前所未有的成就感
更新于 2025-10-13
社招3年以上技术类-数据
1、基于支付宝端海量数据,通过数据挖掘算法、大模型等手段,深度挖掘支付宝内部服务/服务动线,深度参与到支付宝端侧智能建设; 2、探索基于海量用户行为数据,实现对用户行为挖掘\理解和感知,探索app新的操作模式;
更新于 2025-08-18