蚂蚁金服蚂蚁集团-数据研发工程师-杭州
社招全职2年以上技术类-数据地点:杭州状态:招聘
任职要求
1. 2年以上工作经验,计算机等相关专业本科以上学历 ,具备独立的模块开发能力; 2. 精通业务建模、数据仓库建模、精通ETL设计开发,有数据风险管理与治理相关经验; 3. 熟悉数据仓库领域知识和技能者优先,如Hadoop/Hive/Spark/Flink等; 4. 熟悉SQL执行原理,了解CBO/HBO,结合数据仓库设计可以快速的提供成本/时效/合规/架构最优的解决方案; 5. 具有跨部门的复杂数据项目或者技术领域的管理经验; 6. 热爱大数据,性格沉稳,有较好的语言表达能力,能自我驱动,有强烈的求知欲与进取心,有团队合作精神,敢于挑战,能在压力下成长。
工作职责
1)直面业务问题,制定风险管理领域(隐私风险、法务风险、机构风险、流动性风险、市场风险、内控&操作风险等等)的数据解决方案,建设数据资产,并协同产技落地产品技术能力,助力风险业务提升数字化和智能化的能力。 2)能够主动推动安全合规技术以及产品平台的不断迭代优化,主导能力在业务侧的推广运营落地,让蚂蚁业务数据安全、合规、高效流动.;
包括英文材料
学历+
数据仓库+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
ETL+
https://www.ibm.com/think/topics/etl
ETL—meaning extract, transform, load—is a data integration process that combines, cleans and organizes data from multiple sources into a single, consistent data set for storage in a data warehouse, data lake or other target system.
https://www.youtube.com/watch?v=OW5OgsLpDCQ
It explains what ETL is and what it can do for you to improve your data analysis and productivity.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
相关职位
社招1年以上技术类-数据
职位描述: 1、负责菜鸟全球供应链大数据的采集、存储、处理,通过分布式大数据平台加工数据,支持业务管理决策; 2、参与菜鸟全球供应链大数据体系的模型设计、开发、维护,通过元数据、质量体系有效的管理和组织EB级的数据; 3、参与菜鸟全球供应链大数据产品的研发,通过数据分析和算法洞察数据背后的商业机会点,探索大数据商业化。
更新于 2025-09-11
社招技术类-数据
1.负责蚂蚁集团国际数据体系的建设,通过数据+算法+工程化能力,处理和萃取数据特征支持上层的数据运营决策; 2.参与大数据基础架构、产品技术的规划建设,包括数据合规、数据资产、数据产品、数据质量及稳定性保障体系建设。
更新于 2025-06-30
社招
1、负责核心业务域数据体系的规划和建设,通过数据产品和数据服务等方式,高效支撑业务场景的数据需求 2、深度理解业务,通过对业务策略和痛点的分析,制定系统性端到端的数据解决方案并落地 3、负责数据资产建设、数据质量与稳定性管理,构建共享融通的数据平台,让数据标准更规范、数据获取更高效
更新于 2025-05-23