蚂蚁金服蚂蚁集团-数据研发专家(风险)-杭州
社招全职3年以上技术类-数据地点:杭州状态:招聘
任职要求
1.计算机、软件工程等相关理工科专业背景,本科及以上学历, 3年以上工作经验,具有丰富的数据建模与实践经验 2.精通业务建模、数据仓库建模设计开发,具备体系化的数据质量与数据治理相关经验,有大型项目实践经验,能独立主导完成某一业务领域的整体模型设计与落地 3.掌握离线(Hive/Spark)或者实时(Flink)数据研发技术体系及底层原理,有丰富的相关项目经验,PB级数据处理与优化经验 4.对数据敏感,能从数据中发现问题、解决问题、熟悉数据科学基本方法(如因果推断、归因等),有金融风控数据相关项目经验优先 5.热爱数据技术,良好的思维逻辑性和语言表达能力,能自我驱动,有强烈的求知欲与进取心,有团队合作精神,敢于挑战,能在压力下成长 6.具备一定的JAVA、Python语言的工程开发能力、机器学习算法能力尤佳
工作职责
1.负责消费信贷贷后数据架构和指标体系建设,基于业务理解完成数据建模及数据指标体系设计开发,发现洞察业务问题和机会,沉淀精品数据资产和抽象数据产品提升业务效能 2.深入理解业务的策略打法,敏锐洞察业务痛点,利用数据技术和数据科学手段为业务决策、增长策略提供专业化的离在线数据解决方案,助力万亿级规模的信贷业务高速且稳健的发展 3.负责数据质量、稳定性、计存治理等数据治理工作,让数据标准更规范、数据获取更高效、数据链路具备更好的可扩展性和可维护性
包括英文材料
学历+
数据仓库+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
数据治理+
https://www.ibm.com/think/topics/data-governance
Data governance is the data management discipline that focuses on the quality, security and availability of an organization’s data.
https://www.youtube.com/watch?v=uPsUjKLHLAg
Building data fabric eliminates the technological complexities of data governance so users can connect to the right data at the right time, regardless of where it resides.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
数据科学+
https://roadmap.sh/ai-data-scientist
Step by step roadmap guide to becoming an AI and Data Scientist
因果推断+
https://web.stanford.edu/~swager/causal_inf_book.pdf
How best to understand and characterize causality is an age-old question in philosophy.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
相关职位
社招3年以上技术类-数据
1)熟悉隐私安全法律法规,制定数据风险管理领域的解决方案。让蚂蚁业务数据安全、合规、高效流动.; 2)负责风险领域相关数据资产建设,数据化指引/落地风险管理治理工作; 3)能够主动推动安全合规技术以及产品平台的不断迭代优化,主导能力在业务侧的推广运营落地。
更新于 2025-09-23
社招5年以上云智能集团
1、技术方案设计,技术方案的落地与实现,并确保产品稳定性并持续提升产品性能实现性能优化, 2、参与从用户侧到后端资源侧,数据链路,控制链路,性能日志采集,审计,检索,分析等一整套分布式系统的研发,提供全球数据库服务; 3、利用云原生,基于K8S,Docker,云上ECS/神龙,云盘,VPC等云原生技术与数据库技术结合,给用户提供优质体验,高性价比,易用,高性能的云数据库服务; 4、通过产品化,智能化方式管控阿里云和阿里巴巴经济体的大规模分布式数据库实例集群,并支撑公共云和集团业务需求,为双十一等大促场景提供稳定,顺滑的体验。 5、参与数据库 DBaaS 平台的产品规划和平台技术演进。
更新于 2025-09-22
社招技术类-开发
1、负责应对各种复杂业务场景的分布式文件系统的设计与研发,包含高可用高可靠高性能设计,文件系统核心 IO 栈的研发,参与数据路径和元数据路径的设计和研发。 2、负责分布式文件系统的稳定性工程,包括但不限于系统的可观测性、FaultTolerance、多租户 QoS系统研发。针对专属云网络隔离、专线带宽受限等特定风险,负责针对性的稳定性设计、SOP 和 演练。
更新于 2025-06-18