字节跳动资深测试开发工程师-抖音支付数据方向
社招全职U4995地点:北京状态:招聘
任职要求
1、计算机或其相关专业,精通Python/Java/Go等至少一门编程语言; 2、有算法模型或大数据相关技术经验优先,包括但不限于Hadoop/Hive/Spark/Storm/Flink/HBase/ES/ClickHouse等; 3、具备较好的主导项目,多团队协作,高质量交付能力; 4、对测试行业/质量保障体系建设工作有深入的思考,了解行业现状和前瞻性方向; 5、有搜索/推荐/广告/数仓经验者优先。
工作职责
1、参与支付/消费金融/保险增长策略和算法模型交付/财经数仓相关数据质量保障,对产品交付质量负责; 2、参与构建相关质量保障体系,包括不限于增长策略的质量验收标准、模型稳定性和效果评估、大数据准确性、稳定性监控等; 3、线上质量数据的分析建模、问题发现、归因,改进现有工具或开源项目,推动适合的技术应用落地,帮助质量改进。
包括英文材料
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
HBase+
[英文] HBase Tutorial
https://www.tutorialspoint.com/hbase/index.htm
HBase is a data model that is similar to Google's big table designed to provide quick random access to huge amounts of structured data. This tutorial provides an introduction to HBase, the procedures to set up HBase on Hadoop File Systems, and ways to interact with HBase shell.
ElasticSearch+
https://www.youtube.com/watch?v=a4HBKEda_F8
Learn about Elasticsearch with this comprehensive course designed for beginners, featuring both theoretical concepts and hands-on applications using Python (though applicable to any programming language). The course is structured in two parts: first covering essential Elasticsearch fundamentals including index management, document storage, text analysis, pipeline creation, search functionality, and advanced features like semantic search and embeddings; followed by a practical section where you'll build a real-world website using Elasticsearch as a search engine, working with the Astronomy Picture of the Day (APOD) dataset to implement features such as data cleaning pipelines, tokenization, pagination, and aggregations.
ClickHouse+
[英文] Advanced Tutorial
https://clickhouse.com/docs/tutorial
Learn how to ingest and query data in ClickHouse using the New York City taxi example dataset.
https://www.youtube.com/watch?v=FtoWGT7kS-c
ClickHouse is an open-source column-oriented DBMS for online analytical processing that allows users to generate analytical reports using SQL queries in real-time.
https://www.youtube.com/watch?v=Rhe-kUyrFUE&list=PL0Z2YDlm0b3gcY5R_MUo4fT5bPqUQ66ep
相关职位
社招A160626
1、负责国际支付数据产品平台的服务端研发; 2、参与和负责数据产品平台的架构设计、技术方案设计和系统实施; 3、结合业务发展趋势,建立数据驱动的平台产品能力,使数据能更好的服务于国际支付的业务发展; 4、与数据产品经理、数据开发、项目经理、各业务域产研紧密配合,共同完成目标。
更新于 2023-12-15
社招3年以上技术类-数据
方向一: 1. 解决数据在业务N场景下的基建问题,充分提升底层数据建设,提升N项目中的数据研发质量和效率 2.解决N项目的数据洞察问题,通过数据的理解分析,发现N项目的可提升点,辅助业务快速提升 3. 解决N项目资产沉淀等问题,为算法和人工策略提供基础,辅助业务在铺设、营销等场景的快速提升 方向二: 1.数据基建:负责支付宝核心支付数据体系建设,解决支付场景下的数据基建问题,提升支付的数据研发质量和效率,使支付数据体系可持续发展。 2.数据洞察:围绕支付业务目标,通过数据分析洞察、构建分析工具,为业务目标达成提高效率;通过统计建模、因果推断、AB实验设计等数据科学方法,在科学量化运营策略效果和价值的同时,产出业务策略优化建议,助力业务目标达成。 3.前沿技术探索与落地:探索大模型等技术在支付数据场景的应用,推动数据分析和挖掘的效能革新。
更新于 2025-09-19
社招3年以上A236901
1、推动支付安全能力的设计与落地(包括数据分级治理、数据资产地图、数据血缘跟踪应用、数据加解密、数据共享、权限管控、安全审计、支付应用安全等); 2、推动支付业务线数据安全风险治理(包括风险识别与发现、解决方案制定与推动治理、持续改进优化及事后跟踪等); 3、对现有数据安全控制措施有效性进行持续验证,跟进数据安全事件识别、响应、处置、调查、取证。
更新于 2024-06-12