鹰角网络大数据测试工程师(平台)
社招全职3年以上地点:上海状态:招聘
任职要求
1、本科及以上学历,3年以上工作经验,能够独立负责大数据产品的测试工作 2、熟悉互联网测试,具备数据管理平台、数据仓库、ETL、BI报表等产品相关经验者优先 3、熟练掌握SQL,能独立编写复杂SQL语句,具备数据分析和数据验证能力 4、熟悉常用的测试方法和流程,能独立设计完备的测试用例,具备缺陷分析和定位的能力 5、熟悉至少一种编程语言(Python/Java/Go),能编写数据类测试脚本或工具 6、理解并熟悉大数据技术体系(Hive/Spark/Flink等)原理,有分布式系统测试经验者优先 7、具有较好的人际沟通能力,有较强的责任心、团队精神和自我驱动力
工作职责
1、负责大数据平台相关业务的测试工作,包括但不限于技术基建平台及组件功能、业务数仓底表等项目,为产品的交付质量负责 2、参与大数据产品的需求评审,按照数据架构、数据链路制定合理的测试策略与质量保障方案 3、负责编写测试计划与测试用例,执行测试、跟踪缺陷并进行测试结果分析,输出测试报告 4、跟进需求、开发、测试、发布及线上各阶段的质量情况,推动问题闭环与质量提升 5、参与并推动大数据质量体系建设,包括测试规范制定、自动化测试框架搭建、性能与稳定性测试等工作 6、持续优化大数据业务的测试流程与方法论,提升测试效率与质量可视化能力
包括英文材料
学历+
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
数据仓库+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
ETL+
https://www.ibm.com/think/topics/etl
ETL—meaning extract, transform, load—is a data integration process that combines, cleans and organizes data from multiple sources into a single, consistent data set for storage in a data warehouse, data lake or other target system.
https://www.youtube.com/watch?v=OW5OgsLpDCQ
It explains what ETL is and what it can do for you to improve your data analysis and productivity.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
数据分析+
[英文] Data Analyst Roadmap
https://roadmap.sh/data-analyst
Step by step guide to becoming an Data Analyst in 2025
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
脚本+
[英文] Scripting language
https://en.wikipedia.org/wiki/Scripting_language
https://zhuanlan.zhihu.com/p/571097954
一个脚本通常是解释执行而非编译。脚本语言通常都有简单、易学、易用的特性,目的就是希望能让程序员快速完成程序的编写工作。
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
分布式系统+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
相关职位
社招3年以上D2815
1、负责大数据产品相关(大数据开发、数据分析、流量管理与分析、AB实验平台等产品)的测试以及质量保证方面的工作; 2、根据数据产品的需求,分析并制定测试计划,设计测试数据和测试用例,执行测试用例,快速定位和解决问题; 3、对线上问题进行持续追踪,并从中得出一些优化监控、测试方案、框架提升等改进措施; 4、在项目中积极与产品经理、开发工程师和用户进行有效沟通,推进问题及时合理地解决。
更新于 2025-03-24
社招A156683
1、负责字节跳动大数据平台高可用性保障,协同大数据各组件团队制定稳定性标准、明确职责边界、推进稳定性项目落地; 2、负责运维流程标准建设和相应工具能力建设,包括稳定性目标管理、监控诊断运维能力、容灾应急方案等; 3、负责推进大数据组件风险治理和事故管理,降低平台事故、提升运维效率、降低运维成本。
更新于 2025-03-03
社招1年以上程序&技术类
1、负责数据采集、数据处理、数据分析、数据可视化等全流程测试,包括需求分析、测试设计、用例编写、测试执行和缺陷跟踪,保障数据链路的准确性、完整性和稳定性; 2、设计和开发自动化测试脚本,提升大数据测试效率和覆盖率; 3、负责大数据ETL流程、数据同步、数据质量、数据一致性、数据安全等方面的测试与验证; 4、参与数据平台性能测试、容量测试、稳定性测试,定位和分析系统瓶颈; 5、协助开发、产品等团队,推动数据产品质量持续提升; 6、编写和维护测试相关文档,参与测试流程和规范的优化。