富途大数据开发负责人
社招全职5年以上技术类地点:深圳状态:招聘
任职要求
1.经验与教育背景 :统招本科及以上学历,计算机科学、信息技术、软件工程或相关专业背景优先。5年及以上大数据开发经验, 2年及以上技术团队管理经验 。有从0到1建设大数据平台或数据中台的经验者优先。2.技术硬实力 :精通Hive、Spark,熟悉Python、Scala等编程语言,具备深厚的分布式系统或数据库系统的理论基础。熟悉整个大数据的完整处理流程,包括数据的采集、清洗、预处理、存储、分析挖掘和数据可视化。精通大数据生态圈技术,如Hadoop、Spark、Flink、Kafka、Hive、HBase、Elasticsearch等,对其原理有深入理解,有源码阅读或…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1.团队管理 :负责数据开发团队的日常管理工作,为团队设定任务和目标,跟进团队成员的项目进度和业务结果,提升团队整体技能水平,做好人才梯队建设。2.项目开发与交付 : 带领团队完成大数据离线、实时等业务需求的开发、实施和维护工作 ,并能配合PM完成需求的快速研发与交付 。3.数据治理与质量保障 :主导或参与数据治理及管理工作,包括设计数据模型、定义数据标准、构建数据管理体系,确保数据的质量,包括准确性、完整性和一致性等。4.跨部门协作与沟通 :作为技术接口人, 需要与其他部门和团队协作,确保需求的合理性和投入产出比,与产品、数据洞察、运营等多个角色紧密合作, 将业务需求转化为技术方案,并用数据赋能业务发展。
包括英文材料
学历+
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Scala+
分布式系统+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Kafka+
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
HBase+
[英文] HBase Tutorial
https://www.tutorialspoint.com/hbase/index.htm
HBase is a data model that is similar to Google's big table designed to provide quick random access to huge amounts of structured data. This tutorial provides an introduction to HBase, the procedures to set up HBase on Hadoop File Systems, and ways to interact with HBase shell.
还有更多 •••
相关职位
社招软件开发岗(团队
1、负责商业化广告数据处理和数据应用的产品调研以及架构设计; 2、承担数据相关业务的重难点技术攻坚,主导核心数据处理框架和重要组件的研发; 3、分析和发掘现有系统的不足,定位系统瓶颈,提高系统性能、稳定性以及业务扩展性; 4、在产品意识,技术认知和思考模式等方面对团队持续输出影响力; 5、主导跨部门协作和复杂功能的调研、设计、协调、实施和落地;
更新于 2025-09-10北京
社招JNU21
1、基于大数据平台开发大数据分析、可视化产品; 2、负责数据产品的前端技术选型和调研,推动与优化已有前端项目的组件抽象; 3、对前端团队产出的质量和效率负责。
更新于 2018-07-19北京
社招2年以上WXG公共技术
1.负责微信数据平台前端的架构设计与开发,直接参与前端核心能力的建设,支撑微信多个重要产品的数据应用; 2.负责微信数据平台前端系统的优化,包括:性能优化、稳定性优化,容错优化等; 3.协助团队完成技术选型和方案设计等工作,并与产品、测试等团队协同合作,确保交付质量。
更新于 2025-06-23深圳
