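58.com (58同城) Senior Big Data Development Engineer, Intelligent Marketing (J17537)
Experienced hire | Full-time | 3+ years | Technical | Location: Beijing | Status: Hiring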
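Requirements
At least 3 years of experience in data warehousing; command of data warehouse modeling methodology, with hands-on experience in model design and ETL development.
Proficiency in one or more of Hive, HBase, Kafka, Storm, Spark, and Flink; familiarity with the basic principles and tuning strategies of Hive/MySQL and with big data processing technologies.
Proficiency in SQL, with SQL performance tuning experience, and familiarity with MySQL and other common databases.
Command of one or more scripting languages such as shell or Python.
Strong sense of responsibility and good communication skills; passionate about big data development; steady, careful, proactive, and highly self-driven.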
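Responsibilities
Build the full-domain data warehouse covering the marketing business process, provide deep and effective data-driven support to the business, and supply massive feature sets for AI models.
Own offline and real-time big data computing, data modeling, and related technical directions.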
Includes English-language materials
Data Warehouse+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
ETL+
https://www.ibm.com/think/topics/etl
ETL—meaning extract, transform, load—is a data integration process that combines, cleans and organizes data from multiple sources into a single, consistent data set for storage in a data warehouse, data lake or other target system.
https://www.youtube.com/watch?v=OW5OgsLpDCQ
It explains what ETL is and what it can do for you to improve your data analysis and productivity.
Hive+
[English] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
HBase+
[English] HBase Tutorial
https://www.tutorialspoint.com/hbase/index.htm
HBase is a data model similar to Google's Bigtable, designed to provide quick random access to huge amounts of structured data. This tutorial provides an introduction to HBase, the procedures to set up HBase on Hadoop File Systems, and ways to interact with the HBase shell.
Kafka+
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
Apache Storm+
[English] Tutorial
https://storm.apache.org/releases/2.6.0/Tutorial.html
In this tutorial, you'll learn how to create Storm topologies and deploy them to a Storm cluster.
https://www.baeldung.com/apache-storm
This tutorial will be an introduction to Apache Storm, a distributed real-time computation system.
Spark+
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
MySQL+
https://juejin.cn/post/7190306988939542585
A hardcore, experience-based MySQL learning roadmap covering database fundamentals, SQL statement usage, database constraints, design, and more.
[English] MySQL Tutorial
https://www.mysqltutorial.org/
Your go-to resource for mastering MySQL in a fast, easy, and enjoyable way.
https://www.youtube.com/watch?v=5OdVJbNCSso
MySQL SQL tutorial for beginners
https://www.youtube.com/watch?v=7S_tz1z_5bA
This beginner-friendly course teaches you SQL from scratch.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
What is SQL? Simply put, SQL is the standard computer language for accessing and processing relational databases.
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
Performance Tuning+
https://goperf.dev/
The Go App Optimization Guide is a series of in-depth, technical articles for developers who want to get more performance out of their Go code without relying on guesswork or cargo cult patterns.
https://web.dev/learn/performance
This course is designed for those new to web performance, a vital aspect of the user experience.
https://www.ibm.com/think/insights/application-performance-optimization
Application performance is not just a simple concern for most organizations; it’s a critical factor in their business’s success.
https://www.oreilly.com/library/view/optimizing-java/9781492039259/
Performance tuning is an experimental science, but that doesn’t mean engineers should resort to guesswork and folklore to get the job done.
Bash+
[English] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, for absolute beginners, with complete examples, based on the latest Python 3 version.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction to all of the core concepts in Python.
Scripting+
[English] Scripting language
https://en.wikipedia.org/wiki/Scripting_language
https://zhuanlan.zhihu.com/p/571097954
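A script is usually interpreted rather than compiled. Scripting languages are generally simple, easy to learn, and easy to use, with the aim of letting programmers finish writing programs quickly.
Big Data+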
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Related Positions
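Experienced hire | 3+ years | Technical - Data
1. Build the data system and solutions for the Ant Fortune (蚂蚁财富) and insurance business lines, enable digital business operations, improve operational efficiency, and ensure data quality and stability; 2. Plan the core data system for the business domain and design data solutions that treat data as the core production factor, resolving pain points that arise in the business, including but not limited to building the user tagging system, intelligent and automated data systems, and the real-time data system; 3. Build high-quality domain data assets, including but not limited to external data ingestion, data labeling, and feature mining, and provide the necessary support for model training, iteration, and deployment in intelligent scenarios such as intelligent marketing and large models, so that the business's intelligence-upgrade goals progress and land smoothly; 4. Lead or participate in data governance to deliver high-quality data continuously and at low cost; build a data platform for internal data sharing and interoperability, ensure compliant data use, and prevent data leaks and misuse.
Updated 2025-09-01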
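Experienced hire | 2+ years | International Business Development
1. Develop the marketing outreach business systems, gain a deep understanding of business requirements, and support the rapid growth of international business by building an industry-leading outreach, recommendation, and user-growth technology stack; 2. Deploy and maintain systems, continuously optimize the push system architecture, improve disaster recovery and fault tolerance under high concurrency and heavy traffic, and ensure high availability (performance, security, capacity); 3. Against the backdrop of full-domain intelligent ad delivery, take part in the long-term infrastructure work on the push engine architecture and raise the business ceiling through algorithmic recommendation.
Updated 2025-03-18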
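Experienced hire | 3+ years | Technical - Development
Responsible for technology R&D for Ant Insurance (蚂蚁保) auto insurance, for example applying recommendation and operations research, intelligent risk control, and large models in business scenarios, to provide users with intelligent, professional auto insurance technology services: 1. Own architecture design, development, and high-quality delivery in key areas of Ant auto insurance such as marketing, profile creation, policy purchase, and service; 2. Identify and resolve technical issues in business systems and ensure system performance and stability; 3. Work with others to organize cross-team communication and collaboration, ensuring sound internal and external system architecture design and safeguarding project quality and schedule; 4. Build the algorithm engineering pipeline, propose reasonable and feasible architecture evolution and iteration plans, and deliver intelligent business solutions.
Updated 2025-09-10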