腾讯微信小店-数据研发工程师
社招全职2年以上微信技术地点:广州状态:招聘
任职要求
1.计算机科学或相关领域本科及以上学历,具备扎实的数据结构和算法基础; 2.具有 2 年以上大数据研发经验,包括实时/离线数据处理、数据建模、ETL开发与设计、数据治理等; 3.编程能力扎实,熟悉至少一门常用的后台开发语言,如Python、Java、Scala等; 4.掌握大数据相关技术, 比如Hive、Iceberg、Spark、Flink的原理了解,要求有实战经验; 5.熟悉一门ClickHouse、Druid、Doris等OLAP引擎,了解系统原理,要求有实战经验; 6.对数据敏感,工作细致负责,具备良好的问题分析与解决能力,学习能力强,善于沟通,具备良好的团队协作精神。 加分项 1.有大规模分布式数据处理和实时计算经验者优先; 2.熟悉机器学习算法,有数据挖掘和预测模型构建经验者优先; 3.对数据安全和数据治理有深入理解和实践经验者优先。
工作职责
1.负责业务数据研发相关工作,对数据进行整合、清洗、存储形成数据资产满足业务实时离线各种场景的业务需求; 2.负责业务主题离线、实时、湖仓一体设计与研发构建,专项推动其数据应用的高可用、高质量和安全可靠; 3.负责参与数据架构的设计与优化,提升系统性能和稳定性; 4.与产品团队紧密协作,理解业务需求,提供数据支持,推动数据驱动的产品改进; 5.制定和优化数据开发规范和流程,提高团队工作效率和质量; 6.跟踪业界最新技术和动态,将其应用到实际项目中,提升产品竞争力。
包括英文材料
学历+
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
ETL+
https://www.ibm.com/think/topics/etl
ETL—meaning extract, transform, load—is a data integration process that combines, cleans and organizes data from multiple sources into a single, consistent data set for storage in a data warehouse, data lake or other target system.
https://www.youtube.com/watch?v=OW5OgsLpDCQ
It explains what ETL is and what it can do for you to improve your data analysis and productivity.
数据治理+
https://www.ibm.com/think/topics/data-governance
Data governance is the data management discipline that focuses on the quality, security and availability of an organization’s data.
https://www.youtube.com/watch?v=uPsUjKLHLAg
Building data fabric eliminates the technological complexities of data governance so users can connect to the right data at the right time, regardless of where it resides.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Scala+
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
ClickHouse+
[英文] Advanced Tutorial
https://clickhouse.com/docs/tutorial
Learn how to ingest and query data in ClickHouse using the New York City taxi example dataset.
https://www.youtube.com/watch?v=FtoWGT7kS-c
ClickHouse is an open-source column-oriented DBMS for online analytical processing that allows users to generate analytical reports using SQL queries in real-time.
https://www.youtube.com/watch?v=Rhe-kUyrFUE&list=PL0Z2YDlm0b3gcY5R_MUo4fT5bPqUQ66ep
Doris+
https://doris.apache.org/docs/gettingStarted/what-is-apache-doris
OLAP+
https://www.youtube.com/watch?v=iw-5kFzIdgY
OLAP (for online analytical processing) is software for performing multidimensional analysis at high speeds on large volumes of data from a data warehouse, data mart, or some other unified, centralized data store.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
数据挖掘+
https://www.youtube.com/watch?v=-bSkREem8dM
Database vs Data Warehouse vs Data Lake
https://www.youtube.com/watch?v=7rs0i-9nOjo
后端开发+
https://www.youtube.com/watch?v=tN6oJu2DqCM&list=PLWKjhJtqVAbn21gs5UnLhCQ82f923WCgM
Learn what technologies you should learn first to become a back end web developer.
Iceberg+
https://iceberg.apache.org/spark-quickstart/
This guide will get you up and running with Apache Iceberg™ using Apache Spark™, including sample code to highlight some powerful features.
https://www.baeldung.com/apache-iceberg-intro
This tutorial will discuss Apache Iceberg, a popular open table format in today’s big data landscape.
https://www.youtube.com/watch?v=TsmhRZElPvM
You’ve probably heard about Apache Iceberg™—after all, it’s been getting a lot of buzz.
相关职位
社招微信交易平台技术
1.负责微信电商治理相关算法研发与应用,保障微信电商体系下的用户购物体验、公平合规的商家达人经营环境; 2.负责微信电商生态下,商家、达人、短视频、直播、图文等等不同对象主体的理解、问题识别与画像构建; 3.跟踪前沿技术并落地创新,处理海量数据,与产品团队协同提升运营业务效果; 4.探索最前沿的AI技术,并落地到微信电商治理业务场景中。
更新于 2025-06-22
社招1年以上微信交易平台技术
1.负责微信电商治理相关算法研发与应用,保障微信电商体系下的用户购物体验、公平合规的商家达人经营环境; 2.负责微信电商生态下,商家、达人、短视频、直播、图文等等不同对象主体的理解、问题识别与画像构建; 3.跟踪前沿技术并落地创新,处理海量数据,与产品团队协同提升运营业务效果; 4.探索最前沿的AI技术,并落地到微信电商治理业务场景中。
更新于 2025-09-22
社招2年以上微信交易平台技术
1.探索大模型在电商场景的应用; 2.利用大模型优化电商推荐效果,包括但不限于召回、排序等环节; 3.跟踪大模型的前沿进展,研究数据合成、后训练等方法,推动模型在实际场景中的效果优化。
更新于 2025-09-08