vivo大数据工程师
社招全职5年以上研发类地点:深圳状态:招聘
任职要求
1、本科及以上学历,计算机、数学、统计等相关专业, 5年以上互联网行业工作经验; 2、有丰富用户画像、标签体系建设经验,精通常用的机器学习模型和算法,有良好的逻辑思维能力和数据驱动思维,善于分析和解决问题; 3、熟练掌握至少一种编程语言:Java/Scala/Python,熟练掌握HIVE SQL,熟悉Linux系统及常用shell命令,熟悉大数据常用软件,如Hadoop、Spark、Flink、Kafka、HBase等,熟练运用常用算法和数据结构; 4、较好的沟通能力、团队协作能力,积极主动,推动力强,愿意接受挑战; 5、熟悉计算广告框架或者推荐系统架构,有互联网行业广告业务、推荐业务、DMP平台建设经验者优先。
工作职责
1、负责海量用户数据的分析和挖掘,构建用户画像体系; 2、负责用户特征分析与洞察,搭建用户价值分级模型,以及在广告场景/个性化推荐等场景的落地应用,助力业务转化效果提升; 3、负责应用链路效果分析,洞察优化点,持续深入挖掘数据策略应用机会。
包括英文材料
学历+
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Scala+
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Bash+
[英文] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Kafka+
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
HBase+
[英文] HBase Tutorial
https://www.tutorialspoint.com/hbase/index.htm
HBase is a data model that is similar to Google's big table designed to provide quick random access to huge amounts of structured data. This tutorial provides an introduction to HBase, the procedures to set up HBase on Hadoop File Systems, and ways to interact with HBase shell.
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
推荐系统+
[英文] Recommender Systems
https://www.d2l.ai/chapter_recommender-systems/index.html
Recommender systems are widely employed in industry and are ubiquitous in our daily lives.
相关职位
社招5年以上研发类
1、通过用户分析、平台分析、内容分析等挖掘业务增长机会,通过 AB 实验、模型搭建及落地等助力科学决策与业务增长; 2、洞察业务诉求,提出增长策略,为广告、游戏等业务提供数据基建与数据科学服务;
更新于 2025-07-16
社招3年以上研发类
1、负责海量用户数据的分析和挖掘,构建用户画像体系; 2、负责用户特征分析与洞察,搭建用户价值分级模型,以及在广告场景/个性化推荐等场景的落地应用,助力业务转化效果提升; 3、负责应用链路效果分析,洞察优化点,持续深入挖掘数据策略应用机会。
更新于 2024-04-30
社招5-7年研发类
1、负责海量用户行为数据与内容数据挖掘,构建高质量的用户画像体系,包括用户基础属性、行业兴趣偏好等; 2、负责用户行为分析与预测,搭建人群优选、用户价值分级等算法模型,应用于广告场景/个性化推荐等场景,助力业务效果提升; 3、负责人群应用链路效果分析,洞察优化点,持续深入挖掘数据策略应用机会并落地实践。

社招3年以上
1、负责大数据平台的开发和治理,包括数据采集、存储、处理、计算和展示等全生命周期的工作; 2、负责数据仓库和数据湖的设计和实现,提供高效、可靠、安全、可扩展的数据存储和计算能力; 3、熟悉数据架构、数据模型、数据质量和数据安全等方面的知识,能够设计和优化数据模型、ETL流程和数据治理流程; 4、熟练掌握Hadoop、Spark、Flink、Hive、HBase等大数据技术和工具,能够根据业务需求选择并使用适当的技术; 5、熟悉数据可视化和报表工具,如Tableau、PowerBI、FineBI等,能够根据开源报表系统定制化开发报表; 6、关注最新的大数据技术和行业发展趋势,参与技术选型和架构设计,推动大数据平台的技术创新和业务应用。
更新于 2024-07-19