vivo Big Data Engineer
Experienced hire · Full-time · 5-7 years · R&D · Location: Shenzhen · Status: Hiring
Requirements
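1. Bachelor's degree or above in computer science, mathematics, statistics, or a related field; at least 5 years of experience in the internet industry (advertising, recommendation, NLP, etc.).
2. Extensive experience with user profiling and text data processing; strong logical thinking and a data-driven mindset; good at analyzing and solving problems.
3. Background in traditional machine learning and deep learning; proficient with the TensorFlow/PyTorch frameworks; expert in at least one programming language, including but not limited to Python, Java, and Scala.
4. Solid grounding in data structures and algorithms; development experience with massive-scale data processing and distributed computing; familiar with the Hadoop and Spark frameworks.
5. Good communication and teamwork skills; proactive, with strong drive, and willing to take on challenges.
6. Familiarity with computational advertising frameworks or recommender system architectures; experience with advertising, recommendation, or DMP platform development in the internet industry is a plus.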
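Responsibilities
1. Mine massive volumes of user behavior data and content data to build a high-quality user profile system, including users' basic attributes, industry interest preferences, and more.
2. Analyze and predict user behavior; build algorithmic models such as audience selection and user value tiering, and apply them in advertising and personalized recommendation scenarios to improve business results.
3. Analyze the effectiveness of the audience application pipeline, identify optimization opportunities, and keep uncovering and implementing opportunities to apply data strategies.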
Includes English-language materials
Education+
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
Machine learning+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Deep learning+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
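PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the most commonly used framework for implementing deep learning algorithms in academia.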
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, for absolute beginners, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction to all of the core concepts in Python.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Scala+
Data structures+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
A full-course tutorial on data structures and algorithms in Java.
Algorithms+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
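Hadoop provides reliable, scalable application-layer computing and storage support for large computer clusters. It allows distributed processing of large data sets across clusters of computers using simple programming models, and it scales from a single machine to several thousand machines.
[English] Hadoop Tutorial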
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework for storing and processing big data in a distributed environment across clusters of computers using simple programming models.
Spark+
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Recommender systems+
[English] Recommender Systems
https://www.d2l.ai/chapter_recommender-systems/index.html
Recommender systems are widely employed in industry and are ubiquitous in our daily lives.
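Related positions
Experienced hire · 5+ years · R&D
1. Uncover business growth opportunities through user, platform, and content analysis, and support evidence-based decision-making and business growth through A/B experiments and by building and deploying models.
2. Understand business needs, propose growth strategies, and provide data infrastructure and data science services for businesses such as advertising and gaming.
Updated 2025-07-16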
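Experienced hire · 3+ years · R&D
1. Analyze and mine massive volumes of user data to build a user profile system.
2. Analyze user features and derive insights; build user value tiering models and deploy them in advertising and personalized recommendation scenarios to improve business conversion.
3. Analyze the effectiveness of the application pipeline, identify optimization opportunities, and keep uncovering opportunities to apply data strategies.
Updated 2024-04-30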
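Experienced hire · 5+ years · R&D
1. Analyze and mine massive volumes of user data to build a user profile system.
2. Analyze user features and derive insights; build user value tiering models and deploy them in advertising and personalized recommendation scenarios to improve business conversion.
3. Analyze the effectiveness of the application pipeline, identify optimization opportunities, and keep uncovering opportunities to apply data strategies.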

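Experienced hire · 3+ years
1. Develop and govern the big data platform across the full data lifecycle, including data collection, storage, processing, computation, and presentation.
2. Design and implement the data warehouse and data lake, providing efficient, reliable, secure, and scalable data storage and computing capabilities.
3. Be familiar with data architecture, data modeling, data quality, and data security, and be able to design and optimize data models, ETL processes, and data governance processes.
4. Be proficient with big data technologies and tools such as Hadoop, Spark, Flink, Hive, and HBase, and be able to choose and apply the right technology for the business need.
5. Be familiar with data visualization and reporting tools such as Tableau, PowerBI, and FineBI, and be able to build customized reports on top of open-source reporting systems.
6. Follow the latest big data technologies and industry trends, take part in technology selection and architecture design, and drive technical innovation and business adoption of the big data platform.
Updated 2024-07-19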