Kuaishou Senior Data Mining Engineer
Experienced hire · Full-time · 3+ years · D0599 · Location: Beijing · Status: Hiring
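Requirements
1. Research and project background in machine learning or data mining; proficient in classification, regression, clustering, and other machine learning models; able to break business problems down into suitable data and algorithm problems and deliver the resulting value in practice.
2. Solid programming fundamentals and mastery of at least one programming language; experience with big data computing and distributed algorithm development.
3. Curiosity, strong sensitivity to data and business, and a keen interest in data-driven business.
4. Bachelor's degree or above, with 3+ years of experience in data mining, machine learning, or large-scale data analysis.
5. Proficient in at least one of Go/Java/C++/Scala/Python; familiar with common big data processing tools such as Hadoop/MapReduce/Spark/Hive.
6. Hands-on experience with large-scale user profiling at an internet company; experience in ad delivery, user attribute modeling, or similar work is a plus.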
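Responsibilities
1. Integrate massive multi-dimensional data and carry out site-wide data mining to build asset tag systems for multiple entities such as users, clients, and content.
2. Analyze and study data alongside actual business needs; for specific business scenarios, mine audience-segment tags, integrate third-party data to build user tiering, and precisely characterize user attributes.
3. Participate deeply in user tag system development, ad delivery performance optimization, and related work.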
Includes English-language materials
Machine Learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Data Mining
https://www.youtube.com/watch?v=-bSkREem8dM
Database vs Data Warehouse vs Data Lake
https://www.youtube.com/watch?v=7rs0i-9nOjo
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Big Data
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Education
Data Analysis
[English] Data Analyst Roadmap
https://roadmap.sh/data-analyst
Step by step guide to becoming a Data Analyst in 2025
Go
https://www.youtube.com/watch?v=8uiZC0l4Ajw
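A complete tutorial for learning Golang! Start to finish in under an hour, including a full demonstration of how to build an API in Go. No filler, just what you need to know.
Java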
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Scala
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, beginner-friendly, with complete examples, based on the latest version of Python 3.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Hadoop
https://www.runoob.com/w3cnote/hadoop-tutorial.html
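Hadoop provides reliable, scalable application-layer computing and storage support for large computer clusters. It allows large data sets to be processed in a distributed fashion across clusters of computers using simple programming models, and it scales from a single machine to thousands of machines.
[English] Hadoop Tutorial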
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
MapReduce
https://www.youtube.com/watch?v=bcjSe0xCHbE
https://www.youtube.com/watch?v=cHGaQz0E7AU
In this video I explain the basics of Map Reduce model, an important concept for any software engineer to be aware of.
Spark
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Hive
[English] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Related positions
Experienced hire · Information technology
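1. Develop a deep understanding of the business, analyze business data, deliver data solutions to the business, and apply machine learning/deep learning algorithms for data modeling.
2. Own the construction of the user/product profiling system, continuously iterating on, evaluating, and refining user/product tags based on massive user behavior and content information.
3. Participate in intelligent-growth businesses, including but not limited to search and recommendation, user growth, and marketing; uncover growth opportunities and drive improved business results.
Updated 2025-04-16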
Experienced hire · 3+ years · D12854
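Use the Kuaishou platform's massive static and dynamic user data, applying machine learning, data mining, and related techniques to perform tag identification, audience mining, interest mining, representation learning, and more. Specific responsibilities include:
1. Mine and analyze users' full-domain data to model users and precisely characterize their attributes.
2. Track, develop, and optimize advanced industry algorithms to improve the effectiveness and efficiency of profiling models.
3. Design and model user cognitive tags such as interests and intent, and help optimize related business outcomes.
4. Analyze and study data alongside actual business needs, and mine audience-segment tags for specific business scenarios.
Updated 2024-07-15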
Experienced hire · 3-8 years · SOFTWARE
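1. Own personalized recommendation of lock-screen image-and-text content for overseas markets, improving display efficiency and user experience, and optimizing the full recommendation pipeline from a global perspective.
2. Build and manage the lock-screen image-and-text tag system, and develop user profile features to capture user preferences.
3. Monitor and track long-term business metrics, optimize algorithm strategies, and drive business growth.
Updated 2025-06-16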