快手数据挖掘算法工程师
校招全职J1002地点:北京状态:招聘
任职要求
1、计算机/数学/物理/信息等专业硕士及以上学历; 2、扎实的统计学功底,擅长数据分析和科学实验方法,对机器学习、深度学习等主流算法理解深刻,且应用熟练; 3、熟练掌握Python、C++、JAVA、R等至少一门编程语言,优秀的编码能力和习惯,良好的数据结构和算法功底; 4、熟悉基于Hive、Spark、ElasticSearch、Mongodb等大数据平…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1、负责海量短视频生产链路算法优化,基于视频特效、用户画像、行为序列、消费反馈等大规模数据信号进行算法建模,加强特效、美颜等视频生产业务的智能化; 2、通过异常检测、因果推断、自动归因等算法等对生产、消费数据进行挖掘,洞察业务痛点,指导业务优化方向; 3、挖掘热点事件、预测流行趋势,帮助视频特效等业务更好运营和生产; 4、挖掘用户特征,用于提升广告与用户匹配的效率、业务反欺诈、渠道反作弊、搜索索引等业务场景。
包括英文材料
学历+
数据分析+
[英文] Data Analyst Roadmap
https://roadmap.sh/data-analyst
Step by step guide to becoming an Data Analyst in 2025
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
R+
[英文] R Tutorial
https://www.w3schools.com/r/
R is often used for statistical computing and graphical presentation to analyze and visualize data.
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
还有更多 •••
相关职位
社招1-3年J0012
1、整合海量多维数据,进行全站数据挖掘,构建用户画像体系、时空知识体系,并搭建全站核心数据资产管理平台; 2、深入业务场景,利用全站海量多维数据,综合运用统计和数据挖掘/机器学习的方法,探索平台新的业务增长点,为各类业务系统提供特征和模型支撑; 3、深度参与归因分析、异常检测、知识图谱等专题类建设工作。
更新于 2026-03-17北京
社招1年以上D6255
1、整合海量多维数据,进行全站数据挖掘,构建用户画像体系、时空知识体系,并搭建全站核心数据资产管理平台; 2、深入业务场景,利用全站海量多维数据,综合运用统计和数据挖掘/机器学习的方法,探索平台新的业务增长点,为各类业务系统提供特征和模型支撑; 3、深度参与归因分析、异常检测、知识图谱等专题类建设工作。
更新于 2024-07-15北京
社招2年以上G341
1、研究数据挖掘或统计学习领域的前沿技术,针对海量用户行为和内容信息,构建和优化用户画像以及用户属性; 2、基于对用户理解和大量数据特征,参与风控、精准营销、个性化定价等模型建设和领域研究,提升产品效果; 3、根据公司需要寻找和采集相关数据,对原始数据进行清理、甄别、归类和整合,并实现流程自动化。
更新于 2020-11-11北京
