Kuaishou [Internship with Return Offer] Audio/Music AIGC Algorithm Engineer
Internship | Part-time | J1010 | Location: Beijing | Status: Hiring
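Requirements
1. Master's degree or above; majors in computer-science-related fields such as machine learning, pattern recognition, or signal processing are preferred.
2. Substantial experience with large models for speech/audio/music generation or closely related areas.
3. Proficiency in C/C++ and Python, with strong coding ability.
4. Ability to solve problems independently, with good presentation skills, communication skills, and a sense of teamwork.
Bonus points:
1. R&D experience with T2A, V2A, TTS, or music-generation large models is preferred.
2. Publications at relevant top conferences or journals are preferred (ICASSP, Interspeech, ISMIR, ICML, AAAI, NeurIPS (formerly NIPS), etc.).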
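Responsibilities
1. Research, develop, and optimize key algorithms for large AI audio/music generation models, including but not limited to T2A, V2A, and AI song generation.
2. Follow cutting-edge developments in the industry and track the latest international technical directions.
3. Drive the adoption of audio/music AIGC technology across Kuaishou's business scenarios, and explore new uses of audio/music generation and new business innovation.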
Includes English-language materials
Machine Learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Pattern Recognition
https://www.mathworks.com/discovery/pattern-recognition.html
Pattern recognition is the process of classifying input data into objects, classes, or categories using computer algorithms based on key features or regularities.
https://www.microsoft.com/en-us/research/wp-content/uploads/2006/01/Bishop-Pattern-Recognition-and-Machine-Learning-2006.pdf
Pattern recognition has its origins in engineering, whereas machine learning grew out of computer science.
Large Models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
C
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
This handbook follows the 80/20 rule: you will learn 80% of the C programming language in 20% of the time.
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, for complete beginners, with full examples, based on the latest version of Python 3.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction to all of the core concepts in Python.
ICML
https://icml.cc/
NeurIPS
https://neurips.cc/
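Related positions
Internship J1016
1. Develop and debug features of the audio SDK for live-stream co-hosting.
2. Integrate and debug audio algorithms on iOS and Android.
3. Continuously optimize the audio SDK's performance and user experience.
Updated 2025-03-07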
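Internship J1001
1. Research algorithms for analyzing, processing, and generating multimodal data (such as audio, video, natural language, and user interaction), enabling fusion, conversion, and interaction across modalities.
2. Explore novel application scenarios for multimodal interaction and generation, and drive the adoption of multimodal information processing across business scenarios.
Updated 2025-05-08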
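Internship gamePlan
1. As the "creator" of the game world, design in-game characters and monsters, and work with colleagues to build a moving, immersive virtual world.
2. As the "lead writer" of the game world, write and implement the scripts for main and side storylines, telling of the lives, deaths, joys, and sorrows of the game's characters.
3. Work with art, audio, and other teams to flesh out the game world and fill in its details.
Updated 2025-06-19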