腾讯混元大模型语音算法工程师/专家
社招全职TEG技术地点:北京状态:招聘
任职要求
1.计算机科学、机器学习、人工智能、应用数学等相关专业,硕士及以上学历; 2.在语音信号处理、大语言模型、深度学习等领域具备扎实的研究基础,掌握领域内的最新技术进展; 3.较强的工程实现能力,熟练掌握C/C++, JAVA,Python等至少一种语言,熟练使用主流深度学习框架; 4.有较强的学术比赛经验、或者在重要数据集的Leaderboard上排名靠前、或在开源社区有较大影响力等优先; 5.有高质量论文发表者优先(如INTERSPEECH,ICASSP,CVPR,AAAI,NIPS,TIP,ICCV,ECCV等); 6.具备激情,好学,良好的团队合作和沟通能力。
工作职责
1.负责大模型语音模态的设计、开发和优化,包括但不限于语音/音频数据清洗、模型设计、训练策略等方面的研究与应用; 2.参与语音识别、语音合成、声音克隆等相关大模型语音模态能力的建设,提高跨模态整体效果。
包括英文材料
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
学历+
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
CVPR+
https://cvpr.thecvf.com/
ICCV+
https://iccv.thecvf.com/
ICCV is the premier international computer vision event comprising the main conference and several co-located workshops and tutorials.
ECCV+
https://eccv.ecva.net/
ECCV is the official event under the European Computer Vision Association and is biannual on even numbered years.
相关职位
社招TEG技术
1.多模态驱动引擎开发,通过对文本/语音/视觉等信息,构建虚拟人表情、动作的驱动大模型; 2.设计多模态条件生成框架,实现语音、表情、镜头、肢体动作的联合优化; 3.开发多模态特征同步技术:语音-表情时序对齐、文本语义-镜头运动关联建模。
更新于 2025-05-30
社招2年以上混元助手-其他技
1.跟踪业界最新的语音生成算法研究,探索下一代语音、音频生成新范式,拓展语音生成边界能力; 2.探索多模态语音大模型的前沿技术,结合文本、语音、视觉等技术提升语音交互体验; 3.负责语音大模型的技术研发工作,推动模型性能提升与创新应用。
更新于 2025-10-16
社招3年以上TEG技术
1.负责混元大模型相关研发工作,包括文本创作、文本理解、数学、翻译、Agent FunctionCalls等专项; 2.负责混元在公司内相关业务场景落地,根据业务需求优化混元模型,提升业务效果; 3.负责跟踪和探索大语言模型的前沿问题,结合实际场景,提供全面的技术解决方案,参与前沿算法与应用的研究。
更新于 2025-06-19