Tencent Speech Synthesis Algorithm Engineer
Experienced hire · Full-time · CSIG · Technology · Location: Shenzhen · Status: Hiring
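Requirements
1. Work experience in speech synthesis, with an academic background or project experience in speech, signal processing, computer science, or a related field;
2. Deep understanding of TTS principles and related technologies; has taken part in R&D of cutting-edge speech synthesis algorithms and in deploying them in real projects;
3. Proficient in Python, and expert in at least one training framework, PyTorch or TensorFlow;
4. Familiar with C/C++, with strong algorithm implementation and engineering optimization skills;
5. Strong learning ability; able to quickly master and apply new technologies;
6. Good teamwork and communication skills;
7. Publications at top conferences or in top journals are a plus;
8. Familiarity with large models, zero-shot speech synthesis, and related technologies is a plus.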
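Responsibilities
1. Responsible for R&D and deployment of speech synthesis algorithms, including but not limited to implementing and improving the TTS front end, acoustic models, and vocoders;
2. Drive the adoption of speech synthesis technology in products, tuning algorithms and improving quality for specific business scenarios;
3. Track cutting-edge speech synthesis and related technologies in the industry, and explore and build up new technical capabilities.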
Includes English-language materials
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, starts from zero, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
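PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.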
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
C
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
This handbook follows the 80/20 rule: you will learn 80% of the C programming language in 20% of the time.
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Large models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
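Related positions
Experienced hire · IEG · Technology
1. Explore pre-training, fine-tuning, RAG, evaluation, and other techniques for large speech models in the gaming domain;
2. Explore applications of speech technology, especially large speech models, in game scenarios, providing more intelligent capabilities for game creation, operations, interaction, and other stages;
3. Optimize existing production algorithms, including R&D on TTS, music generation, singing voice synthesis, and other algorithms;
4. Track other cutting-edge speech signal processing technologies and explore their practical application.
Updated 2025-04-25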
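Experienced hire · CSIG · Technology
1. Responsible for R&D and deployment of speech synthesis algorithms, including but not limited to implementing and improving the TTS front end, acoustic models, and vocoders;
2. Drive the adoption of speech synthesis technology in products, tuning algorithms and improving quality for specific business scenarios;
3. Track cutting-edge speech synthesis and related technologies in the industry, and explore and build up new technical capabilities.
Updated 2025-04-24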
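Internship · Technology
1. Responsible for speech synthesis work in QQ音乐 (QQ Music) and in long-form audio and audiobooks. Application scenarios include podcast commentary, AI assistant, AI listening companion, and AI interactive chat in QQ Music, as well as AI audiobook production and the 声播 AIGC production tool on long-form audio platforms such as QQ音乐电台 (QQ Music Radio) and 懒人听书;
2. Responsible for model training, algorithm optimization, inference acceleration, and production launch of state-of-the-art large speech synthesis models;
3. Responsible for large audio understanding models;
4. Responsible for algorithm research and implementation of large speech models for full-duplex communication.
Updated 2025-07-14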