Tencent Music Speech Synthesis Algorithm Engineer
Internship / Part-time | Technical | Location: Shenzhen | Status: Hiring
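Requirements
1. PhD in computer science, information science, communications, signal processing, or a related field.
2. Familiar with speech synthesis techniques; understands the principles of Diffusion, Transformer, and LLM; familiar with speech synthesis models such as VITS, VALLE, and cosyvoice.
3. Solid foundation in audio theory and signal theory, along with a grounding in machine learning theory.
4. Familiar with Linux, proficient in Python programming, with a rigorous coding style.
5. Strong ability to read literature in both Chinese and English; able to get up to speed quickly with open-source frameworks.
6. Good communicator, passionate about technology, diligent in learning, and positive in attitude.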
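Responsibilities
1. Responsible for speech synthesis work for QQ Music and long-form audio audiobooks. Application scenarios include podcast commentary, AI assistant, AI listening companion, and AI interactive chat within QQ Music, as well as production scenarios such as AI audiobook production and the 声播 AIGC production tool on long-form audio platforms such as QQ Music Radio and Lanren Tingshu (懒人听书).
2. Responsible for model training, algorithm optimization, inference acceleration, and production launch of cutting-edge large speech synthesis models.
3. Responsible for large audio understanding models.
4. Responsible for algorithm research and implementation of full-duplex communication large speech models.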
Includes English-language materials
Transformer
https://huggingface.co/learn/llm-course/en/chapter1/4
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
Large language models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Machine learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Linux
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, for absolute beginners, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Related positions
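Experienced hire | IEG | Technical
1. Explore pre-training, fine-tuning, RAG, and evaluation techniques for large speech models in the gaming domain; 2. Explore applications of speech technology, especially large speech models, in game scenarios, bringing more intelligent capabilities to game creation, operations, and interaction; 3. Optimize existing production algorithms, including R&D on TTS, music generation, and singing voice synthesis; 4. Track other frontier technologies in speech signal processing and explore their practical application.
Updated 2025-04-25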
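Experienced hire | CSIG | Technical
1. Responsible for R&D and deployment of speech synthesis algorithms, including but not limited to implementing and improving the TTS front end, acoustic models, and vocoders; 2. Drive adoption of speech synthesis technology in products, tuning algorithms and improving quality for specific business scenarios; 3. Track industry-leading speech synthesis and related technologies, exploring and building up new technical capabilities.
Updated 2025-04-24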
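Experienced hire | CSIG | Technical
1. Responsible for R&D and deployment of speech synthesis algorithms, including but not limited to implementing and improving the TTS front end, acoustic models, and vocoders; 2. Drive adoption of speech synthesis technology in products, tuning algorithms and improving quality for specific business scenarios; 3. Track industry-leading speech synthesis and related technologies, exploring and building up new technical capabilities.
Updated 2025-04-24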