NetEase Voice Interaction Algorithm Engineer (Elite Internship)
Internship/Part-time · Artificial Intelligence · Location: Hangzhou · Status: Hiring
Requirements
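1. Major in computer science, speech signal processing, or a related field; graduating master's or PhD student;
2. Solid foundation in machine learning and deep learning algorithms; familiar with the basic principles and tuning of common generative models;
3. Excellent programming skills and good coding habits;
4. Proficient in at least one mainstream deep learning framework (PyTorch);
5. Project experience in at least one of the following areas:
   - Large speech generation models (CosyVoice 2, F5 TTS, etc.);
   - Speech recognition and understanding (FunASR, Whisper, Qwen Audio, etc.);
   - Other projects involving generative models (GAN, VAE, LLM, Diffusion);
6. Bonus points:
   - Papers at top speech conferences (ICASSP, Interspeech) or higher-tier venues;
   - Open-source projects with a high star count;
   - A passion for games.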
Responsibilities
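1. Work closely on Leihuo's flagship games, developing and shipping real-time voice interaction, voice content production, and innovative voice gameplay features to create new interactive entertainment experiences for players;
2. Track cutting-edge speech technology and bring the latest advances, such as large speech generation models and end-to-end large speech models, into production;
3. Take part in the full lifecycle of speech algorithm solutions, including solution design, algorithm implementation, data engineering, and online serving.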
Includes English-language materials
Machine Learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Deep Learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Coding Standards
[English] Google Style Guides
https://google.github.io/styleguide/
Every major open-source project has its own style guide: a set of conventions (sometimes arbitrary) about how to write code for that project. It is much easier to understand a large codebase when all the code in it is in a consistent style.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
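PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.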
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Large Models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Speech Recognition
https://www.youtube.com/watch?v=mYUyaKmvu6Y
Learn how to implement speech recognition in Python by building five projects.
https://www.youtube.com/watch?v=sR6_bZ6VkAg
How Rev.com harnesses human-in-the-loop and deep learning to build the world's best English speech recognition engine
Related Positions
Campus Recruitment · Artificial Intelligence
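1. Work closely on Leihuo's flagship games, developing and shipping real-time voice interaction, voice content production, and innovative voice gameplay features to create new interactive entertainment experiences for players;
2. Track cutting-edge speech technology and bring the latest advances, such as large speech generation models and end-to-end large speech models, into production;
3. Take part in the full lifecycle of speech algorithm solutions, including solution design, algorithm implementation, data engineering, and online serving.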
Experienced Hire · 3-5 years · NetEase Fuxi
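1. Work closely on Leihuo's flagship games, developing and shipping real-time voice interaction, voice content production, and innovative voice gameplay features to create new interactive entertainment experiences for players;
2. Track cutting-edge speech technology and bring the latest advances, such as large speech generation models and end-to-end large speech models, into production;
3. Take part in the full lifecycle of speech algorithm solutions, including solution design, algorithm implementation, data engineering, and online serving.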
Updated 2025-09-01
Experienced Hire · A29448
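1. Support the deployment of voice interaction technology across ByteDance's wide range of internal and external business scenarios, solve frontier problems that arise during deployment, and continuously improve audio understanding and processing on smart hardware as well as the core technology of the voice assistant;
2. Focus on frontier technology and algorithm quality for on-device intelligent interaction, pursuing and exploring the industry's most advanced algorithms;
3. Own R&D and business support for intelligent audio understanding and processing algorithms in ByteDance's audio content creation and consumption scenarios;
4. Track the latest advances in intelligent audio and upgrade the team's in-house algorithm systems, including echo cancellation, AI noise reduction, multi-channel audio processing, and audio event understanding and detection;
5. Follow and develop advanced audio techniques from across the industry, applying statistical models, machine learning, and deep learning to speech and audio R&D and shipping them in products.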
Updated 2025-03-24