
SenseTime Speech Algorithm Engineer
Experienced hire | Full-time | Algorithm research | Location: Beijing, Shenzhen | Status: Hiring
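Requirements
1. Master's degree or above in artificial intelligence, machine learning, signal processing, computer science, or a related field, with solid fundamentals.
2. Familiar with mainstream speech recognition models and algorithms, such as RNN-T, Conformer, and CTC.
3. Familiar with at least two of the following toolkits: Kaldi, k2, WeNet, ESPnet, Whisper, FunASR.
4. Strong learning and research ability; able to read English-language literature independently; passionate about solving challenging problems.
5. Solid grounding in machine learning theory and excellent algorithm implementation skills; proficient with PyTorch and other deep learning frameworks; command of how SSL, LLM, diffusion, contrastive learning, and other machine learning techniques apply to audio generation.
6. Possesses…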
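Responsibilities
1. Algorithm R&D, performance optimization, and production deployment of large models for speech understanding, speech generation, and speech interaction in multimodal voice interaction scenarios.
2. R&D for digital human scenarios, covering personalized real-time emotional conversational speech synthesis, low-resource voice cloning, speech recognition, speech enhancement, speech detection, language identification, voiceprint (speaker) recognition, speaker diarization, voice conversion, and music generation.
3. Streaming adaptation of speech algorithm engines, inference optimization, high-concurrency low-latency cloud services, and custom development of privately deployed services.
4. Follow the latest research trends in academia and industry, produce new research results, and bring them into actual products.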
Includes English-language materials
Machine learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Academic degree
Speech recognition
https://developer.nvidia.com/blog/essential-guide-to-automatic-speech-recognition-technology/
Over the past decade, AI-powered speech recognition systems have slowly become part of our everyday lives, from voice search to virtual assistants in contact centers, cars, hospitals, and restaurants.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
RNN
https://d2l.ai/chapter_recurrent-neural-networks/rnn.html
A neural network that uses recurrent computation for hidden states is called a recurrent neural network (RNN).
https://www.deeplearningbook.org/contents/rnn.html
Recurrent neural networks, or RNNs (Rumelhart et al., 1986a), are a family of neural networks for processing sequential data.
https://www.ibm.com/think/topics/recurrent-neural-networks
A recurrent neural network or RNN is a deep neural network trained on sequential or time series data to create a machine learning (ML) model that can make sequential predictions or conclusions based on sequential inputs.
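PyTorch
https://datawhalechina.github.io/thorough-pytorch/
PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years has become the framework most commonly used in academia to implement deep learning algorithms.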
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Deep learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
Large models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
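Related positions

Experienced hire
Responsibilities: 1. Develop data and models for speech generation technologies such as speech synthesis, voice cloning, and full-duplex voice calls, and assist with business deployment; 2. Continuously track cutting-edge algorithm developments in the industry and support the growth of the company's influence in core technologies.
Updated 2024-12-09 | Beijing
Campus hire | R&D
1. Help build speech algorithm capabilities, including but not limited to speech recognition, acoustic models, language models, hotword techniques, speech synthesis, and audio forgery detection; 2. Handle compression and quantization, inference acceleration, and compact deployment of speech algorithms; 3. Track the planning of cutting-edge technologies in speech algorithms and take part in bringing core algorithms and system solutions into business use.
Updated 2025-08-08 | Beijing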