滴滴资深语音算法工程师(J250529004)

社招全职3-5年技术2025-10-28地点：北京状态：招聘

扫码手机上打开

任职要求

1、电子、计算机或相关声学、信号处理专业毕业，具备一定语音信号处理基础
2、熟悉Pytorch框架，良好的编程能力，熟练使用python编程语言，具备Linux平台开发经验
3、3-5年语音识别、音频事件检测、声纹识别、语音合成等算法经验
4、在ICASSP、Interspeech、ASRU等语音顶会或国际竞赛有论文发表或优异成绩优先
5、ACM竞赛取得优异成绩或有优秀C++编程能力优先
6、有大型语音识别、语音合成项目经验者优先
Qualifications
1.	Bachelor’s or higher degree in electronics, computer science, acoustics, signal processing, or related fields, with a solid foundation in speech signal processing.
2.	Proficient in PyTorch framework, strong programming skills, fluency in Python, and experience in Linux platform de…

登录查看完整任职要求

微信扫码，1秒登录

工作职责

1、负责语音理解和语音生成算法在滴滴场景的落地使用
2、跟进最新技术，结合业务场景，提升语音识别、音频事件检测、声纹识别、语音合成等算法效果
3、探索语音大模型或多模态大模型在语音理解及语音生成场景的应用范式
4、算法优化，从模型架构、推理框架、量化压缩等角度提升模型推理速度、降低推理成本

Job Description
1.	Responsible for the implementation of speech understanding and speech generation algorithms in Didi’s business scenarios.
2.	Stay updated with the latest technologies and improve the performance of algorithms such as speech recognition, audio event detection, speaker recognition in real-world applications.
3.	Explore the application paradigms of large language models or multimodal models in speech understanding and generation scenarios.
4.	Optimize algorithms by enhancing inference speed and reducing costs through improvements in frameworks and quantization

📮 投递简历 ✨AI模拟面试

难度：

包括英文材料

PyTorch+