阿里巴巴业务技术-音频算法工程师-3A与智能语音交互方向
社招全职2年以上地点:杭州状态:招聘
任职要求
1. 计算机科学、人工智能、信号处理、声学等相关专业硕士及以上学历; 2. 精通3A算法原理与工程实现,熟悉语音识别(ASR)、语音合成(TTS)、语音对话与理解大模型等技术,有实际项目经验者优先; 3. 熟练掌握Python、C++等编程语言,熟悉TensorFlow、PyTorch等深度…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1. 负责3A核心算法的研发与迭代,包括回声消除(aec)、自动噪声抑制(ans)、自动增益控制(agc),确保算法在复杂声学环境下的鲁棒性,保障全双工通话的高保真音质与交互体验。 2. 推动3A算法的轻量化部署与端侧适配,结合深度学习与传统信号处理技术,平衡算法性能与资源消耗,支撑智能体语音交互技术基座建设。 3. 针对人机语音交互、智能体对话等前沿场景,优化VAD判停和语义完整性检查,实现“快速打断,智能判停,实时响应”的自然对话能力; 4. 研发语音识别(ASR)、语音合成(TTS)技术,并探索交互式语音模型在智能体对话中的应用,搭建可控双流模型的数据,训练和评估体系,优化对话模型的边说边听和情感识别能力; 5. 与产品、工程团队紧密合作,将算法模型落地到实际应用中,并持续优化系统性能。 6. 跟踪学术界和工业界的最新进展,探索前沿技术在对话系统中的应用。
包括英文材料
学历+
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
语音识别+
https://developer.nvidia.com/blog/essential-guide-to-automatic-speech-recognition-technology/
Over the past decade, AI-powered speech recognition systems have slowly become part of our everyday lives, from voice search to virtual assistants in contact centers, cars, hospitals, and restaurants.
语音合成+
https://www.ibm.com/think/topics/text-to-speech
Text to speech (TTS) is a type of technology that converts text on a digital interface into natural-sounding audio.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
还有更多 •••