蚂蚁金服蚂蚁集团-语音交互系统专家/高级专家-杭州
社招全职5年以上技术-多媒体技术地点:杭州状态:招聘
任职要求
1、计算机、电子信息、自动化、声学或相关专业,统招本科及以上学历(硕士/博士优先); 2、具备扎实的音频信号处理或机器学习基础,有智能硬件/IoT设备等场景的音频算法落地与性能优化经验; 3、深入理解语音识别或音频理解系统,对基于大语言模型(LLM)的端到端语音识别与理解有深入研究和实践优化经验; 4、编程功底扎实,精通 Pytho…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1、负责全双工语音交互系统的核心算法研发,包括但不限于3A算法、流式语音端点检测(VAD)、说话人分离(Diarization)、话语权决策/轮次检测(Turn-taking)等模块的设计与优化; 2、探索并落地统一的音频理解前端,实现复杂场景下的多人对话理解与交互,解决复杂场景下的端到端语音识别与对话理解问题; 3、跟踪业界前沿的语音/音频技术发展(如最新深度学习架构在音频领域的应用),持续提升现有算法系统的性能与鲁棒性,解决实际业务中的长尾问题
包括英文材料
学历+
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
IOT+
https://microsoft.github.io/IoT-For-Beginners/#/
Azure Cloud Advocates at Microsoft are pleased to offer a 12-week, 24-lesson curriculum all about IoT basics.
https://www.ibm.com/think/topics/internet-of-things
The Internet of Things (IoT) refers to a network of physical devices, vehicles, appliances, and other physical objects that are embedded with sensors, software, and network connectivity, allowing them to collect and share data.
https://www.youtube.com/watch?v=1KVrBjSqS5s
The term 'Internet of Things' was coined by Kevin Ashton in 1999 to refer to connecting the Internet to the physical world via sensors.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
语音识别+
https://developer.nvidia.com/blog/essential-guide-to-automatic-speech-recognition-technology/
Over the past decade, AI-powered speech recognition systems have slowly become part of our everyday lives, from voice search to virtual assistants in contact centers, cars, hospitals, and restaurants.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
还有更多 •••