
第四范式语音算法工程师
社招全职技术类地点:北京状态:招聘
任职要求
•本科及以上学历,具备语音识别、声纹识别、语音评测、语音合成等方向相关经历•具备良好的编程能力,熟练掌握python/C++等编程语言,优秀的分析问题和解决问题的能力,对解决具有挑战性的问题充满激情•较强的算法实现能力,熟悉深度学习平台如tensorflow/pytorch等加分项•有较强的代码能力优先,有各类竞赛获奖经历(如kaggle,天池、DF、DC等比赛平台)、有过ACM等编程竞赛经历,或代码开源在github上并有较大影响•在Interspeech/ICASSP/ACL/EMNLP/ NAACL等顶会顶级会议或者期刊发表论文者•在大模型多模态领域有相关技术经验或竞赛经验base地点:北京/上海/武汉/深圳均可
工作职责
第四范式是中国智能决策市场的最大参与者。公司致力于实现企业级人工智能快速规模化落地,为企业提供以“决策型AI”、“生成式AI”为核心的技术、产品及解决方案,推动传统企业的数字化转型进程。2023年2月发布自研的多模态大模型产品“式说(4Paradigm SageGPT)”,已积累了数家国内最早的AIGC产业应用。目前已上市,有机会争取股票激励。•负责语音方向的设计和研发,模型的效果优化,包括不限于:参与语音识别、语音合成、声纹识别、语音评测等方向•将语音领域的算法应用于实际场景,解决真实业务问题•将实践中的创新点以Github Repo/Paper/Tech Report等形式开源
包括英文材料
学历+
语音识别+
https://developer.nvidia.com/blog/essential-guide-to-automatic-speech-recognition-technology/
Over the past decade, AI-powered speech recognition systems have slowly become part of our everyday lives, from voice search to virtual assistants in contact centers, cars, hospitals, and restaurants.
语音合成+
https://www.ibm.com/think/topics/text-to-speech
Text to speech (TTS) is a type of technology that converts text on a digital interface into natural-sounding audio.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Kaggle+
[英文] Kaggle Learn
https://www.kaggle.com/learn
Gain the skills you need to do independent data science projects.
GitHub+
[英文] GitHub Learn
https://learn.github.com/
Discover a wide range of beginner-friendly tutorials, hands-on learning, and expert-led lessons.
ACL+
https://www.aclweb.org/portal/
Computational linguistics is the scientific study of language from a computational perspective.
EMNLP+
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
相关职位

社招
岗位职责 1. 负责语音合成、语音克隆、双工语音通话等语音生成相关技术的数据和模型开发,并协助业务落地; 2. 负责持续跟进业界前沿算法发展方向,支持公司在核心技术上的影响力发展。
更新于 2024-12-09
校招研发类
1、负责参与语音算法能力构建,包括不限于语音识别、声学模型、语言模型、热词技术、语音合成、音频鉴伪等; 2、负责语音领域算法压缩量化、推理加速、小型化部署; 3、跟踪语音算法领域的前沿技术规划,参与核心算法与系统方案在业务的落地。
更新于 2025-08-08