虎牙语音大模型算法兼职实习生
社招全职MJ003945地点:广州状态:招聘
任职要求
1、本科及以上学历,AI、EE、CS等相关专业,研究方向为语音合成 / 语音大模型、自然语言处理或多模态等相关领域。 2、熟悉机器学习和深度学习理论,掌握生成模型、多模态大模型等一个或多个方向的理论与算法,具备相关方案实现能力与经验。 3、熟悉端到端语音大模型结构(如VocalNet、SLAM-Omni、GLM-Voice等)。 4、熟悉常见语音合成大模型框架(如CosyVoice、F5-TTS、Index-tts、…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
【】 1、负责语音合成系统/语音端到端大模型、全链路算法的技术预研和研发工作。 2、负责大模型的数据积累、框架建设等基建工作。 3、跟踪业界前沿技术,持续探索语音合成、端到端技术的新能力和新应用,提升核心能力。
包括英文材料
学历+
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
SLAM+
https://docs.mrpt.org/reference/latest/tutorial-slam-for-beginners-the-basics.html
[英文] SLAM for Dummies
https://dspace.mit.edu/bitstream/handle/1721.1/119149/16-412j-spring-2005/contents/projects/1aslam_blas_repo.pdf
A Tutorial Approach to Simultaneous Localization and Mapping
https://ouster.com/insights/blog/introduction-to-slam-simultaneous-localization-and-mapping
SLAM is an essential piece in robotics that helps robots to estimate their pose – the position and orientation – on the map while creating the map of the environment to carry out autonomous activities.
[英文] What Is SLAM?
https://www.mathworks.com/discovery/slam.html
How it works, types of SLAM algorithms, and getting started
还有更多 •••
相关职位
社招技术类
1、负责语音大模型的迭代与优化,涵盖语音识别、语音翻译、语音合成、音色克隆、智能语音对话、音乐生成等通用模型或垂直领域模型的技术升级; 2、跟踪前沿技术动态,开展深入研究,并撰写和发表相关领域高水平学术论文; 3、优化强化学习在语音大模型场景中的应用,推动多模态技术的深度融合; 4、深入研究端到端语音实时交互技术,解决跨语言理解、翻译与合成的关键问题,优化语音输入到多模态输出的全链路效果。
更新于 2025-02-05上海
实习核心本地商业-基
你将做什么: 1. 从事情语音大模型方向的前沿技术探索,包括但不限于语音交互大模型、omni 大模型、ASR、TTS、音频理解、音乐合成、音频多模态等方向。 2. 调研前沿工作,跟踪业界相关进展。 3. 算法研发和模型训练,包括但不限于代码编写、数据处理。
更新于 2025-07-21北京|上海