腾讯元宝-语音大模型应用算法工程师需求
社招全职2年以上元宝技术地点:深圳状态:招聘
任职要求
1.精通语音大模型核心技术,如tokenizer,LLM,vocoder等,熟悉主流模型结构和原理; 2.熟悉语音数据生产流程;熟悉语音/文本数据的采集,标注和处理流程,精通Prompt Engineering技术; 3.精通机器学习模型的评测技术,熟悉机器学习模型常用评测指标和方法,具备设计和实施复杂评测方案的能力; 4.精通大模型的post-training,如SFT和RL,熟悉至少一种大模型框架,如deepspeed,fsdp,megatron等; 5.了解reasoning等技术; 6.沟通协作能力好,自驱力和分析力强; 7.有闲聊、情感陪伴实际场景应用经验者加分。 加分项 1.有对话模型的开发经验者优先; 2.有教育、心理学相关背景知识或项目经验者优先。
工作职责
1.负责语音大模型post-training (SFT和RL),针对业务需求进行优化,提升模型的特定能力(如共情能力、知识准确性); 2.负责后训练数据挖掘,分析,清洗和构建,建立数据驱动优化闭环,持续提升模型能力; 3.负责业务侧相关评估方法的开发,研发能够反映产品真实体感的评测体系标准与自动化评测技术,指导后训练优化方向; 4.探索多模态大模型的前沿技术,如端到端语音对话,情感交互等,并落地到业务产品。
包括英文材料
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Prompt+
https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/introduction-prompt-design
A prompt is a natural language request submitted to a language model to receive a response back.
https://learn.microsoft.com/en-us/azure/ai-foundry/openai/concepts/prompt-engineering
These techniques aren't recommended for reasoning models like gpt-5 and o-series models.
https://www.youtube.com/watch?v=LWiMwhDZ9as
Learn and master the fundamentals of Prompt Engineering and LLMs with this 5-HOUR Prompt Engineering Crash Course!
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
SFT+
https://cameronrwolfe.substack.com/p/understanding-and-using-supervised
Understanding how SFT works from the idea to a working implementation...
DeepSpeed+
https://www.youtube.com/watch?v=pDGI668pNg0
FSDP+
https://docs.pytorch.org/tutorials/intermediate/FSDP_tutorial.html
In DistributedDataParallel (DDP) training, each rank owns a model replica and processes a batch of data, finally it uses all-reduce to sync gradients across ranks.
https://www.youtube.com/watch?v=PjEwLgyzuzQ
FSDP provides a comprehensive framework for large model training in PyTorch.
Megatron+
https://www.youtube.com/watch?v=hc0u4avAkuM
相关职位
社招3年以上元宝技术
1.优化大规模商用语音识别系统,提高系统的鲁棒性和性能; 2.负责声学前端、声学模型、语言模型、后处理、解码器等主要模块的迭代和改进; 3.追踪业界前沿的语音技术,探索语音大模型在业务场景下的应用。
更新于 2025-08-02
社招3年以上元宝技术
1.负责分析大语言模型产品的安全风险,探索模型脆弱性并针对具体产品功能提出防御方案; 2.研究和应用NLP、机器学习等技术,采用不同安全识别算法解决不同场景的业务风险,降低有害内容生成概率; 3.跟踪安全大模型领域的前沿技术与行业动态,持续创新安全应用解决方案,保持产品的安全竞争力。
更新于 2025-08-02
社招元宝技术
1.负责大模型训练和推理系统的研发与性能优化,包括但不限于:模型计算性能优化、分布式大模型推理系统、大规模推理流量调度等; 2.负责解决系统高并发、高可靠性、高可扩展性等技术难关; 3.负责大模型训练和推理前瞻性技术架构的调研和引入,技术方案不限于子图匹配、编译优化、模型量化、本地及mooncake分布式kv store等; 4.与算法部门深度合作,进行算法与系统的联合优化。
更新于 2025-06-19