腾讯混元AGI模型架构研究员
社招全职3年以上公共技术地点:深圳状态:招聘
任职要求
1.精通 Transformer 类模型及其在语言、多模态领域的架构设计与优化; 2.有构建或优化超大规模模型(>Billion-scale)经验,熟悉SFT、RLHF、自监督等训练范式; 3.在以下方向有深入理解或实践经验者优先:; 4.a、多模态模型(如视觉语言模型、音视频模型); 5.b、强…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1.设计具备多模态联合感知、推理、记忆与生成能力的统一大模型架构(视觉/音频/文本); 2.构建支持持续学习、多级记忆、主动探索和自演进的大模型系统; 3.推进 agent化方向,使模型具备自主任务规划、跨模态交互、工具使用和自我优化能力; 4.深度参与通用表征、音视频同频建模、世界模型、稀疏建模等关键模块的设计与实现; 5.跟踪并研究前沿技术趋势,推动创新技术在项目中的应用。
包括英文材料
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
系统设计+
https://roadmap.sh/system-design
Everything you need to know about designing large scale systems.
https://www.youtube.com/watch?v=F2FmTdLtb_4
This complete system design tutorial covers scalability, reliability, data handling, and high-level architecture with clear explanations, real-world examples, and practical strategies.
SFT+
https://cameronrwolfe.substack.com/p/understanding-and-using-supervised
Understanding how SFT works from the idea to a working implementation...
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
还有更多 •••
相关职位
社招3年以上AI技术
1.负责大模型全量数据的向量化检索和向量化排重服务,实现实时向量化排重系统; 2.通过向量化检索性能的优化,提升文本排重的准确率和性能; 3.建立大模型数据清洗的反馈机制和反馈系统,提升全网数据抓取效率; 4.参与大模型的数据工程开发,为大模型提供基础数据。
更新于 2025-11-21深圳
社招3年以上TEG产品
1.通过对广泛业务中用户行为和反馈的研究,确定自研LLM的改进空间、优先级,以及相应的改进手段; 2.与业务团队合作,将混元模型能力整合到产品及服务中; 3.对齐数据采集和生产的方法,确保数据质量保持在高标准,并根据定量和定性反馈不断改进流程,有一到两个行业的专有数据经验优先。
更新于 2025-06-20深圳
社招3年以上AI技术
1.负责TTS、ASR、声学前处理、自然语言处理、多模态大模型等AI系统的工程开发(包括训练工具和推理引擎的开发、优化、交付等); 2.负责AI系统最新算法的集成、工程化、实际场景效果验证、优化、上线; 3.负责AI相关业务、产品的工程支持,在效果和性能上更好的落地。
更新于 2025-09-12深圳