腾讯混元大语言模型后训练算法工程师-垂域方向(北京/深圳/上海)
社招全职1年以上混元-模型算法技术地点:北京状态:招聘
任职要求
1.背景要求:计算机、数学、统计学、法学、金融、医学信息等相关专业硕士及以上学历,2年以上NLP、搜索、知识工程或智能体相关经验,有大模型落地经验者优先; 2.算法能力:深入理解 Transformer 和 LLM 训练流程,熟悉 RAG 技术栈,对检索、上下文处理、多文档理解等有较好基础; 3.Agent 能力:熟悉推理类模型、Agent 框架及其在复杂专业任务中的应用,能够将搜索、数据分析、表格处理、文档处理、报告生成等能力组合成完整工作流; 4.技术功底:熟练掌握 PyTorch 等深度学习框…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1.专业领域智能体架构升级:负责面向金融、法务、医疗等专业领域的大模型智能体体系建设,利用 Agentic RAG 架构实现从基础检索问答到深度分析、专业研判与复杂任务执行的演进; 2.深度推理与规划:针对复杂专业任务指令,设计并实现基于推理类模型的任务拆解、多步规划(Planning)与流程编排策略,提升系统处理专业分析、跨文档归纳、决策支持等复杂问题的能力; 3.事实核查与结果可靠性:建立可靠的事实核查(Fact-checking)与结果校验机制,通过证据溯源、多源信息比对、结构化校验等手段,解决大模型在专业场景中的幻觉与失真问题,确保生成结果的真实性、严谨性与可追溯性; 4.前沿技术转化:探索推理类模型在专业领域 Agent 的落地,包括基于过程监督的思维链(CoT)优化、面向专业任务反馈的强化学习(RL)策略,以及搜索、数据分析、文件处理等多工具协同能力的持续演进。
包括英文材料
学历+
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
智能体+
https://learn.microsoft.com/en-us/shows/ai-agents-for-beginners/
In this 10-lesson course we take you from concept to code while covering the fundamentals of building AI agents.
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
还有更多 •••