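Ant Financial / Ant Group - Senior Algorithm Expert, AI Large Models - Hangzhou [AI Force]
Experienced hire, full-time, 8+ years of experience, Technical - Algorithms. Location: Hangzhou. Status: Hiring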
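Requirements
1. Master's degree or above in applied mathematics, computer science, artificial intelligence, or a related field, with at least 8 years of relevant work experience.
2. At least 2 years of experience independently leading projects in large models, multimodal AI, or AI-Agent development; experience in intelligent customer service, intelligent debt collection, or outbound telesales is a plus.
3. Proficient with mainstream large models and with SFT, RLHF, and RAG; proficient in large-model techniques such as role-play, instruction following, and inference acceleration.
4. Proficient in programming languages such as Python and Java; familiar with common machine learning frameworks such as TensorFlow and PyTorch.
5. Keen insight and an exploratory mindset; curious about challenging problems and new technologies; good at combining theory with practice; a proactive learner who adapts quickly to a changing business environment.
Bonus: first-author publication of high-quality papers at top AI conferences (such as ACL, NeurIPS, ICML), especially on large models.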
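Responsibilities
1. In financial credit, marketing, and debt-collection scenarios, own the AI-Agent algorithm architecture design and the development of core Agents.
2. Own post-training and efficient learning for large language models, applying instruction following, reinforcement learning, continual learning, and related techniques to improve the response quality and conversion efficiency of conversational bots.
3. Apply large models, AI-Agent, multimodal, voiceprint, and traditional machine learning algorithms to solve key algorithmic problems of generative AI in finance; investigate and resolve efficiency bottlenecks and convergence issues in large-model post-training; improve the model's logic, reasoning, and generation capabilities.
4. Track trends in frontier large-model technology; learn, explore, and apply them to business scenarios to accelerate algorithm iteration and improve business efficiency across the board.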
Includes English-language materials
Education+
Large models+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
SFT+
https://cameronrwolfe.substack.com/p/understanding-and-using-supervised
Understanding how SFT works from the idea to a working implementation...
RAG+
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese; free, beginner-friendly, with complete examples, based on the latest Python 3.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
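PyTorch is an important tool for deep-learning-based data science research, with considerable advantages in flexibility, readability, and performance; in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.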
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Machine learning+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
NeurIPS+
https://neurips.cc/
ICML+
https://icml.cc/
ACL+
https://www.aclweb.org/portal/
Computational linguistics is the scientific study of language from a computational perspective.
RLHF+
[English] What is RLHF?
https://aws.amazon.com/what-is/reinforcement-learning-from-human-feedback/
Reinforcement learning from human feedback (RLHF) is a machine learning (ML) technique that uses human feedback to optimize ML models to self-learn more efficiently.
https://www.ibm.com/think/topics/rlhf
Reinforcement learning from human feedback (RLHF) is a machine learning technique in which a “reward model” is trained with direct human feedback, then used to optimize the performance of an artificial intelligence agent through reinforcement learning.
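Related positions
Experienced hire, 3+ years of experience, Technical - Algorithms
Train end-to-end speech or multimodal large models to deliver the best possible experience for inbound voice intelligent customer service, continually raising the share of calls handled by machine and lowering the share handled by humans:
1. Build high-quality training data, including business data and general data;
2. Model pre-training, fine-tuning, post-training, etc.;
3. Coordinate with the engineering team to develop an efficient voice intelligent customer service system;
4. Iterate on the system based on real business problems to improve metrics;
5. Track the latest industry progress, innovate in combination with the business, and turn the results into top-conference papers.
Updated 2025-10-13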
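Experienced hire, 8+ years of experience, Technical - Algorithms
1. Industry algorithm challenges: Lead the implementation and rollout of industry-grade customer service agent solutions for B2B commercial scenarios, and drive the delivery of algorithm-model solutions and results to business customers. Familiarity with technologies including but not limited to LLM, Agent/Multi-agent, Tool Learning, RAG, and RLHF; explore how large models combine with the B2B commercial domain and put them into production in the business.
2. Cross-domain collaborative innovation: Work with product and operations teams to define commercial metrics and the overall solution design, build a closed-loop optimization mechanism between algorithms and the business, and coordinate data platform and cloud computing resources to ensure the stability of algorithm services.
Updated 2025-10-13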
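Experienced hire, 5+ years of experience, Technical - Algorithms
1. Own algorithm R&D for on-device voice interaction models (speech + semantics, duplex), and work with the engineering team on deployment and performance optimization.
2. Own the application and deployment of speech recognition and speech synthesis algorithms for AI products, improving recognition accuracy and synthesis fluency, and tuning timbre and the overall user experience for each business scenario.
3. Track frontier speech AI technology and applications of large language models in speech, follow the industry's end-to-end speech large-model capabilities, evaluate adaptation options, and drive technology adoption.
4. Collaborate with product and back-end engineering teams to drive rapid iteration and business rollout of speech AI technology.
Updated 2025-09-12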