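Alibaba 1688 Technology Department - LLM Algorithm Engineer - AI
Experienced hire · Full-time · 2+ years · Location: Hangzhou · Status: Hiring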
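Requirements
1. Bachelor's degree or above in computer science, artificial intelligence, or a related field, with strong enthusiasm for AI;
2. Experience developing AI models and familiarity with at least one deep learning framework (such as TensorFlow or PyTorch);
3. Proficiency in Python; knowledge of other programming languages such as C/C++ or Java is a plus;
4. Familiarity with common machine learning and deep learning algorithms; hands-on project development experience is a plus;
5. Good project management skills and team spirit, and the ability to work under pressure;
6. Excellent analytical and problem-solving skills, with strong curiosity about new technologies and a willingness to learn;
7. Good English reading and writing skills, with the ability to read and understand technical documentation fluently.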
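Responsibilities
Business description: China's cross-border e-commerce is growing rapidly and has enormous room to expand. Building on its strengths in factory-direct goods and supply chains, 1688 is creating a next-generation, AI-driven, decentralized digital supply chain that reshapes the entire upstream and downstream industry chain of Chinese cross-border e-commerce. On one side, starting from user needs, it is building a general-purpose Agent for the cross-border e-commerce vertical that connects and empowers traditional industry SaaS and serves as the seller's operating entry point in the AI era. On the other side, starting from the source of the supply chain, it is building a business-opportunity Agent with comprehensive global data, high accuracy, and strong trend awareness, which uses business opportunities to orchestrate the whole supply chain system and progressively turns product selection, sourcing, assortment, and order follow-up into Agents, yielding an AI-driven digital supply chain system.
Scope:
1. For the cross-border business, advance post-training work on top of base LLMs, and provide LLM algorithm data and evaluation work for an Agent-oriented development model;
2. Based on the business-side (B-end) needs of e-commerce users, design and expand the range and scale of LLM application scenarios, improve the application of fine-tuned models in vertical domains, and explore expert-mode architectures, including but not limited to building core AI capabilities such as AIGC creative assets, multilingual intelligent customer service, and AI product-selection tools.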
Includes English-language materials
Education
Deep learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
TensorFlow
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
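PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.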
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, starts from zero, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction to all of the core concepts in Python.
C
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
This handbook follows the 80/20 rule: you will learn 80% of the C programming language in 20% of the time.
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Java
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Machine learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
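Related positions
Experienced hire A82032
1. Design and implement algorithms and solutions for AIOps, including time-series analysis, log mining, failure prediction, root-cause correlation inference, and intelligent decision-making;
2. Explore practical applications of LLM x AIOps, including but not limited to anomaly detection, root-cause localization, and loss mitigation and disaster recovery;
3. Keep track of cutting-edge LLM techniques and open-source solutions, and their application in AIOps.
Updated 2024-05-17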
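Experienced hire · 1+ years
Background: We are building a domain-specific large model with a deep understanding of Taobao's R&D context. The goal is an AI model with “architect-level” insight that fundamentally improves R&D efficiency and quality. If you are eager to push LLM capabilities to new heights in a real, complex setting and to shape the future of next-generation software development with your own hands, we look forward to having you join us!
1. Domain model training: Own the core algorithms of the Taobao R&D domain model and lead training workflows such as continual pre-training, instruction fine-tuning (SFT), and alignment (RLHF/DPO);
2. Knowledge injection and reasoning: Design and apply novel data strategies to efficiently inject heterogeneous R&D knowledge such as code, documentation, and configuration into the model; strengthen the model's deep understanding of software engineering and its complex reasoning through paradigms such as multi-task learning and FIM;
3. Capability evaluation and iteration: Build a rigorous evaluation framework to accurately measure the model on advanced tasks such as code provenance tracing, impact analysis, and fault diagnosis; analyze bad cases to drive closed-loop optimization of data and algorithms.
Updated 2025-08-25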
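Experienced hire A161843
1. Own Memory algorithm work and drive the exploration and application of cutting-edge techniques;
2. Improve natural language understanding, for example intent recognition, NL2SQL, vector retrieval, and representation learning for structured/unstructured and short/long text;
3. Improve ranking model performance by mining features, upgrading model architectures, and optimizing information-lookup efficiency;
4. Apply cutting-edge LLM techniques to explore summarizing, understanding, and profiling user behavior.
Updated 2023-11-29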