京东大模型算法工程师(LLM/MLLM)
社招全职算法开发岗地点:北京状态:招聘
任职要求
1.具备扎实的机器学习理论基础,对大语言模型或多模态大模型技术有深入的理解。有自然语言处理(NLP)或多模态实践经验,特别是大模型相关技术(比如SFT、强化学习、RAG、Agent等)应用经验。对至少一项生成式模型的原理与应用具有深入了解,如DeepSeek、Qwen系列、GPT4o系列、LLaVa系列等模型; 2.具备良好的算法工程实践能力,有扎实的编程能力,熟练掌握pytorch框架,具有良好的质控意识。加分项:具备大数据计算经验,熟悉hive/(py)spark等计算引擎; 3.具备良好的独立思考、解决问题和沟通协作的能力,能够协同团队实现技术落地,具备带领团队独立推进核心业务的能力; 4.具备良好的算法调研能力,能够对具体方向进行系统性研究并应用。发表过大模型相关研究AI顶会论文优先;具有影响力竞赛,如Kaggle等。 符合京东价值观:客户为先、创新、拼搏、担当、感恩、诚信。
工作职责
1.负责大语言模型或多模态大模型算法在电商域相关业务场景的赋能,包括电商标签生产/商品信息抽取/商品表征/知识问答/内容理解/内容生成等; 2.负责大语言模型或多模态大模型设计、开发和落地工作,包括高质量数据集构建、Prompt设计、大模型训练(继续预训练、SFT、RLHF)、高性能服务部署等; 3.紧跟业界大语言模型或多模态大模型等方向进展,探索前沿技术并结合具体场景进行应用,为业务提效,形成系列算法/大模型解决方案,推动大模型效果达到行业领先。
包括英文材料
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
SFT+
https://cameronrwolfe.substack.com/p/understanding-and-using-supervised
Understanding how SFT works from the idea to a working implementation...
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
RAG+
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Kaggle+
[英文] Kaggle Learn
https://www.kaggle.com/learn
Gain the skills you need to do independent data science projects.
相关职位
社招A83233
1、负责优化抖音电商短视频/带货直播的安全算法,解决虚假宣传/违规营销/服务履约等问题; 2、通过大模型赋能治理全链路做自动化:感知,召回,处置,实现治理业务全面智能化; 3、负责强化电商治理场景下,大模型推理和反思能力,做到治理业务下高Cot在各环节自动拆解任务并执行; 4、围绕大模型建立起在线规则库和案例库,通过Rag等方式实现治理知识体系动态运维管理和自动化上线; 5、探索推理大模型相关前沿技术,并在考虑性能和推理成本最佳平衡方案中大规模落地应用。
更新于 2025-02-19
社招A34934
1、负责优化抖音电商短视频/带货直播的安全算法,解决虚假宣传/违规营销/服务履约等问题; 2、通过大模型赋能治理全链路做自动化:感知,召回,处置,实现治理业务全面智能化; 3、负责强化电商治理场景下,大模型推理和反思能力,做到治理业务下高Cot在各环节自动拆解任务并执行; 4、围绕大模型建立起在线规则库和案例库,通过Rag等方式实现治理知识体系动态运维管理和自动化上线; 5、探索推理大模型相关前沿技术,并在考虑性能和推理成本最佳平衡方案中大规模落地应用。
更新于 2025-02-19
社招A196013
1、负责优化抖音电商售后体验方向相关算法工作,解决消费者商家服务和仲裁环节的售后体验、服务权益等问题; 2、通过优化算法,在对消费者遇到问题时进行精准识别,支持平台主动介入解决问题,提升消费者购物体验; 3、建设售后服务MLLM基座大模型,并利用RAG/Agent/RL等技术,解决复杂场景下对体验问题的理解能力; 4、协同业务方,进行问题分析拆解,并整体规划算法工作。
更新于 2025-02-19