京东LLM大模型算法工程师
社招全职算法开发岗地点:北京状态:招聘
任职要求
1.具备扎实的机器学习理论基础,对自然语言处理(NLP)技术有深入的理解;有自然语言处理(NLP)实践经验,特别是大模型相关技术(比如SFT、RLHF、RAG、Agent等)应用经验; 2.具备良好的算法工程实践能力,有扎实的编程能力,熟练掌握pytorch框架,具有良好的质控意识; 3.具备良好的独立思考、解决问题和沟通协作的能力,能够协同团队实现技术落地,具备带领团队独立推进核心业务的能力; 4.具备良好的算法调研能力,能够对具体方向进行系统性研究并应用; 5.加分项:熟悉LLM相关算法、发表过顶会论文、具备大数据计算经验、熟悉hive/(py)spark等计算引擎。 符合京东价值观:客户为先、创新、拼搏、担当、感恩、诚信。
工作职责
1.负责自然语言处理(NLP)算法在电商域相关业务场景的赋能,包括电商标签生产/商品信息抽取/商品表征/知识问答/内容理解/内容生成等; 2.负责大模型设计、开发和落地工作,包括高质量数据集构建、Prompt设计、大模型训练(继续预训练、SFT、RLHF)、高性能服务部署等; 3.负责自然语言处理(NLP)前沿技术的调研,并结合具体场景进行应用,为业务提效。
包括英文材料
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
SFT+
https://cameronrwolfe.substack.com/p/understanding-and-using-supervised
Understanding how SFT works from the idea to a working implementation...
RAG+
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
相关职位
社招2年以上算法开发岗
1.基于京东平台场景优势,主导大语言模型/多模态大模型的技术研发与工程化落地,构建新一代AI驱动的电商C端解决方案; 2.负责用户行为理解、个性化推荐、智能导购等关键系统的算法优化,通过AIGC技术创新显著提升购物转化率与用户留存; 3.探索大模型在电商领域的创新应用场景,持续迭代Prompt Engineering、RAG、Agent等前沿技术方案; 4.构建行业领先的视觉、语言跨模态系统,攻克多模态语义对齐、长上下文建模等技术难题。
更新于 2025-07-09
社招3年以上搜一搜技术
1.大模型在微信搜索的应用落地研究,推进AI生成式问答、复杂语义和多模态检索、多场景查询推荐等业务场景的应用落地; 2.结合大模型技术前沿和搜索场景需求,从训练数据、模型设计、训练工艺等角度深入探索研发高效的大模型算法,包括Post Pretrain、SFT、RM/RL、RAG等方向。
更新于 2025-09-22
社招3年以上技术类-算法
负责 LLM 在软件研发领域的应用与落地,包括但不限于LLM、Agent/Multi-agent、 Tool Learning、RAG、RLHF等技术,探索大模型和软件研发领域的结合,实现在业务中的应用落地。 1、负责算法模型研发,包含但不限于Embedding、Pre-train、SFT、Self-instruct; 2、参与领域模型的全流程工作,包括但不限于数据、训练、评测、推理部署,保证数据的高质量和有效性; 3、探索Agent在复杂任务中的应用,实现基于LLM的复杂任务在软件研发领域场景的应用落地。
更新于 2025-08-19