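JD.com LLM / Large Model Algorithm Engineer
Experienced hire | Full-time | 2+ years of experience | Algorithm development | Location: Beijing | Status: Hiring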
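Requirements
1. Master's degree or above in computer science, artificial intelligence, NLP, or a related field, with at least 2 years of hands-on experience on large-model projects.
2. Expert in the PyTorch/TensorFlow frameworks, with experience optimizing distributed training and a strong command of CUDA programming on Linux.
3. Solid algorithm fundamentals: a deep understanding of the Transformer architecture, with production experience in at least two of RLHF, RAG, Agent, and related areas.
4. Outstanding engineering skills: proficient in at least two of Python/Java/Scala, and expert in big-data processing tools such as Spark/Flink.
5. In-depth research in multimodal learning and cross-modal representation learning, with project experience in joint vision-language modeling.
Preferred qualifications:
1. Papers published at top conferences such as ACL/EMNLP/CVPR.
2. Experience optimizing e-commerce recommendation/search systems at the scale of hundreds of millions of users.
3. Familiarity with e-commerce business scenarios.
Alignment with JD.com's values: customer first, innovation, hard work, accountability, gratitude, and integrity.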
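Responsibilities
1. Building on the strengths of JD.com's platform scenarios, lead the R&D and production deployment of large language models and multimodal large models, and build a new generation of AI-driven consumer-facing (C-end) e-commerce solutions.
2. Own algorithm optimization for key systems such as user behavior understanding, personalized recommendation, and intelligent shopping guidance, significantly improving purchase conversion and user retention through AIGC innovation.
3. Explore innovative application scenarios for large models in e-commerce, and continuously iterate on frontier technical approaches such as Prompt Engineering, RAG, and Agent.
4. Build industry-leading vision-language cross-modal systems, tackling technical challenges such as multimodal semantic alignment and long-context modeling.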
Includes English-language materials
Education
NLP
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
Large models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
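PyTorch
https://datawhalechina.github.io/thorough-pytorch/
PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.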
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
Linux
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
CUDA
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Learn how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Transformer
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
RAG
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
AI agent
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, starts from zero, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Java
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Scala
Spark
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
CVPR
https://cvpr.thecvf.com/
Related positions
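Experienced hire | Algorithm development
1. Apply natural language processing (NLP) algorithms to e-commerce business scenarios, including e-commerce tag production, product information extraction, product representation, knowledge-based Q&A, content understanding, and content generation.
2. Design, develop, and deploy large models, including high-quality dataset construction, prompt design, large-model training (continued pre-training, SFT, RLHF), and high-performance service deployment.
3. Survey frontier NLP techniques and apply them to concrete scenarios to improve business efficiency.
Updated 2025-07-08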
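Experienced hire | 3+ years | WeChat Search (搜一搜) technology
1. Research and deploy large-model applications in WeChat Search, advancing business scenarios such as AI generative Q&A, complex-semantic and multimodal retrieval, and multi-scenario query recommendation.
2. Combining frontier large-model techniques with the needs of search scenarios, research and develop efficient large-model algorithms from the perspectives of training data, model design, and training procedure, covering directions such as Post Pretrain, SFT, RM/RL, and RAG.
Updated 2025-09-22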
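Experienced hire | 3+ years | Technology - Algorithms
Apply and deploy LLMs in the software R&D domain, including but not limited to LLM, Agent/Multi-agent, Tool Learning, RAG, and RLHF; explore how large models and software R&D can be combined, and bring the results into production use.
1. Develop algorithm models, including but not limited to Embedding, Pre-train, SFT, and Self-instruct.
2. Participate in the full lifecycle of domain models, including but not limited to data, training, evaluation, and inference deployment, ensuring that the data is high-quality and valid.
3. Explore the use of Agents in complex tasks, delivering LLM-based complex-task applications in software R&D scenarios.
Updated 2025-08-19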