字节跳动电商场景大模型/AIGC应用平台研发工程师-国际电商
社招全职A72041地点:上海状态:招聘
任职要求
1、对语言LLM/VLM/AIGC等多种模型、智能代理Agent技术、RAG技术、AutoPrompt技术有一定的使用经验; 2、对大模型的部署流程有基本的了解和认识; 3、具备熟练的Python/Go语言编程能力和良好的编程习惯和代码管理能力; 4、参与开发和优化大模型的自动评估技术,以提升模型的易用性和性能表现; 5、熟悉服务端基础技术,包括但不限于Ray、消息队列、ES等,能够高效地进行后端服务的开发与维护; 6、了解并能够使用常见的深度学习框架如Pytorch以及vLLM等,进行模型的推理和部署;需要具备良好的沟通技巧,能够与业务团队有效沟通,理解业务需求,并将其转化为技术实现。
工作职责
1、负责大模型、AIGC服务链路和应用平台的开发,支撑相关业务的生产与高效迭代; 2、设计和实现机器学习相关的基础设施、框架、工具链等,并推动落地到业务中; 3、负责大规模样本数据的管理、标注、预处理、存储等能力建设,提供训练和推理使用的基础设施保障; 4、构建适合电商场景的AI应用Workflow编排框架和平台,方便电商各业务搭建AI应用链路; 5、负责电商GPU资源管理和优化调度,并建设管理工具平台,优化GPU管理效率,提升资源池整体利用率; 6、探索业界前沿的深度学习相关技术,持续提升平台能力、降低研发与算法的使用成本。
包括英文材料
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
RAG+
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
编程规范+
[英文] Google Style Guides
https://google.github.io/styleguide/
Every major open-source project has its own style guide: a set of conventions (sometimes arbitrary) about how to write code for that project. It is much easier to understand a large codebase when all the code in it is in a consistent style.
Ray+
https://github.com/ray-project/ray
Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
https://www.youtube.com/watch?v=FhXfEXUUQp0
In this video, I'll teach you everything you need to know about Apache Ray!
https://www.youtube.com/watch?v=fMiAyj2kgac
Using powerful machine learning algorithms is easy using Ray.io and Python.
https://www.youtube.com/watch?v=q_aTbb7XeL4
Parallel and Distributed computing sounds scary until you try this fantastic Python library.
消息队列+
https://www.youtube.com/watch?v=xErwDaOc-Gs
ElasticSearch+
https://www.youtube.com/watch?v=a4HBKEda_F8
Learn about Elasticsearch with this comprehensive course designed for beginners, featuring both theoretical concepts and hands-on applications using Python (though applicable to any programming language). The course is structured in two parts: first covering essential Elasticsearch fundamentals including index management, document storage, text analysis, pipeline creation, search functionality, and advanced features like semantic search and embeddings; followed by a practical section where you'll build a real-world website using Elasticsearch as a search engine, working with the Astronomy Picture of the Day (APOD) dataset to implement features such as data cleaning pipelines, tokenization, pagination, and aggregations.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
vLLM+
https://www.newline.co/@zaoyang/ultimate-guide-to-vllm--aad8b65d
vLLM is a framework designed to make large language models faster, more efficient, and better suited for production environments.
https://www.youtube.com/watch?v=Ju2FrqIrdx0
vLLM is a cutting-edge serving engine designed for large language models (LLMs), offering unparalleled performance and efficiency for AI-driven applications.
相关职位
社招D7630
1、负责全国TOP级别的直播电商在B端业务场景中大模型的技术落地,支持业务目标提升; 2、负责大模型在智能经营、诊断分析、多模态创意生成等内容生成类场景中的应用,降低平台和商家的运营成本,提升运营效率; 3、负责大型语言模型的微调、偏好对齐、知识增强等技术探索,积极跟进AIGC业内应用趋势,包括并不限于多模态、RLHF、Agent等方向; 4、负责低代码平台与AI大模型应用场景落地(D2C、AI生成业务流程等),采用先进的算法工程方法,打造下一代AI低代码研发体系。
更新于 2025-04-22
社招D7630
1、负责全国TOP级别的直播电商在B端业务场景中大模型的技术落地,支持业务目标提升; 2、负责大模型在智能经营、诊断分析、多模态创意生成等内容生成类场景中的应用,降低平台和商家的运营成本,提升运营效率; 3、负责大型语言模型的微调、偏好对齐、知识增强等技术探索,积极跟进AIGC业内应用趋势,包括并不限于多模态、RLHF、Agent等方向; 4、负责低代码平台与AI大模型应用场景落地(D2C、AI生成业务流程等),采用先进的算法工程方法,打造下一代AI低代码研发体系。
更新于 2025-05-27
社招技术类
1)负责拼多多核心电商搜索、推荐、商业化场景大模型AIGC算法的开发与优化,支持业务场景(如AI交互式对话搜索,智能导购,图文创意生成、数字人等)高效落地; 2)负责大模型Agent、RAG系统全流程研发工作,包括样本标注,数据处理,模型训练(PreTrain、SFT、RL等),Prompt Engineer,WorkFlow设计与开发,评价指标设计; 3)负责Diffusion、Flux等算法在电商图像、视频生成领域的算法优化,追踪前沿技术,持续提升大模型内容生成的质量,赋能业务创新。
更新于 2025-09-01