字节跳动多模态大模型算法工程师/Agent算法工程师-国际电商
社招全职A249408地点:北京状态:招聘
任职要求
1、在NLP、CV、多模态、大模型或Agent方向有较深入的研究或实践经验,包括但不限于多模态理解、多语言NLP、图像/视频理解、多模态检索、LLM/VLM、Embedding/SID、RAG、Tool Calling、Agent Planning等; 2、熟悉PyTorch/TensorFlow等深度学习框架,具备模型训练、调优和部署经验;了解分布式训练、混合精度训练、推理加速、TensorRT或大模型推理优化者优先; 3、具备较强的工程实践和业务落地能力,有多模态检索、商品理解、内容理解、搜索推荐、价格算法、智能Agent或A…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1、团队依托千亿级视频内容和全球商品数据,探索NLP、CV、多模态大模型、Video LLM、Multimodal Embedding和SID、Multimodal Reasoning、Agent等技术在电商业务中的落地,支持商品库建设、商品同款识别、内容商品链接、价格比对、AIGC内容生成和生成式搜推技术演进等关键场景,我们的目标是通过多模态大模型和Agent技术,重新理解“卖什么、怎么卖、卖给谁”,重塑内容电商的业务想象力;参与全球商品库和内容理解体系建设,基于NLP、图像理解、多模态大模型等技术,对商品、视频、图文、商家、品牌等对象进行结构化理解和语义建模;负责商品同款、商品/商家/品牌消重、跨语言商品聚合等核心算法系统建设,解决海量数据下的多模态匹配、实时流式聚合、多粒度聚合和跨语言语义对齐问题; 2、负责电商比价、商品定价、价格监控和异常预警等算法能力建设,支持百亿补贴、价格竞争力分析、商品供给优化等重要业务场景,建设视频-商品、商品-商品、视频-视频等多模态语义链接能力,支持内容趋势理解、商品挖掘、AIGC内容生成和内容电商供给优化; 3、探索下一代生成式搜推模型的演进,通过视频、直播、商品、Query等多体裁的表征和SID建模,提升生成式搜推的效率和体验; 4、探索Agent在电商业务中的应用,包括比价Agent、商品分析Agent、内容AIGC Agent、商家运营Agent等方向,结合RAG、工具调用、规划推理、自动评估等能力,推动业务流程智能化,参与从数据到业务落地的全流程,包括数据构建、特征工程、模型训练、效果评估、线上部署、负面案例分析和持续迭代,并探索前沿技术在实际业务中的规模化落地。
包括英文材料
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
RAG+
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
TensorRT+
https://docs.nvidia.com/deeplearning/tensorrt/latest/getting-started/quick-start-guide.html
This TensorRT Quick Start Guide is a starting point for developers who want to try out the TensorRT SDK; specifically, it demonstrates how to quickly construct an application to run inference on a TensorRT engine.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
还有更多 •••