美团商业增值-大模型应用算法工程师
社招全职核心本地商业-业务研发平台地点:北京状态:招聘
任职要求
1. 硕士及以上学历,计算机科学、人工智能、自然语言处理或相关专业; 2. 扎实的算法基础,熟悉自然语言处理相关理论,熟悉Transformer/BERT/GPT等基础模型结构,熟悉大模型的微调和评估方法; 3. 熟练掌握至少一种深度学习框架,如PyTorch、TensorFlow等; 4. 善于分析和发现问题,具备较强的自驱力,具有良好的沟通能力和跨团队协作能力,热衷于追求技术创新,对解决有挑战性的问题充满激情。 具备以下条件优先 1. 有SIGIR、ACL、WWW等NLP领域顶会论文发表经验验,或在具有影响力的NLP比赛中获得top名次,或开源社区项目开发经验; 2. 具有生成式模型训练及开发经验,如大模型数据处理、模型微调、预训练、强化学习、内容安全等,了解Megatron,deepspeed,vllm等训练或推理加速框架。 3. 有LangChain、AutoGPT或其他大模型框架开发经验者优先; 4. 有智能客服、智能助手、Agent相关项目经验者优先。
工作职责
1. 负责商业增值业务场景中大模型的技术落地,支持业务目标提升; 2. 负责大模型在智能助手、电销、IM等交互式对话场景中的应用,提升场景的自动化与智能化水平,改进商家的交互体验,提升广告供给; 3. 负责大模型在经营诊断分析、多模态创意生成等内容生成类场景中的应用,降低平台和商家的运营成本,提升运营效率; 4. 负责大型语言模型的微调、偏好对齐、知识增强等技术探索,积极跟进AIGC业内应用趋势,包括并不限于多模态、RLHF、Agent等方向; 5. 与其他团队密切合作,包括数据工程师、前后端工程师、产品经理等,实现高质量的产品和解决方案。
包括英文材料
学历+
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
BERT+
https://www.youtube.com/watch?v=xI0HHN5XKDo
Understand the BERT Transformer in and out.
GPT+
https://www.youtube.com/watch?v=kCc8FmEb1nY
We build a Generatively Pretrained Transformer (GPT), following the paper "Attention is All You Need" and OpenAI's GPT-2 / GPT-3.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
Megatron+
https://www.youtube.com/watch?v=hc0u4avAkuM
DeepSpeed+
https://www.youtube.com/watch?v=pDGI668pNg0
vLLM+
https://www.newline.co/@zaoyang/ultimate-guide-to-vllm--aad8b65d
vLLM is a framework designed to make large language models faster, more efficient, and better suited for production environments.
https://www.youtube.com/watch?v=Ju2FrqIrdx0
vLLM is a cutting-edge serving engine designed for large language models (LLMs), offering unparalleled performance and efficiency for AI-driven applications.
LangChain+
https://python.langchain.com/docs/tutorials/
New to LangChain or LLM app development in general? Read this material to quickly get up and running building your first applications.
https://www.freecodecamp.org/news/beginners-guide-to-langchain/
LangChain is a popular framework for creating LLM-powered apps.
AutoGPT+
[英文] What is AutoGPT?
https://www.ibm.com/think/topics/autogpt
https://www.youtube.com/watch?v=v-5AWQlTFw8
Someone has created a version of ChatGPT called AutoGPT and it’s a lot more powerful.
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
相关职位
社招1年以上核心本地商业-业
1. 负责美团餐饮广告机制策略研发,研究合理的广告售卖机制,保障用户体验、优化广告投放效果、提升平台广告收入。 2. 跟产品和运营密切协作,结合行业与场景特点,利用深度学习、强化学习、运筹优化、大模型等前沿技术持续优化广告排序与计费算法、广告与非广告的混排算法。
更新于 2025-02-26
社招2年以上核心本地商业-业
作为境外广告算法核心骨干,您将参与主导系统从0到1的核心算法策略建设,推动我们的广告产品在全球市场的成功。您将在多个关键领域发挥作用,包括但不限于推荐广告的C端模型冷启动优化、浮动广告位机制策略,以及多国场景下的通用化投放能力模式探索;搜索广告的全链路能力建设,包括多语言下的Query理解、召回和相关性等能力构建;以及B端商家智能化投放和诊断能力的开发,包括智能出价、预算分配和投放建议/诊断等。 岗位职责: 1、设计和实现高效的广告推荐算法,优化C端模型的冷启动问题,并提升用户体验。 2、开发和优化浮动广告位机制,确保在多种国际场景下的广告投放效率和效果。 3、构建和维护搜索广告的全链路能力,包括但不限于多语言环境下的理解、召回和相关性算法。 4、为B端商家开发智能化投放工具,包括智能出价、预算优化、投放建议和诊断系统。 5、理解不同市场的广告需求,探索和实现通用化的广告投放模式。
更新于 2025-02-26