快手大模型应用研发工程师(智能客服)-【电商】
社招全职D7630地点:杭州状态:招聘
任职要求
1、本科及以上学历,计算机科学、人工智能、自然语言处理或相关专业; 2、熟悉自然语言处理相关基础理论,熟悉大模型的微调和评估方法;具有生成式模型训练及开发经验,如大模型数据处理、模型微调、预训练、强化学习、内容安全等,了解Megatron,deepspeed,vllm等训练或推理加速框架; 3、对大模型相关技术(Llama、GPT、ChatGLM、LangChain、AutoGen、Code Interpreter等)有Finetuning、应用或落地实践优先; 4、 有LangChain、AutoGPT或其他大模型框架开发经验者或AI低代码、智能助手、Agent相关项目经验者优先; 5、具有良好的沟通能力和跨团队协作能力,热衷于追求技术创新,对解决有挑战性的问题充满激情。
工作职责
1、负责全国TOP级别的直播电商在B端业务场景中大模型的技术落地,支持业务目标提升; 2、负责大模型在智能经营、诊断分析、多模态创意生成等内容生成类场景中的应用,降低平台和商家的运营成本,提升运营效率; 3、负责大型语言模型的微调、偏好对齐、知识增强等技术探索,积极跟进AIGC业内应用趋势,包括并不限于多模态、RLHF、Agent等方向; 4、负责低代码平台与AI大模型应用场景落地(D2C、AI生成业务流程等),采用先进的算法工程方法,打造下一代AI低代码研发体系。
包括英文材料
学历+
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
Megatron+
https://www.youtube.com/watch?v=hc0u4avAkuM
DeepSpeed+
https://www.youtube.com/watch?v=pDGI668pNg0
vLLM+
https://www.newline.co/@zaoyang/ultimate-guide-to-vllm--aad8b65d
vLLM is a framework designed to make large language models faster, more efficient, and better suited for production environments.
https://www.youtube.com/watch?v=Ju2FrqIrdx0
vLLM is a cutting-edge serving engine designed for large language models (LLMs), offering unparalleled performance and efficiency for AI-driven applications.
Llama+
https://github.com/LlamaFamily/Llama-Chinese
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用。
https://www.llama.com/docs/overview/
This guide provides information and resources to help you set up Llama including how to access the model, hosting, how-to and integration guides.
GPT+
https://www.youtube.com/watch?v=kCc8FmEb1nY
We build a Generatively Pretrained Transformer (GPT), following the paper "Attention is All You Need" and OpenAI's GPT-2 / GPT-3.
ChatGLM+
https://www.youtube.com/watch?v=EXUX0MjBzI0
In this step-by-step tutorial, you'll learn how to use ChatGLM, one of the most powerful and completely free AI video generators available today.
https://www.youtube.com/watch?v=fGpXj4bl5LI
Exploring the concept of a GLM (General Language Model) and working with ChatGLM6B.
LangChain+
https://python.langchain.com/docs/tutorials/
New to LangChain or LLM app development in general? Read this material to quickly get up and running building your first applications.
https://www.freecodecamp.org/news/beginners-guide-to-langchain/
LangChain is a popular framework for creating LLM-powered apps.
AutoGen+
https://microsoft.github.io/autogen/0.2/docs/Getting-Started/
AutoGen is an open-source programming framework for building AI agents and facilitating cooperation among multiple agents to solve tasks.
https://www.youtube.com/watch?v=JmjxwTEJSE8
Whether you know everything there to AI Agents or are a complete beginner, I believe there is something to learn here.
FineTuning+
[英文] Fine-Tuning
https://d2l.ai/chapter_computer-vision/fine-tuning.html
In this section, we will introduce a common technique in transfer learning: fine-tuning.
[英文] Fine-tuning
https://huggingface.co/docs/transformers/en/training
Fine-tuning adapts a pretrained model to a specific task with a smaller specialized dataset.
AutoGPT+
[英文] What is AutoGPT?
https://www.ibm.com/think/topics/autogpt
https://www.youtube.com/watch?v=v-5AWQlTFw8
Someone has created a version of ChatGPT called AutoGPT and it’s a lot more powerful.
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
相关职位
社招3-5年D7630
1、负责电商侧B&M端业务的算法相关研发和应用:比如智能客服、智慧直播、智能小二、智能取数、智能补贴、智能装修、智能组货等; 2、负责大模型训练和调优需要的数据处理、模型优化、数据评测等工作; 3、实施和优化Fine-tuning策略,以提高大模型在特定应用场景下应用效果; 4、带领部分同学基于大模型技术构算法服务,解决业务中的场景问题并提升业务价值。
更新于 2025-09-15
社招D7094
1、负责大模型在金融支付方向的应用落地,支持达成业务以及技术的指标; 2、负责大模型在智能营销、智能推荐、智能风控等业务领域的应用落地,降低平台运营成本、助力金融支付业务达成目标; 3、负责大模型在智能监控、智能巡检、智能Oncall等技术领域的应用落地,降低平台运营成本、提升金融支付系统稳定性; 4、负责大模型在工程领域的应用落地范式的探索,积极探索微调、检索增强、提示词工程等技术,跟进业内大模型应用趋势。
更新于 2025-07-01
社招D7094
1、负责大模型在金融支付方向的应用落地,支持达成业务以及技术的指标; 2、负责大模型在智能营销、智能推荐、智能风控等业务领域的应用落地,降低平台运营成本、助力金融支付业务达成目标; 3、负责大模型在智能监控、智能巡检、智能Oncall等技术领域的应用落地,降低平台运营成本、提升金融支付系统稳定性; 4、负责大模型在工程领域的应用落地范式的探索,积极探索微调、检索增强、提示词工程等技术,跟进业内大模型应用趋势。
更新于 2025-07-23