Baidu Large Model (LLM and AIGC) Strategy Algorithm Engineer (J73262)
Experienced hire · Full-time · MEG · Location: Beijing · Status: Hiring
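Requirements
- Proficient in programming languages such as C++/Python; skilled with distributed tools such as Hadoop/Spark/Hive; proficient in scripting (shell/Python/Perl, etc.)
- Some theoretical background and hands-on experience in areas including machine learning, deep learning, reinforcement learning, natural language processing, recommender systems, and information retrieval
- Solid understanding of data structures and algorithms; strong hands-on skills; motivated, curious, and a fast learner
- Good logical thinking and excellent analytical and problem-solving skills
- Good communication skills and a strong sense of teamwork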
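Responsibilities
- Take part in core LLM and AIGC algorithm work; drawing on massive content and cutting-edge large models, support algorithm work for Wenku and improve the results of Wenku's innovative business lines
- Own content generation and AI editing (prompt optimization and personalization, P-tuning, large-model fine-tuning, etc.), content understanding (quality grading, content structuring, smart tags/summaries, etc.), and scenario applications (need understanding, user profiling, personalized recommendation)
- Research and evaluate industry-leading AI technologies, model product goals, continuously optimize model performance, and provide core technical support
- Focus on improving user experience; use data insights to uncover latent product value and needs, and drive product growth through technical innovation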
Includes English-language materials
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, for absolute beginners, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Hadoop
https://www.runoob.com/w3cnote/hadoop-tutorial.html
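Hadoop provides reliable, scalable application-layer computing and storage support for large computer clusters. It allows distributed processing of large data sets across clusters of computers using simple programming models, and it scales from a single machine to several thousand machines.
[English] Hadoop Tutorial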
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Spark
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Hive
[English] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Scripting
[English] Scripting language
https://en.wikipedia.org/wiki/Scripting_language
https://zhuanlan.zhihu.com/p/571097954
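A script is usually interpreted rather than compiled. Scripting languages are typically simple, easy to learn, and easy to use, with the aim of letting programmers get programs written quickly.
Bash
[English] The Bash Guide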
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
Perl
https://www.perl.org/learn.html
Useful links if you are interested in learning Perl
https://www.runoob.com/perl/perl-tutorial.html
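This tutorial is aimed at developers who want to learn the Perl programming language from scratch. It also goes deeper into some modules to give you a better understanding of how Perl is used.
Machine Learning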
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Deep Learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
Reinforcement Learning
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
NLP
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
Recommender Systems
[English] Recommender Systems
https://www.d2l.ai/chapter_recommender-systems/index.html
Recommender systems are widely employed in industry and are ubiquitous in our daily lives.
Information Retrieval
https://nlp.stanford.edu/IR-book/information-retrieval-book.html
Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008.
Data Structures
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
A full course tutorial on data structures and algorithms in Java.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
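Related positions
Experienced hire · MEG
- Take part in core LLM and AIGC algorithm work; drawing on massive content (text, images, video) and cutting-edge large models, support algorithm work for the culture, education, and interactive entertainment business and improve business results
- Following frontier LLM directions, own AI content generation and editing (prompt design, large-model SFT and pre-training, reinforcement learning for large models, etc.), content understanding and quality identification (quality grading, content structuring, smart tags/summaries, high-quality copy, etc.), and scenario applications (need understanding, user profiling, personalized recommendation)
- Familiar with frontier AIGC techniques such as CLIP, Stable Diffusion, ControlNet, Imagen, and Dreambooth. Combine large models (ERNIE Bot, 文心一言) with AIGC techniques to support multimodal scenarios such as PPT generation, personal résumés, and dialogue systems
- Focus on improving user experience; use data insights to uncover latent product value and needs, and drive product growth through technical innovation
Updated 2025-02-05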
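Experienced hire · MEG
- Own R&D and tuning of application-layer algorithms for large models; optimize the algorithms behind core modules such as dialogue systems, content generation, and intent understanding; use LLMs to understand in depth what users need, and improve the model's reasoning ability and the user experience in complex scenarios
- Build user-content dynamic matching algorithms; develop personalized recommender systems that draw on large-model capabilities; develop text/speech/vision multimodal fusion algorithms; explore best practices for new human-computer interaction paradigms on mobile; drive rapid growth in product scale
Updated 2025-03-04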
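Experienced hire · 3+ years · International Business AI &
1. Take part in algorithm research and strategy development for user growth in Ctrip's international business;
2. Work closely with product, operations, and engineering teams to identify and implement opportunities for algorithmic strategies, improving business results in scenarios such as new-user onboarding, raising the purchase frequency of existing users, and winning back churned users;
3. Explore and iterate on personalized recommendation techniques for Ctrip's international business, and deploy them in scenarios such as EDM marketing and in-app personalized push notifications;
4. Using massive user-behavior and product data together with techniques such as data mining, build and continuously iterate on systems such as user profiling and product understanding;
5. Use multimodal understanding and AIGC capabilities to automate the generation of marketing creatives, improving the efficiency of headquarters and of local operations teams in each country/region.
Updated 2025-03-04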