字节跳动大模型应用算法工程师-飞书People
社招全职A191816A地点:北京状态:招聘
任职要求
1、熟悉NLP、LLM相关的算法和技术,熟悉或应用过多模态大模型,有大模型训练和评测数据集建设、指令微调、偏好对齐、效果评测经验者优先; 2、熟悉NER、文本分类、信息抽取等NLP技术,掌握正则表达式处理技巧,了解LayoutLM、PaddleOCR 、Qwen-vl等文档解析技术者优先; 3、优秀的代码能力、数据结构和基础算法功底,熟练Python或C/C++,ACM/ICPC、Top Coder、Kaggle等比赛获奖者优先; 4、出色的问题分析和解决能力,能深入解决大模型训练和应用存在的问题; 5、良好的沟通协作能力,能和团队一起探索新技术,推进技术进步。
工作职责
1、简历智能解析提取结构化信息、简历筛选和岗位匹配,个性化定制面试题,AI分维度面试评价和整体总结,绩效评估多人反馈总结,对话式数据统计分析; 2、建设和调优满足场景应用的意图识别、实体识别、问题拆解策划、工具调用、相关性排序、理解生成模型能力; 3、People领域高质量语料构建,针对具体应用场景的指令集构建、评测体系和指标设计及评测数据构建; 4、大模型Post-training、Fine-tuning,强化学习偏好对齐,Prompt设计和优化; 5、调研和尝试AI行业的前沿技术,推动技术在实际应用场景落地和效果优化。
包括英文材料
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Kaggle+
[英文] Kaggle Learn
https://www.kaggle.com/learn
Gain the skills you need to do independent data science projects.
相关职位
社招A201481A
1、业务应用:负责将自研的算法模型应用于企业协同软件中,在会议、文档、消息、办公智能体等诸多办公场景中打造最好的AI工具和产品,不断改善用户体验; 2、模型优化:负责训练大语言模型不断提高其在办公领域的算法质量;建设高效的评测方法和技术体系;采集、调研并生产办公领域的高质量数据集; 3、技术建设:持续关注业界最新的技术趋势和研究成果,分享行业最佳实践,将前沿技术应用于大模型中。
更新于 2024-10-22
社招A96700
1、业务应用:负责将自研的算法模型应用于企业协同软件中,在会议、文档、消息、办公智能体等诸多办公场景中打造最好的AI工具和产品,不断改善用户体验; 2、模型优化:负责训练大语言模型不断提高其在办公领域的算法质量;建设高效的评测方法和技术体系;采集、调研并生产办公领域的高质量数据集; 3、技术建设:持续关注业界最新的技术趋势和研究成果,分享行业最佳实践,将前沿技术应用于大模型中。
更新于 2024-11-20
社招A60155
1、业务应用:负责将自研的算法模型应用于企业协同软件中,在会议、文档、消息、办公智能体等诸多办公场景中打造最好的AI工具和产品,不断改善用户体验; 2、模型优化:负责训练大语言模型不断提高其在办公领域的算法质量;建设高效的评测方法和技术体系;采集、调研并生产办公领域的高质量数据集; 3、技术建设:持续关注业界最新的技术趋势和研究成果,分享行业最佳实践,将前沿技术应用于大模型中。
更新于 2024-10-22