ByteDance Large Model Application Algorithm Research Engineer - International Content Safety Algorithm Research - Jindouyun (筋斗云) Talent Program

Campus recruitment | Full-time | A238623 | Location: Singapore | Status: Hiring

Qualifications


1. A doctoral degree in Computer Science, Electronics, or a related field.
2. Extensive experience in ML/CV/NLP/recommendation systems, including but not limited to:
 a. Participation in competitions or industry projects in ML, data mining, CV, NLP, or multimodal learning.
 b. Publications at conferences in ML, data mining, AI, or large models (e.g., KDD, WWW, NIPS, ICML, CVPR, ACL, AAAI).
 c. Plus points:
  1) Research experience or innovation in large models or RL.
  2) Strong hands-on skills, with contributions to large model projects in the open-source community.
  3) Practical experience deploying large models in real-world business scenarios.
3. Strong programming skills and proficiency in Python, C++, or other relevant programming languages.
4. Outstanding problem-solving and analytical skills, with a passion for tackling challenging problems.
5. Strong enthusiasm for technology, with excellent communication skills and a collaborative mindset.

Responsibilities


Team Introduction:
TikTok Content Security Algorithm Research Team
The International Content Safety Algorithm Research Team is dedicated to maintaining a safe and trustworthy environment for users of ByteDance's international products. We develop and iterate on machine learning models and information systems to identify risks earlier, respond to incidents faster, and monitor potential threats more effectively. The team also leads the development of foundational large models for products. In the R&D process, we tackle key challenges such as data compliance, model reasoning capability, and multilingual performance optimization. Our goal is to build secure, compliant, and high-performance models that empower various business scenarios across the platform, including content moderation, search, and recommendation.

Research Project Background:
In recent years, Large Language Models (LLMs) have made remarkable progress across natural language processing (NLP) and artificial intelligence, with impressive results in tasks such as language generation, question answering, and text translation. Reasoning, however, remains a key area for improvement. Current approaches to enhancing reasoning often rely on large amounts of Supervised Fine-Tuning (SFT) data, and acquiring high-quality SFT data is expensive, which poses a significant barrier to scalable model development and deployment.

To address this, OpenAI's o1 series of models has made progress by increasing the length of the Chain-of-Thought (CoT) reasoning process. While this technique has proven effective, how to scale it efficiently at test time remains an open question. Recent research has explored alternative methods such as Process-based Reward Models (PRMs), Reinforcement Learning (RL), and Monte Carlo Tree Search (MCTS) to improve reasoning. However, these approaches still fall short of the general reasoning performance of the o1 series. Notably, the recent DeepSeek R1 paper suggests that pure RL methods can enable LLMs to develop reasoning skills autonomously, without relying on expensive SFT data, revealing the substantial potential of RL for advancing LLM capabilities.

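Research Project Challenges:
1. Reward model design: In reinforcement learning, designing a suitable reward model is critical. The reward model must accurately reflect the quality of the reasoning process and guide the model to improve its reasoning step by step. This requires precise evaluation criteria for each task, as well as a reward model that can be adjusted dynamically during training to keep pace with changes and improvements in model performance.
2. Stable training: Without high-quality SFT data, keeping RL training stable is a major challenge. RL typically involves extensive exploration and trial and error, which can destabilize training or even degrade model performance. Robust training methods are needed to keep the model stable and effective throughout training.
3. Extending from math and code tasks to natural language tasks: Existing methods for strengthening reasoning are mainly applied to math and code, where CoT data is relatively abundant. Natural language tasks are more open-ended and complex. Extending successful RL strategies from these relatively simple tasks to natural language processing tasks requires in-depth research and innovation in data processing and RL methods, with the aim of general reasoning ability across tasks.
4. Improving reasoning efficiency: Improving reasoning efficiency without sacrificing reasoning performance is another important challenge, because efficiency directly affects how usable and economical a model is in real applications. One option is knowledge distillation, which transfers the knowledge of a complex model to a smaller one to reduce compute consumption. Another potential approach is to use Long Chain-of-Thought (Long-CoT) techniques to improve Short Chain-of-Thought (Short-CoT) models, increasing inference speed while preserving reasoning quality.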
Related positions

ByteDance | Campus recruitment | A158012A

Team Introduction:
Data AML is ByteDance's machine learning middle platform, providing training and inference systems for recommendation, advertising, CV (computer vision), speech, and NLP (natural language processing) across businesses such as Douyin, Toutiao, and Xigua Video. AML provides powerful machine learning computing capabilities to internal business units and conducts research on general and innovative algorithms to solve key business challenges. Through Volcano Engine, it also delivers core machine learning and recommendation system capabilities to external enterprise clients. Beyond business applications, AML is engaged in cutting-edge research in areas such as AI for Science and scientific computing.

Research Project Introduction:
Large-scale recommendation systems are increasingly applied to short video, text community, image, and other products, and modal information plays a growing role in them. ByteDance's practice has found that modal information works well as a generalization feature for business scenarios such as recommendation, and research on end-to-end ultra-large-scale multimodal recommendation systems has enormous potential. Building on algorithm-engineering co-design, the project is expected to further explore directions such as multimodal co-training, large-scale 7B/13B parameter models, and end-to-end modeling of longer sequences.

Engineering research directions include:
1. Representation of multimodal samples
2. Construction of high-performance multimodal inference engines based on the PyTorch framework
3. Development of high-performance multimodal training frameworks
4. Application of heterogeneous hardware in multimodal recommendation systems

Algorithmic research directions include:
1. Design of reasonable recommendation-advertising and multimodal co-training architectures
2. Sparse Mixture of Experts (Sparse MoE)
3. Memory Network
4. Mixed-precision techniques

Responsibilities:
1. Design and develop machine learning system architecture, and tune system performance.
2. Solve technical challenges in high concurrency, high reliability, and high scalability.
3. Work across multiple subfields of machine learning systems, including resource scheduling, task orchestration, model training, model inference, model management, dataset management, workflow orchestration, and ML for System.
4. Investigate and introduce forward-looking technologies for machine learning systems, such as the latest hardware architectures, heterogeneous computing systems, and GPU optimization techniques.
5. Research machine-learning-based methods to analyze and optimize cluster and service resource usage.

Updated on 2025-05-26
Tesla | Experienced hire | IT - Application Software

The Role
Tesla's Data Algorithms Team plays a pivotal role in industrial intelligence research and development. We empower business areas including manufacturing, supply chain, sales, service, and charging networks by building our own data algorithms platform. This transforms vast amounts of information into high-value data assets, enabling us to create superior products and deliver an enhanced user experience.
As a Tesla Data Algorithms Engineer, you will be fully involved in the incubation, implementation, and iteration of our in-house data algorithms products and projects. From data collection, cleaning, and preprocessing to model training and production deployment, you will lead the entire process. The ideal candidate is passionate about artificial intelligence and stays abreast of the latest developments in the field.
This position focuses on computer vision applications in the industrial sector, including defect detection, visual guidance, dimension measurement, and large vision models.

Responsibilities
• Handle internal computer vision projects, independently design visual solutions, deploy them, and manage the full project lifecycle.
• Manage image collection, organization, filtering, and cleaning for computer vision projects; perform data preprocessing, model training, iteration, retraining, accuracy optimization, and model search tasks, covering areas such as classification, recognition, and image segmentation.
• Explore the application of multimodal large models in industrial scenarios, researching innovative solutions in directions like few-shot detection and video understanding.
• Track cutting-edge trends in computer vision technology and propose innovative solutions to address challenges in industrial production.

Required Qualifications
• Bachelor's degree or above in Computer Science, Mathematics, Statistics, or a related discipline.
• Solid development experience in Python and C++.
• Proficiency in image processing algorithms, such as those in OpenCV.
• Experience developing models with TensorFlow or PyTorch.
• Command of data science tools such as Pandas, NumPy, and Matplotlib, as well as MongoDB Aggregation.
• Project experience with multimodal large models, with in-depth research in at least one area (such as multimodal large models, multimodal representation, or few-shot learning).

Preferred Qualifications
• Hands-on project experience applying computer vision to industrial manufacturing or related fields.
• Familiarity with robot/PLC control and with industrial camera, laser sensor, and lighting solutions.
• Experience working in an agile development environment.
• Excellent written and verbal communication skills.
• Project management experience, with the ability to deliver development tasks on schedule.
• An algorithm development background, for example participation in ACM competitions.
• Publications in academic journals or conferences in related fields.

Join Us
At Tesla, you will work in a dynamic and innovative environment alongside top engineers and scientists from around the world, using machine vision to advance industrial automation and smart manufacturing. If you are passionate about machine learning, artificial intelligence, and computer vision, and eager to realize your potential in this cutting-edge field, we welcome you to join us.

SenseTime | Experienced hire | Algorithm Engineering

• Participate in the development of knowledge-based Q&A or conversational AI products, enhancing the capabilities of large language models in areas such as RAG (Retrieval-Augmented Generation) and Agents, including data systems, algorithm optimization, prompt engineering, and evaluation iteration.
• Build a systematic and specialized knowledge base, optimize the full-link retrieval technology, and continuously improve retrieval precision and recall.
• Improve the performance of large language models in Q&A and conversational scenarios through techniques such as RAG, SFT (Supervised Fine-Tuning), and RLHF (Reinforcement Learning from Human Feedback).

Updated on 2025-07-23