英伟达Deep Learning Solution Architect

社招全职2026-04-07地点：北京状态：招聘

扫码手机上打开

任职要求

• MS or PhD in Computer Science, Artificial Intelligence, Mathematics, or related fields, with solid foundations in algorithms and programming.
• 5+ years of experience (including research) in Reinforcement Learning, Large Language Model training, or multimodal learning.
• Proficient in PyTorch and familiar with RL training frameworks and workflows.
• Strong engineering skills with experience in distributed training, task orchestration, or evaluation pipelines.
• Ability to work independently with minimal day-to-day direction, and willingness to conduct exploratory experiments on frontier problems.
• Desire to be involved in multiple diverse and innovative projects.
• Outstanding verbal and written communication skills.

Ways to stand out from the crowd:
• Experience with RLH…

登录查看完整任职要求

微信扫码，1秒登录

工作职责

NVIDIA is leading company of AI computing. At NVIDIA, our employees are passionate about AI, HPC , VISUAL, GAMING. Our SA team is more focusing to bring NVIDIA new technology into difference industries. We help to design the architecture of AI computing platform, analysis the AI and HPC applications to deliver our value to customers.  You will work closely with industry sales, developer relationship managers and product teams in the hiring position. 
What you’ll be doing:
• Drive research, development, and optimization of Reinforcement Learning algorithms and infrastructure for Large Language Models and multimodal models.
• Collaborate with internal research and engineering teams to adapt and validate state-of-the-art RL methods on NVIDIA GPU platforms at scale.
• Improve Reinforcement Learning initiatives and engagements with customers, providing technical guidance on integrating NVIDIA RL technologies into their AI workflows.
• Develop and maintain reusable toolchains, experiment management workflows, and technical documentation to accelerate both internal and customer-facing projects.

📮 投递简历 ✨AI模拟面试

难度：

包括英文材料

PyTorch+

RLHF+

还有更多 •••

登录查看完整学习资料

相关职位

Recommendation Large Model Algorithm Engineer | 推荐大模型算法研究工程师-TikTok 算法-筋斗云人才计划

校招A177421

更新于 2025-05-26新加坡

Machine Learning Algorithm Research Engineer | 机器学习算法研究员-TikTok直播-筋斗云人才计划

校招A234692

Team Introduction: Research & Development (R&D) Team: The R&D team is dedicated to building and maintaining industry-leading products that drive the success of global business. By joining us, you'll work on core scenarios such as user growth, social features, live streaming, e-commerce consumer side, content creation, and content consumption, helping our products scale rapidly across global markets. You'll also face deep technical challenges in areas like service architecture and infrastructure engineering, ensuring our systems operate with high quality, efficiency, and security. Meanwhile, our team also provides comprehensive technical solutions across diverse business needs, continuously optimizing product metrics and improving user experience. Research Project Introduction: As the world's leading short-video platform, we faces multiple challenges in its recommendation systems, including data sparsity for new users leading to insufficient personalisation, high timeliness requirements for live steaming recommendations, difficulty in maintaining user interest diversity, and complex e-commerce recommendation system chains. Traditional recommendation methods heavily rely on historical behaviour modeling, which struggles with the cold-start problem for new users. Live-streaming recommendations demand real-time responsiveness to rapidly changing content dynamics (e.g., host interactions, traffic fluctuations) within extremely short time windows (typically within 30 minutes) posing higher demands on the system's real-time perception and decision-making capabilities. Additionally, the immersive single-feed format amplifies the challenge of maintaining content diversity, requiring a careful balance between multi-interest learning and the risk of content drift caused by exploratory recommendations. The current e-commerce recommendation system follows a multi-stage funnel architecture (recall–ranking–re-ranking), which often leads to inconsistent chains, high maintenance costs, and an overreliance on short-term value prediction. This leads users to fall into content homogenization fatigue. To address these pain points, this project proposes leveraging large language models (LLMs) and large model technologies to achieve significant breakthroughs. On one hand, LLMs—with their vast knowledge base and few-shot reasoning capabilities—can infer new users' potential intentions from registration data and external knowledge, thereby alleviating cold-start issues. On the other hand, by integrating graph neural networks (GNNs) and full-lifecycle user behavior sequences for modeling social preferences, we aim to improve the accuracy of interest prediction. Additionally, the project explores the generalization capabilities, long-context awareness, and end-to-end modeling strengths of large models to simplify the e-commerce recommendation chains, enhance adaptability to real-time changes, and improve exploratory recommendation effectiveness. The ultimate goal is to build a more streamlined system with more accurate recommendations, enhancing user experience and retention while driving sustainable business growth. 团队介绍： TikTok是一个覆盖150个国家和地区的国际短视频平台，我们希望通过TikTok发现真实、有趣的瞬间，让生活更美好。TikTok 在全球各地设有办公室，全球总部位于洛杉矶和新加坡，办公地点还包括纽约、伦敦、都柏林、巴黎、柏林、迪拜、雅加达、首尔和东京等多个城市。 TikTok研发团队，旨在实现TikTok业务的研发工作，搭建及维护业界领先的产品。加入我们，你能接触到包括用户增长、社交、直播、电商C端、内容创造、内容消费等核心业务场景，支持产品在全球赛道上高速发展；也能接触到包括服务架构、基础技术等方向上的技术挑战，保障业务持续高质量、高效率、且安全地为用户服务；同时还能为不同业务场景提供全面的技术解决方案，优化各项产品指标及用户体验。在这里，有大牛带队与大家一同不断探索前沿，突破想象空间。在这里，你的每一行代码都将服务亿万用户。在这里，团队专业且纯粹，合作氛围平等且轻松。目前在北京，上海，杭州、广州、深圳分别开放多个岗位机会。课题介绍： TikTok作为全球领先的短视频平台，面临新用户数据稀疏导致的个性化推荐不足、直播推荐时效性要求高、用户兴趣多样性维护困难以及电商推荐系统链路复杂等多重挑战。传统推荐方法依赖历史行为建模，难以解决新用户冷启动问题，且直播推荐需在极短窗口期内（通常30分钟内）实时捕捉内容动态变化（如主播互动、流量波动），这对系统的实时感知与快速决策能力提出更高要求。此外，单列沉浸式场景放大了多样性问题，需平衡多峰兴趣学习与探索引发的内容穿越风险。当前电商推荐系统采用多阶段漏斗架构（召回-排序-混排），存在链路不一致、维护成本高、过度依赖短期价值预测等问题，导致用户易陷入内容同质化疲劳。针对上述痛点，项目提出结合大语言模型（LLM）和大模型技术实现突破：一方面利用LLM的海量知识储备与Few-shot推理能力，通过注册信息与外部知识推理新用户潜在意图，缓解冷启动问题；另一方面，在社交偏好建模中融合GNN与用户全生命周期行为序列，提升兴趣预测精准度。同时，探索大模型的泛化能力、长上下文感知及端到端建模优势，简化电商推荐链路，增强实时动态适应性与兴趣探索能力，最终实现系统更简洁、推荐更精准、用户体验与留存双提升的目标，推动业务可持续增长。

更新于 2025-05-28新加坡

Recommendation Large Model Researcher | 推荐大模型算法工程师-电商-筋斗云人才计划

校招A221696

Team Introduction： The team primarily focuses on recommendation services for the International E-commerce Mall, covering information flow recommendation in core scenarios such as the mall homepage, transaction funnels, product detail pages, stores & showcases. Committed to providing hundreds of millions of users daily with precise and personalized recommendations for products, live streams, and short videos, the team dedicates itself to solving challenging problems in modern recommendation systems. Through algorithmic innovations, we continuously enhance user experience and efficiency, creating greater user and social value. Project Background/Objectives: This project aims to explore new paradigms for large models in the recommendation field, breaking through the long-standing structures of recommendation models and Infra solutions, achieving significantly better performance than current baseline models, and applying them across multiple business scenarios such as Douyin short videos/LIVE/E-commerce/Toutiao. Developing large models for recommendation is particularly challenging due to the high demands on engineering efficiency and the personalized nature of user recommendation experiences. The project will conduct in-depth research across the following directions to explore and establish large model solutions for recommendation scenarios: Project Challenges/Necessity: The emergence of LLMs in the natural language field has outperformed SOTA models in numerous vertical tasks. In contrast, industrial-grade recommendation systems have seen limited major innovations in recent years. This project seeks to revolutionize the long-standing paradigms of recommendation model architectures and Infra in the recommendation field, delivering models with significantly improved performance and applying them to scenarios like Douyin short video and LIVE. Key challenges include: High engineering efficiency requirements for recommendation systems; Personalized nature of user recommendation experiences; Effective content representation for media formats like short videos and live streams. The project will address these through deep research in model parameter scaling, content/user representation learning, multimodal content understanding, ultra-long sequence modeling, and generative recommendation models, driving systematic upgrades to recommendation models. Project Content: 1. Representation Learning Based on Content Understanding and User Behavior 2. Scaling of Recommendation Model Parameters and computing 3. Ultra-Long Sequence Modeling 4. Generative Recommendation Models Involved Research Directions: Recommendation Algorithms, Large Recommendation Models. 团队介绍：推荐与营销团队，主要负责国际电商商城推荐业务，涵盖商城首页、交易链路、商品详情页、店铺&橱窗等多个核心场景的信息流推荐业务，致力于每天为亿量级用户提供精准个性化商品、直播、短视频推荐服务；团队致力于解决现代推荐系统中各种有挑战的问题，通过算法不断提升用户体验和效率、创造更大的用户和社会价值。课题背景/目标：本项目旨在探索推荐领域下的大模型新范式，突破现在持续了较长时间的推荐模型结构和Infra的方案，且效果大幅好于现在的基线模型，在抖音短视频/直播/电商/头条等多个业务场景上得到应用。推荐领域的大模型是比较有挑战的事情，推荐对工程效率的要求更高，且用户的推荐体验上是个性化的，本课题会以下多个方向来做深入的研究，探索和建设推荐场景的大模型方案。课题挑战/必要性：自然语言领域LLM的出现，效果在众多垂直任务上都好于sota模型，从推荐领域看过去工业级推荐系统在较长的时间没有大幅的变化过。本项目旨在探索推荐领域下的大模型方案，改变现在持续了较长时间的推荐模型结构和Infra的基本范式，且效果大幅好于现在的模型，在抖音短视频/直播等多个业务场景上得到应用。但是怎么做好推荐领域的大模型也是一个比较有挑战的事情，推荐对工程效率的要求更高，且用户的推荐体验上是个性化的，以及如何短视频、直播等体裁上做号内容的表征也是需要被解决的问题，这里会从模型参数scaling up、内容和用户的表征学习、内容理解多模态、超长序列建模、生成式推荐模型等多个方向来做深入的研究，对推荐场景的模型做系统性的升级。课题内容： 1、基于内容理解和用户行为的表征学习； 2、推荐模型参数和算力scaling up； 3、超长序列建模； 4、生成式推荐模型。涉及研究方向：推荐算法、推荐大模型。

更新于 2025-05-26新加坡

Recommendation Algorithm Engineer｜推荐算法工程师-TikTok 算法 -筋斗云人才计划

校招A54374

Team Introduction: TikTok is a global short-video platform available in 150 countries and regions. Our mission is to inspire creativity and bring joy by helping users discover real and interesting moments that make life better. TikTok's global headquarters are in Los Angeles and Singapore, and we also have offices in New York City, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo. TikTok Research & Development (R&D) Team: The TikTok R&D team is dedicated to building and maintaining industry-leading products that drive the success of TikTok’s global business. By joining us, you'll work on core scenarios such as user growth, social features, live streaming, e-commerce consumer side, content creation, and content consumption, helping our products scale rapidly across global markets. You'll also face deep technical challenges in areas like service architecture and infrastructure engineering, ensuring our systems operate with high quality, efficiency, and security. Meanwhile, our team also provides comprehensive technical solutions across diverse business needs, continuously optimizing product metrics and improving user experience. Here, you'll collaborate with leading experts in exploring cutting-edge technologies and pushing the boundaries of what's possible. Every line of your code will serve hundreds of millions of users. Our team is professional and goal-oriented, with an egalitarian and easy-going collaborative environment. Research Project Introduction: As the world's leading short-video platform, TikTok faces multiple challenges in its recommendation systems, including data sparsity for new users leading to insufficient personalisation, high timeliness requirements for live steaming recommendations, difficulty in maintaining user interest diversity, and complex e-commerce recommendation system chains. Traditional recommendation methods heavily rely on historical behaviour modeling, which struggles with the cold-start problem for new users. Live-streaming recommendations demand real-time responsiveness to rapidly changing content dynamics (e.g., host interactions, traffic fluctuations) within extremely short time windows (typically within 30 minutes) posing higher demands on the system's real-time perception and decision-making capabilities. Additionally, the immersive single-feed format amplifies the challenge of maintaining content diversity, requiring a careful balance between multi-interest learning and the risk of content drift caused by exploratory recommendations. The current e-commerce recommendation system follows a multi-stage funnel architecture (recall–ranking–re-ranking), which often leads to inconsistent chains, high maintenance costs, and an overreliance on short-term value prediction. This leads users to fall into content homogenization fatigue. To address these pain points, this project proposes leveraging large language models (LLMs) and large model technologies to achieve significant breakthroughs. On one hand, LLMs—with their vast knowledge base and few-shot reasoning capabilities—can infer new users' potential intentions from registration data and external knowledge, thereby alleviating cold-start issues. On the other hand, by integrating graph neural networks (GNNs) and full-lifecycle user behavior sequences for modeling social preferences, we aim to improve the accuracy of interest prediction. Additionally, the project explores the generalization capabilities, long-context awareness, and end-to-end modeling strengths of large models to simplify the e-commerce recommendation chains, enhance adaptability to real-time changes, and improve exploratory recommendation effectiveness. The ultimate goal is to build a more streamlined system with more accurate recommendations, enhancing user experience and retention while driving sustainable business growth. 团队介绍： TikTok是一个覆盖150个国家和地区的国际短视频平台，我们希望通过TikTok发现真实、有趣的瞬间，让生活更美好。TikTok 在全球各地设有办公室，全球总部位于洛杉矶和新加坡，办公地点还包括纽约、伦敦、都柏林、巴黎、柏林、迪拜、雅加达、首尔和东京等多个城市。 TikTok研发团队，旨在实现TikTok业务的研发工作，搭建及维护业界领先的产品。加入我们，你能接触到包括用户增长、社交、直播、电商C端、内容创造、内容消费等核心业务场景，支持产品在全球赛道上高速发展；也能接触到包括服务架构、基础技术等方向上的技术挑战，保障业务持续高质量、高效率、且安全地为用户服务；同时还能为不同业务场景提供全面的技术解决方案，优化各项产品指标及用户体验。在这里，有大牛带队与大家一同不断探索前沿，突破想象空间。在这里，你的每一行代码都将服务亿万用户。在这里，团队专业且纯粹，合作氛围平等且轻松。课题介绍： TikTok作为全球领先的短视频平台，面临新用户数据稀疏导致的个性化推荐不足、直播推荐时效性要求高、用户兴趣多样性维护困难以及电商推荐系统链路复杂等多重挑战。传统推荐方法依赖历史行为建模，难以解决新用户冷启动问题，且直播推荐需在极短窗口期内（通常30分钟内）实时捕捉内容动态变化（如主播互动、流量波动），这对系统的实时感知与快速决策能力提出更高要求。此外，单列沉浸式场景放大了多样性问题，需平衡多峰兴趣学习与探索引发的内容穿越风险。当前电商推荐系统采用多阶段漏斗架构（召回-排序-混排），存在链路不一致、维护成本高、过度依赖短期价值预测等问题，导致用户易陷入内容同质化疲劳。针对上述痛点，项目提出结合大语言模型（LLM）和大模型技术实现突破：一方面利用LLM的海量知识储备与Few-shot推理能力，通过注册信息与外部知识推理新用户潜在意图，缓解冷启动问题；另一方面，在社交偏好建模中融合GNN与用户全生命周期行为序列，提升兴趣预测精准度。同时，探索大模型的泛化能力、长上下文感知及端到端建模优势，简化电商推荐链路，增强实时动态适应性与兴趣探索能力，最终实现系统更简洁、推荐更精准、用户体验与留存双提升的目标，推动业务可持续增长。

更新于 2025-05-26新加坡