logo of sensetime

商汤AI Large Language Model R&D

社招全职算法研究地点:利雅得状态:招聘

任职要求


•	Fluent in English (written and spoken). 
•	Must be based in Riyadh; remote work is not supported.
•	Bachelor’s or Master’s degree from global top tier university, major in Computer Science, Artificial Intelligence, Software Engineering, or a related field.
•	2 years of software development experience, including internships or personal projects.
•	Solid programming skills in Python/C++/Go or similar languages; familiar with Deep learning framework such as TF, Pytorch; with solid experience in algorithms and implementa…
登录查看完整任职要求
微信扫码,1秒登录

工作职责


•	Core Development & Optimization: Participate in the development of cutting-edge applications powered by Large Language Models (LLMs), contributing to code implementation, performance optimization, and debugging.
•	Requirement Translation & Feature Implementation: Collaborate closely with senior developers and product teams to deeply understand user requirements and translate them into high-quality functional modules.
•	LLM Models & Framework: Responsible for the design, development, and maintenance of LLM models within the team's proprietary LLM framework.
•	Advanced LLM Interaction: Skillfully apply prompt engineering techniques, context management, and advanced model interaction as part of LLM application development.
•	Continuous Learning & Growth: Actively learn and stay updated with the latest developments in LLM technologies, algorithms, and programming best practices.
•	Collaboration & Skill Enhancement: Actively participate in code reviews, pair programming sessions, and technical discussions to continuously grow your development skills.
•	Technical Problem Solving: Under the guidance of senior engineers, participate in technical problem-solving, performance optimization, and system debugging.
•	AI Agent System Productionization: Work closely with product and research teams to translate AI agent logic (e.g., tool-use, planning, reasoning) into robust, production-grade systems.
包括英文材料
Python+
C+++
Go+
开发框架+
PyTorch+
还有更多 •••
相关职位

logo of sensetime
社招系统开发

• Understand business scenarios and design targeted data acquisition solutions, ensuring data is relevant, high-quality, and aligned with project goals. • Architect, design, and maintain enterprise-grade databases, data warehouses, and lakehouse systems to support analytical, operational, and AI workloads. • Model and optimize schema design, storage layouts, data partitioning, clustering, and indexing strategies for large-scale datasets. • Implement and maintain ETL/ELT pipelines feeding data warehouses (e.g., Snowflake, BigQuery, Redshift, Databricks, or open-lakehouse environments). • Design, collect, and maintain high-quality datasets for AI inferencing and LLM model optimization, fine-tuning, and testing, ensuring data is formatted and preprocessed to meet model requirements. • Collaborate with AI application engineers to understand model performance requirements and translate them into targeted data collection and preparation strategies. • Develop and implement automated data pipelines for efficient data processing, including data cleaning, labeling, augmentation, and transformation. • Proactively identify data gaps based on model performance metrics, design solutions to acquire, clean, and optimize data for enhanced model accuracy and efficiency. • Build, clean, and manage diverse data sources, ensuring compliance with data security and privacy standards. • Conduct exploratory data analysis to discover data patterns, anomalies, and optimization opportunities, directly impacting model performance. • Continuously learn and adapt to the latest advancements in data engineering, AI, and large language model (LLM) technologies.

更新于 2026-01-14阿布扎比
logo of nvidia
社招

NVIDIA’s Solution Architect team is looking for a AI-focused Solution Architect with expertise in Large Language Model, generative AI, agentic AI, or recommender system. We work with the most exciting computing hardware and software, driving the latest breakthroughs in artificial intelligence. We need individuals who can enable customer productivity and develop lasting relationships with our technology partners, making NVIDIA an integral part of end-user solutions. We are looking for someone always thinking about artificial intelligence, someone who can maintain constructive collaboration in a fast paced, rapidly evolving field, someone able to coordinate efforts between corporate marketing, industry business development and engineering. You will be working with the latest AI architecture coupled with the most advanced neural network models, changing the way people interact with technology.As a Solutions Architect, you will be the first line of technical expertise between NVIDIA and our customers. Your duties will vary from working on proof-of-concept demonstrations, to driving relationships with key executives and managers to evangelize accelerated computing. Dynamically engaging with developers, scientific researchers, data scientists, IT managers and senior leaders is a meaningful part of the Solutions Architect role and will give you experience with a range of partners and concerns. What you’ll be doing: • Assisting field business development in guiding the customer build/extend their GPU infrastructures for AI. • Help customers build their large-scale projects, especially Large Language Model (LLM) projects. • Engage with customers to perform in-depth analysis and optimization to ensure the best performance on GPU architecture systems. This includes support in optimization of both training and inference pipelines. • Partner with Engineering, Product and Sales teams to develop, plan best suitable solutions for customers. Enable development and growth of product features through customer feedback and proof-of-concept evaluations. • Build industry expertise and become a contributor in integrating NVIDIA technology into Enterprise Computing architectures.

更新于 2026-04-04深圳|北京
logo of alibaba
社招3年以上LAZADA

产品运营 Seller Governance & After-sales Experience Product Operations 【职位描述】 1、负责Lazada平台的商家治理与售后链路的产品运营策略制定,可以对治理、履约、售后退换货等体验场景处理等治理策略进行优化机会挖掘并推动落地优化 2、与产品、算法、技术团队配合,优化治理与售后链路中的AI识别、判责、赔付等自动化能力,提升模型的准确率与召回率,并最终作用到核心业务结果指标 3、梳理项目指标体系并定期对运营数据进行分析和结论产出,为产品改进和策略升级提供决策支持。 4、与内部多个跨部门团队合作,解决运营过程中遇到的产品问题,提升用户体验 Key Responsibilities Develop and execute product operations strategies for platform governance and after-sales workflows on Lazada, identifying optimization opportunities across key experience touchpoints—including content/policy governance, order fulfillment, and returns/refunds—and driving end-to-end implementation to enhance operational efficiency and user satisfaction. Collaborate with product, algorithm, and engineering teams to enhance AI-powered automation capabilities throughout governance and after-sales processes—such as intelligent issue detection, liability assessment, and compensation decisions—continuously improving model accuracy and recall to deliver measurable impact on core business metrics. Establish KPI frameworks, conduct regular data analysis, and deliver actionable insights to guide product enhancements and strategy iteration. Partner cross-functionally with internal teams to resolve operational bottlenecks, streamline workflows, and elevate the overall customer experience.

更新于 2026-02-06杭州
logo of bytedance
校招A238623

Team Introduction: TikTok Content Security Algorithm Research Team The International Content Safety Algorithm Research Team is dedicated to maintaining a safe and trustworthy environment for users of ByteDance's international products. We develop and iterate on machine learning models and information systems to identify risks earlier, respond to incidents faster, and monitor potential threats more effectively. The team also leads the development of foundational large models for products. In the R&D process, we tackle key challenges such as data compliance, model reasoning capability, and multilingual performance optimization. Our goal is to build secure, compliant, and high-performance models that empower various business scenarios across the platform, including content moderation, search, and recommendation. Research Project Background: In recent years, Large Language Models (LLMs) have achieved remarkable progress across various domains of natural language processing (NLP) and artificial intelligence. These models have demonstrated impressive capabilities in tasks such as language generation, question answering, and text translation. However, reasoning remains a key area for further improvement. Current approaches to enhancing reasoning abilities often rely on large amounts of Supervised Fine-Tuning (SFT) data. However, acquiring such high-quality SFT data is expensive and poses a significant barrier to scalable model development and deployment. To address this, OpenAI's o1 series of models have made progress by increasing the length of the Chain-of-Thought (CoT) reasoning process. While this technique has proven effective, how to efficiently scale this approach in practical testing remains an open question. Recent research has explored alternative methods such as Process-based Reward Model (PRM), Reinforcement Learning (RL), and Monte Carlo Tree Search (MCTS) to improve reasoning. However, these approaches still fall short of the general reasoning performance achieved by OpenAI's o1 series of models. Notably, the recent DeepSeek R1 paper suggests that pure RL methods can enable LLM to autonomously develop reasoning skills without relying on the expensive SFT data, revealing the substantial potential of RL in advancing LLM capabilities. 团队介绍: 国际化内容安全算法研究团队致力于为字节跳动国际化产品的用户维护安全可信赖环境,通过开发、迭代机器学习模型和信息系统以更早、更快发掘风险、监控风险、响应紧急事件,团队同时负责产品基座大模型的研发,我们在研发过程中需要解决数据合规、模型推理能力、多语种性能优化等方面的问题,从而为平台上的内容审核、搜索、推荐等多项业务提供安全合规,性能优越的基座模型。 课题介绍: 课题背景: 近年来,大规模语言模型(Large Language Models, LLM)在自然语言处理和人工智能的各个领域都取得了显著的进展。这些模型展示了强大的能力,例如在生成语言、回答问题、翻译文本等任务上表现优异。然而,LLM 的推理能力仍有很大的提升空间。在现有的研究中,通常依赖于大量的监督微调(Supervised Fine-Tuning, SFT)数据来增强模型的推理性能。然而,高质量 SFT 数据的获取成本高昂,这对模型的开发和应用带来了极大的限制。 为了提升推理能力,OpenAI 的 o1 系列模型通过增加思维链(Chain-of-Thought, CoT)的推理过程长度取得了一定的成功。这种方法虽然有效,但在实际测试时如何高效地进行扩展仍是一个开放的问题。一些研究尝试使用基于过程的奖励模型(Process-based Reward Model, PRM)、强化学习(Reinforcement Learning, RL)以及蒙特卡洛树搜索算法(Monte Carlo Tree Search, MCTS)等方法来解决推理问题,然而这些方法尚未能达到 OpenAI o1 系列模型的通用推理性能水平。最近deepseek r1在论文中提到通过纯强化学习的方法,可以使得 LLM 自主发展推理能力,而无需依赖昂贵的 SFT 数据。这一系列的工作都揭示着强化学习对LLM的巨大潜力。 课题挑战: 1、Reward模型的设计:在强化学习过程中,设计一个合适的reward模型是关键。Reward模型需要准确地反映推理过程的效果,并引导模型逐步提升其推理能力。这不仅要求对不同任务精准设定评估标准,还要确保reward模型能够在训练过程中动态调整,以适应模型性能的变化和提高。 2、稳定的训练过程:在缺乏高质量SFT数据的情况下,如何确保强化学习过程中的稳定训练是一个重大挑战。强化学习过程通常涉及大量的探索和试错,这可能导致训练不稳定甚至模型性能下降。需要开发具有鲁棒性的训练方法,以保证模型在训练过程中的稳定性和效果。 3、如何从数学和代码任务上拓展到自然语言任务上:现有的推理强化方法主要应用在数学和代码这些CoT数据量相对丰富的任务上。然而,自然语言任务的开放性和复杂性更高,如何将成功的RL策略从这些相对简单的任务拓展到自然语言处理任务上,要求对数据处理和RL方法进行深入的研究和创新,以实现跨任务的通用推理能力。 4、推理效率的提升:在保证推理性能的前提下,提升推理效率也是一个重要挑战。推理过程的效率直接影响到模型在实际应用中的可用性和经济性。可以考虑利用知识蒸馏技术,将复杂模型的知识传递给较小的模型,以减少计算资源消耗。另外,使用长思维链(Long Chain-of-Thought, Long-CoT)技术来改进短思维链(Short-CoT)模型,也是一种潜在的方法,以在保证推理质量的同时提升推理速度。

更新于 2025-05-26新加坡