logo of siemens

西门子西门子中国研究院 大模型强化学习研究员(上海、北京、苏州)

社招全职1-3年研发地点:上海 | 苏州 | 北京状态:招聘

任职要求


• Master's or Doctor degree or above in Computer Science, Automation, Mathematics or related.
 • Self-motivated, good communication skills and good team player.
 • Ability to handle multiple competing priorities in a fast-paced environment.
 The skills you are expected to have: 
 • 1~3 years of hands-on RL experience (academic or industry).
 • Expertise in deep RL algorithms (model-based/model-free) and frameworks (e.g., RLlib, Gymnasium).
 • Strong Python skills with PyTorch/TensorFlow and proficiency in Linux.
 • Experience with distributed training (Horovod, DeepSpeed) and cloud platforms (AWS/Azure/Alicloud).
 • Familiarity with LLM agents or LLM post-training.
 • Prefer: Background in robotics, control systems, or game AI.
 • Prefer: Contributions to RL open-source projects or publications at top conferences (NeurIPS, ICML, ICLR, KDD, IROS, etc).
 You'll benefit from 
 • Diverse and inclusive culture, doing the work you like with people who appreciate it
 • Systematic career development platform, various training courses, and online learning resources for you to help you tailor your growth path based on your strengths
 • 15 days+ annual leaves, with additional benefits such as Christmas leave
• Generous benefits package, long-term care corporate annuity plan, flexible allocation of commercial insurance, employee stock sharing matching plan for mutual growth, etc

工作职责


We empower our people to stay resilient and relevant in a constantly changing world. We're looking for people who are always searching for creative ways to grow and learn. People who want to make a real impact, now and in the future. Does that sound like you? Then it seems like you'd make a great addition to our vibrant international team. DAI AIX – AI Acceleration and Exploration, is working on the cutting-edge research of Data Analytics and AI with Siemens global technology network, and consulting, co-creation, data driven applications for the end customers. Research Scientist is to do applied research for Industrial AI applications in the team. We are seeking a Reinforcement Learning (RL) Specialist to lead the design, implementation, and optimization of RL-driven systems for post-training of foundation models. The primary focus of this role is advancing our RL capabilities for real-world applications such as industrial control systems and LLM agents. You will develop cutting-edge algorithms, improve post-training efficiency, and deploy scalable RL solutions in industry. 
 You'll make an impact by 
 • 1. Reinforcement learning development for post-training: • Design and implement state-of-the-art RL algorithms (e.g., PPO, SAC, DQN) for post-training of foundation models like LLMs and time series foundation models.
 • Implement distributed RL training pipelines using frameworks like Ray RLlib, Deepspeed, or custom solutions.
 • Design and implement benchmark pipelines for model evaluation.
 
 • 2. Align foundation models like LLMs and time series foundation models with specific areas/tasks through techniques like SFT, RL.
 • 3. Coding & Infrastructure: • Write production-grade Python code using PyTorch, numpy, and pandas.
 • Manage Linux-based clusters for distributed training and deployment.
 
 • 4. All other support required by the line manager if necessary.
包括英文材料
Gymnasium+
Python+
PyTorch+
TensorFlow+
Linux+
DeepSpeed+
AWS+
Azure+
大模型+
NeurIPS+
ICML+
相关职位

logo of siemens
社招1-3年研发

We empower our people to stay resilient and relevant in a constantly changing world. We're looking for people who are always searching for creative ways to grow and learn. People who want to make a real impact, now and in the future. Does that sound like you? Then it seems like you'd make a great addition to our vibrant international team. DAI AIX – AI Acceleration and Exploration, is working on the cutting-edge research of Data Analytics and AI with Siemens global technology network, and consulting, co-creation, data driven applications for the end customers. Research Scientist is to do applied research for Industrial AI applications in the team. We are seeking a Reinforcement Learning (RL) Specialist to lead the design, implementation, and optimization of RL-driven systems for post-training of foundation models. The primary focus of this role is advancing our RL capabilities for real-world applications such as industrial control systems and LLM agents. You will develop cutting-edge algorithms, improve post-training efficiency, and deploy scalable RL solutions in industry. You'll make an impact by • Research on state-of-the-art data analytics & AI technologies on a general range. • Mainly focus on modern foundation model applications in industrial scenarios1. Context engineering for foundation models2. Development of agent systems for industrial applications3. Task-specific model finetuning • Partially work with multi-modal applications • Participating in both internal & external research projects • Assist deployment of customer development/deployment project

更新于 2025-10-09
logo of mi
社招I3389

根据监管要求及信息安全相关法律法规,开展内部自查,发现并推进整改,消除合规风险; 将法规要求结合公司的业务情况,沉淀为内部规范,与相关部门协作并推进落地; 完善内部安全管理体系,制定和维护相关管理制度、策略、流程及规范; 建立与业务团队良好的沟通协作机制,为业务部门提供合规咨询与方案支持,支持各类数据和隐私相关监管审查和认证等; 推动数据安全、客户信息保护的合规管理体系建设与完善,推动流程规范、技术体系、风险评估与跟踪、风险治理等要求的落地执行。

更新于 2023-07-20
logo of cainiao
社招6年以上综合类-法务

1. 支持菜鸟物流及供应链跨中国、跨境、海外法律、合规事务; 2. 协助业务商务谈判、处理突发风险事件,提供可行、可操作的解决方案; 3. 支持菜鸟在海外的国家的网络建设和本地化运营; 4. 对新业务类型进行调研及风险分析评估,并提供解决方案; 5. 关注及追踪跨境物流领域的发展动态,并研究境外国家和地区对贸易、物流、清关等业务的要求和法律法规,为业务落地提供支撑。

更新于 2025-08-18