西门子西门子中国研究院 大模型强化学习研究员(上海、北京、苏州)
任职要求
• Master's or Doctor degree or above in Computer Science, Automation, Mathematics or related. • Self-motivated, good communication skills and good team player. • Ability to handle multiple competing priorities in a fast-paced environment. The skills you are expected to have: • 1~3 years of hands-on RL experience (academic or industry). • Expertise in deep RL algorithms (model-based/model-free) and frameworks (e.g., RLlib, Gymnasium). • Strong Python skills with PyTorch/TensorFlow and proficiency in Linux. • Experience with distributed training (Horovod, DeepSpeed) and cloud platforms (AWS/Azure/Alicloud). • Familiarity with LLM agents or LLM post-training. • Prefer: Background in robotics, control systems, or game AI. • Prefer: Contributions to RL open-source projects or publications at top conferences (NeurIPS, ICML, ICLR, KDD, IROS, etc). You'll benefit from • Diverse and inclusive culture, doing the work you like with people who appreciate it • Systematic career development platform, various training courses, and online learning resources for you to help you tailor your growth path based on your strengths • 15 days+ annual leaves, with additional benefits such as Christmas leave • Generous benefits package, long-term care corporate annuity plan, flexible allocation of commercial insurance, employee stock sharing matching plan for mutual growth, etc
工作职责
We empower our people to stay resilient and relevant in a constantly changing world. We're looking for people who are always searching for creative ways to grow and learn. People who want to make a real impact, now and in the future. Does that sound like you? Then it seems like you'd make a great addition to our vibrant international team. DAI AIX – AI Acceleration and Exploration, is working on the cutting-edge research of Data Analytics and AI with Siemens global technology network, and consulting, co-creation, data driven applications for the end customers. Research Scientist is to do applied research for Industrial AI applications in the team. We are seeking a Reinforcement Learning (RL) Specialist to lead the design, implementation, and optimization of RL-driven systems for post-training of foundation models. The primary focus of this role is advancing our RL capabilities for real-world applications such as industrial control systems and LLM agents. You will develop cutting-edge algorithms, improve post-training efficiency, and deploy scalable RL solutions in industry. You'll make an impact by • 1. Reinforcement learning development for post-training: • Design and implement state-of-the-art RL algorithms (e.g., PPO, SAC, DQN) for post-training of foundation models like LLMs and time series foundation models. • Implement distributed RL training pipelines using frameworks like Ray RLlib, Deepspeed, or custom solutions. • Design and implement benchmark pipelines for model evaluation. • 2. Align foundation models like LLMs and time series foundation models with specific areas/tasks through techniques like SFT, RL. • 3. Coding & Infrastructure: • Write production-grade Python code using PyTorch, numpy, and pandas. • Manage Linux-based clusters for distributed training and deployment. • 4. All other support required by the line manager if necessary.
We empower our people to stay resilient and relevant in a constantly changing world. We're looking for people who are always searching for creative ways to grow and learn. People who want to make a real impact, now and in the future. Does that sound like you? Then it seems like you'd make a great addition to our vibrant international team. DAI AIX – AI Acceleration and Exploration, is working on the cutting-edge research of Data Analytics and AI with Siemens global technology network, and consulting, co-creation, data driven applications for the end customers. Research Scientist is to do applied research for Industrial AI applications in the team. We are seeking a Reinforcement Learning (RL) Specialist to lead the design, implementation, and optimization of RL-driven systems for post-training of foundation models. The primary focus of this role is advancing our RL capabilities for real-world applications such as industrial control systems and LLM agents. You will develop cutting-edge algorithms, improve post-training efficiency, and deploy scalable RL solutions in industry. You'll make an impact by • Research on state-of-the-art data analytics & AI technologies on a general range. • Mainly focus on modern foundation model applications in industrial scenarios1. Context engineering for foundation models2. Development of agent systems for industrial applications3. Task-specific model finetuning • Partially work with multi-modal applications • Participating in both internal & external research projects • Assist deployment of customer development/deployment project
根据监管要求及信息安全相关法律法规,开展内部自查,发现并推进整改,消除合规风险; 将法规要求结合公司的业务情况,沉淀为内部规范,与相关部门协作并推进落地; 完善内部安全管理体系,制定和维护相关管理制度、策略、流程及规范; 建立与业务团队良好的沟通协作机制,为业务部门提供合规咨询与方案支持,支持各类数据和隐私相关监管审查和认证等; 推动数据安全、客户信息保护的合规管理体系建设与完善,推动流程规范、技术体系、风险评估与跟踪、风险治理等要求的落地执行。