logo of apple

苹果AIML Infrastructure Systems Engineer Intern

实习兼职Machine Learning and AI地点:北京状态:招聘

任职要求


Minimum Qualifications
• Master or PhD degree in Computer Science, Electrical Engineering or equivalent
• Proficiency in coding with scripting and programming languages, including Bash, Python, Golang
• Demonstrated understanding of computing, storage and networking in public cloud infrastructure, including provisioning, setup, monitoring, security operations, performance tuning, and troubleshooting.
• Rich knowledge on data cleaning, machine learning model training
• Basic hand-on experience on Kubernetes, and Linux command line utilities
• Basic knowledge on infrastructure services, e.g. DNS, logging, LDAP etc.

Preferred Qualifications
• Self-motivated an…
登录查看完整任职要求
微信扫码,1秒登录

工作职责


• The Infrastructure Systems Engineer Intern will do the following tasks, through collaboration with team members in China and around the world.
• -  Analyze the requirements, demands, constraints and challenges of machine learning platform in local or global environments. Design or re-design platform architecture to improve its scalability and agility, and to enable new, high-impact use cases
• -  Investigate new technologies to enhance system performance, reliability and redundancy. Create performance profile for platforms and services, defining service level objectives (SLO) and driving the measurement, monitoring and evaluation over these objectives
• -  Improve automation of operations for infrastructure and platforms, including tools and processes of monitoring, logging and alerting, to improve scalability in both system construction and runtime operations
• -  Develop and implement the above design, bringing it to an internal product, with observability to support efficient systems management
包括英文材料
Bash+
Python+
Go+
还有更多 •••
相关职位

logo of apple
社招Machine

The Infrastructure Systems Engineer will do the following tasks, through collaboration with team members in China and around the world. - Analyze the requirements, demands, constraints and challenges of machine learning in local or global environments, design or re-design platform architecture to improve its scalability and agility, and to enable new, high-impact use cases - Develop and implement the above design, bringing it to an internal product, with observability to support efficient system management - Design and/or enhance automation of operations for infrastructure and platforms, including tools and processes of monitoring, logging and alerting, to improve scalability in both system construction and runtime operations - Support Dev and Eng efforts through provisioning operational solutions, co-design ML application architecture and drive the coordination among local and global, internal and cross-functional groups to achieve the result of success - Create performance profile for platforms and services, defining service level objectives (SLO) and driving the measurement, monitoring and evaluation over these objectives - Lead constant evaluation on system performance and reliability, discover potential faults, drive RCA and fixes

更新于 2025-07-30上海
logo of apple
社招Machine

The Infrastructure Systems Engineer will do the following tasks, through collaboration with team members in China and around the world. - Analyze the requirements, demands, constraints and challenges of machine learning in local or global environments, design or re-design platform architecture to improve its scalability and agility, and to enable new, high-impact use cases - Develop and implement the above design, bringing it to an internal product, with observability to support efficient system management - Design and/or enhance automation of operations for infrastructure and platforms, including tools and processes of monitoring, logging and alerting, to improve scalability in both system construction and runtime operations - Support Dev and Eng efforts through provisioning operational solutions, co-design ML application architecture and drive the coordination among local and global, internal and cross-functional groups to achieve the result of success - Create performance profile for platforms and services, defining service level objectives (SLO) and driving the measurement, monitoring and evaluation over these objectives - Lead constant evaluation on system performance and reliability, discover potential faults, drive RCA and fixes

更新于 2025-10-15北京
logo of nvidia
社招

N/A

更新于 2025-09-24上海|北京|深圳
logo of nvidia
社招

• Primary responsibilities will include deploying, managing and maintaining AI/HPC infrastructure in Linux-based environments for new and existing customers. • Be the domain expert with customers during planning calls through implementation. • Handover-related documentation and perform knowledge transfers required to support customers as they begin rolling out some of the most sophisticated systems in the world! • Provide feedback into internal teams such as opening bugs, documenting workarounds, and suggesting improvements.

更新于 2025-09-15北京|上海|深圳