小鹏汽车 (XPeng Motors) — GPU Tools Senior / Staff / Expert Engineer

1. Architect and implement agentic workflows that plan, reason, call tools/APIs, and coordinate with humans or other agents.
2. Select, extend, or build frameworks (e.g., LangChain, AutoGen, CrewAI, MetaGPT, LangGraph) to accelerate delivery while avoiding vendor lock-in.
3. Own the MLOps lifecycle: data collection, evaluation harnesses, safety filters, CI/CD, and observability for deployed agents.
4. Integrate enterprise systems and data sources (REST/GraphQL, Kafka, vector databases, Kubernetes) so agents can act on real business objects.
5. Mentor junior engineers and review their code; drive best practices in prompt engineering, evaluation, and secure coding.
6. Research emerging techniques (Toolformer, self-reflection, role specialization) and translate findings into the product roadmap.
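The plan–act–observe loop behind agentic workflows like those in item 1 can be sketched minimally. All names below (`Tool`, `run_agent`, the `calc` tool, the pre-computed `plan`) are illustrative, not tied to any of the frameworks listed; in a real system the plan would come from an LLM rather than being hard-coded.

```python
# Minimal sketch of an agent executing a plan of tool calls.
from dataclasses import dataclass
from typing import Callable

@dataclass
class Tool:
    name: str
    description: str
    fn: Callable[[str], str]

def run_agent(task: str, tools: dict[str, Tool],
              plan: list[tuple[str, str]]) -> list[str]:
    """Execute a pre-computed plan: a list of (tool_name, argument) steps.
    A production agent would instead ask an LLM to choose each step
    from the tool descriptions, feeding observations back in."""
    observations = []
    for tool_name, arg in plan:
        tool = tools[tool_name]
        observations.append(tool.fn(arg))
    return observations

# Example: a single arithmetic tool driven by a two-step plan.
calc = Tool("calc", "evaluate an arithmetic expression",
            lambda expr: str(eval(expr)))
result = run_agent("add, then square the sum", {"calc": calc},
                   plan=[("calc", "2+3"), ("calc", "5*5")])
print(result)  # ['5', '25']
```

The key design point this illustrates is separating planning (choosing steps) from acting (invoking tools), which is what makes planners and routers swappable.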
• Lead software development in C/C++, Python, and GPU languages such as CUDA, ROCm, or Triton.
• Analyze metrics and identify opportunities from offline and online testing; develop and deliver robust, scalable solutions.
• Work with cutting-edge hardware and a fast-moving software stack to deliver best-in-class inference at optimal cost.
• Engage with key partners to understand and implement inference and training optimizations for state-of-the-art LLMs and other models.
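A first step in the offline metric analysis mentioned above is usually comparing latency percentiles across configurations. The sketch below uses the nearest-rank percentile definition on made-up sample timings; the function name and data are illustrative only.

```python
# Hedged sketch: nearest-rank percentiles over recorded request latencies.
def percentile(samples: list[float], p: float) -> float:
    """Nearest-rank percentile (no interpolation): the value at rank
    ceil-like round(p/100 * n), clamped to valid indices."""
    ordered = sorted(samples)
    k = max(0, min(len(ordered) - 1, round(p / 100 * len(ordered)) - 1))
    return ordered[k]

# Hypothetical per-request latencies (ms) from an offline run;
# one slow outlier dominates the tail.
latencies_ms = [12.0, 15.0, 11.0, 90.0, 14.0, 13.0, 16.0, 12.5, 13.5, 14.5]
p50 = percentile(latencies_ms, 50)
p95 = percentile(latencies_ms, 95)
print(f"p50={p50} ms, p95={p95} ms")
```

Comparing p50 against p95/p99 rather than the mean is what surfaces tail-latency regressions that averages hide.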
• Architect Performance Tooling: Develop infrastructure tools and libraries for GPU performance analysis, visualization, and automated workflows used across the GPU SW/HW development life cycle.
• Unlock Architectural Insights: Analyze GPU workloads to identify bottlenecks and define new hardware profiling features that enhance performance-debug and profiling capabilities.
• AI-Powered Automation: Build AI/ML-driven tools to automate performance analysis, generate performance-optimization guidance, and improve the user experience of the profiling infrastructure.
• Cross-Stack Collaboration: Partner with kernel developers, system software teams, and hardware architects to support performance studies, improve the CUDA software stack, and co-design performance-centric solutions for current and next-generation GPU architectures.
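The bottleneck identification described above often starts from the roofline model: a kernel whose arithmetic intensity (FLOPs per byte of DRAM traffic) falls below the device's ridge point is bandwidth-limited. The sketch below applies that rule; the peak numbers are illustrative defaults (roughly A100-class: 312 TFLOP/s tensor throughput, 2 TB/s HBM bandwidth), and the function names are made up for this example.

```python
# Hedged sketch: roofline-model bottleneck classification for a GPU kernel.
def arithmetic_intensity(flops: float, bytes_moved: float) -> float:
    """FLOPs executed per byte of DRAM traffic."""
    return flops / bytes_moved

def classify_kernel(flops: float, bytes_moved: float,
                    peak_flops: float = 312e12, peak_bw: float = 2e12) -> str:
    """Compare the kernel's intensity to the device ridge point
    (peak_flops / peak_bw). Below the ridge, memory bandwidth caps
    throughput; above it, compute does."""
    ridge = peak_flops / peak_bw  # FLOP/byte where the roofline bends (156 here)
    ai = arithmetic_intensity(flops, bytes_moved)
    return "memory-bound" if ai < ridge else "compute-bound"

# A large GEMM reuses data heavily; an elementwise op touches each byte once.
print(classify_kernel(flops=1e12, bytes_moved=1e9))  # compute-bound (AI = 1000)
print(classify_kernel(flops=1e9, bytes_moved=1e9))   # memory-bound (AI = 1)
```

An automated tool built on this rule can pull FLOP and byte counters from a profiler and rank kernels by distance from the roofline to prioritize optimization work.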
We are seeking a skilled developer to build production-grade AI applications, focusing on LLM-based agents and tool-using systems. You will integrate large language models (LLMs), retrieval-augmented generation (RAG), and external tools/APIs on GPU-accelerated stacks, enhancing agent frameworks for reliability, scalability, and safety.

What You’ll Be Doing:
• Design, implement, and deploy AI-powered features using LLMs, including autonomous and multi-agent workflows.
• Build agent toolchains, including planning, tool/function calling, memory management, RAG integration, and enterprise API connectivity.
• Enhance agent frameworks with custom planners, routers, concurrency control, state management, and retry mechanisms.
• Develop evaluation and observability systems to monitor agent performance (success rates, tool-call accuracy, latency, cost, traces).
• Implement safety and compliance measures, including content filtering, PII handling, and policy enforcement using guardrail frameworks.
• Optimize inference pipelines for GPU performance, latency, and cost; deploy via microservices and APIs.
• Manage CI/CD, containerization, and deployment; maintain monitoring, logging, and alerting; and produce clear documentation.
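The retry mechanisms mentioned for agent frameworks are commonly exponential backoff around flaky tool calls. A minimal sketch, assuming a generic callable tool (all names here are hypothetical; `base_delay` is set to zero so the example runs instantly, whereas production code would use a nonzero delay plus jitter):

```python
import time

def call_with_retry(fn, max_attempts: int = 3, base_delay: float = 0.0):
    """Retry a tool call on any exception, doubling the wait each attempt.
    Re-raises the last exception once attempts are exhausted."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except Exception:
            if attempt == max_attempts - 1:
                raise
            time.sleep(base_delay * (2 ** attempt))

# Simulate a tool that fails twice with a transient error, then succeeds.
calls = {"n": 0}
def flaky_tool():
    calls["n"] += 1
    if calls["n"] < 3:
        raise RuntimeError("transient failure")
    return "ok"

outcome = call_with_retry(flaky_tool)
print(outcome, "after", calls["n"], "attempts")  # ok after 3 attempts
```

In a real agent, retries are usually restricted to transient error classes (timeouts, rate limits) so that deterministic failures surface to the planner instead of being retried blindly.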