NVIDIA Principal On-Device Model Inference Optimization Engineer
Requirements
• MSc or PhD in Computer Science, Engineering, or a related field, or equivalent experience.
• Over 10 years of confirmed experience specializing in model inference and optimization.
• 15+ overall years of work experience in a relevant area.
• Expertise in modern machine learning frameworks, particularly PyTorch, ONNX, and TensorRT.
• Proven experience in optimizing inference for transformer and convolutional architectures.
• Strong programming proficiency in CUDA, Python, and C++.
• In-depth knowledge of optimization techniques, including quantization, pruning, distillation, and hardware-aware neural architecture search.
• Skilled in building and deploying scalable, cloud-based inference systems.
• Passionate about developing efficient, production-ready solutions with a strong focus on code quality and performance.
• Meticulous attention to detail, ensuring precision and reliability in safety-critical systems.
• Strong collaboration and communication skills for working optimally across multidisciplinary teams.
• A proactive, diligent mentality with a drive to tackle complex optimization challenges.
Ways to stand out from the crowd:
• Publications or industry experience in optimizing and deploying model inference at scale.
• Hands-on expertise in hardware-aware optimizations and accelerators such as GPUs, TPUs, or custom ASICs.
• Active contributions to open-source projects focused on inference optimization or machine learning frameworks.
• Experience in designing and deploying inference pipelines for real-time or autonomous systems.
Responsibilities
• Develop and implement strategies to optimize AI model inference for on-device deployment.
• Employ techniques like pruning, quantization, and knowledge distillation to minimize model size and computational demands.
• Optimize performance-critical components using CUDA and C++.
• Collaborate with multi-functional teams to align optimization efforts with hardware capabilities and deployment needs.
• Benchmark inference performance, identify bottlenecks, and implement solutions.
• Research and apply innovative methods for inference optimization.
• Adapt models for diverse hardware platforms and operating systems with varying capabilities.
• Create tools to validate the accuracy and latency of deployed models at scale with minimal friction.
• Recommend and implement model architecture changes to improve the accuracy-latency balance.
As a Principal BD Manager for Fire TV, you will work closely with Senior Leadership, Product Development, GTM, Finance, Legal, and Tax, among a number of other teams, and develop strategic partnerships with major OEMs, ODMs, SoC vendors, and offline retailers. This role will focus on building relationships with partners across Asia to accelerate Fire TV customer acquisition globally. You will be required to handle multiple high-priority projects simultaneously and effectively negotiate terms that benefit our customers.
A day in the life
In this role, you will:
- Be the business development leader to launch and scale Fire TV globally.
- Create and drive consensus on business plans for Fire TV’s launch in multiple countries.
- Conduct in-depth studies of country and regional landscapes to identify priority ODMs, OEMs, and launch timelines.
- Define and negotiate regionally specific commercial terms with ODMs and OEMs, taking into account competition and regional legal, tax, and costing considerations.
- Drive joint OEM / brand pitches with ODM partners and enable regionally relevant ODMs to drive Fire TV’s business interests.
- Sign regionally relevant offline retailer partnerships.
- Validate long-term, business-critical decisions such as country-specific SoC roadmaps.
- Work with Product to define regionally relevant customer experiences, customization, and scaling requirements.
As an architect on the Copilot Mac team, you will drive the ongoing development of the product on macOS. The team is committed to delivering a consistent cross-platform experience, using data-driven methods to measure impact, reach, and reliability, while working closely with various internal teams. You will be responsible for the design, implementation, measurement, rollout, and refinement of solutions.
Key Responsibilities:
Contribution:
• Apply strong hands-on capability in migration, integration, and cloud project implementation in a specific domain to push the consumption growth objective.
• Identify customer needs in a specific domain and drive adoption and expansion. Lead product implementation, optimization, and troubleshooting.
• Model accountability in technical solution optimization, and take ownership of both successes and failures that impact the technical solutions.
Challenge:
• Handle customers' issues using multiple approaches to identify the optimal technical solution in a specific domain.
• Utilize Oracle and third-party data to gain a strategic, forward-thinking perspective on technical solutions.
• Stay informed about evolving technical and business trends that may impact current or future projects.
Expertise:
• Apply deep knowledge of Oracle Cloud technologies, industry verticals, and market trends to design tailored solutions.
• Continuously deepen your product and industry expertise, educating others and leveraging this knowledge to deliver impactful solutions.