英伟达Deep Learning Solution Architect
任职要求
• 5+ years’ experience with research/development/application of Machine Learning, data analytics, or computer vision work flows. • Outstanding verbal and written communication skills • Ability to work independently with minimal day-to-day direction • C/C++/Python/Java/Scala programming experience • Desire to be involved in multiple diverse and innovative projects • Experience using scale-out cloud and/or HPC ar…
工作职责
NVIDIA is leading company of AI computing. At NVIDIA, our employees are passionate about AI, HPC , VISUAL, GAMING. Our SA team is more focusing to bring NVIDIA new technology into difference industries. We help to design the architecture of AI computing platform, analysis the AI and HPC applications to deliver our value to customers. You will work closely with industry sales, developer relationship managers and product teams in the hiring position. What you’ll be doing: • Assist in supporting industry accounts and driving research/influencing/new business in those accounts. • Assist researchers/engineers on their GPU applications. • Deliver technical projects, demos and client support tasks as directed by the Solution Architecture leadership team. • Provide technical support for GPU system deployments. • Be an industry thought leader on integrating NVIDIA technology into applications built on Deep Learning, High Performance Data Analytics, Robotics, Signal Processing and other key applications. • Be an internal champion for Data Analytics, Machine Learning, and Cyber among the NVIDIA technical community.
1、深入探索LLM在搜索场景中的推理能力与深度研究(Deep Research)模式,优化信息整合与总结效果,打造高效、精准的智能搜索产品,推动AI技术在实际应用中的突破; 2、AI搜索总结Agent研发: 1)设计并实现基于LLM的搜索总结Agent,提升搜索结果的理解、推理与结构化总结能力; 2)探索LLM Reasoning技术(如思维链、多步推理),优化复杂查询的Deep Research模式,实现长文本理解与跨文档信息融合; 3)构建端到端系统,涵盖意图识别、知识检索、结果生成与偏好对齐,提升用户体验; 3、模型优化及应用: 1)通过指令微调(Instruction Tuning)、偏好对齐(RLHF/DPO)等技术优化模型在搜索场景的适应性; 2)探索多模态信息(文本、代码、结构化数据)融合的搜索与生成技术; 3)研究未来生活中的创新应用场景(如个性化知识助手、自动化研究工具),探索技术边界。
1、负责搭建快手NLP技术体系,包括但不限于文本分类、知识图谱、翻译、对话等; 2、与业务部门进行沟通与协作,交付满足产品需求的核心算法模型与能力。
1、负责AI小快智能助理机器人的研究和开发; 2、优化基础模型,并采用RAG、Agent等大模型衍生框架,来提升相关业务指标; 3、持续跟进并深入调研大模型前沿技术、开源方案,跟踪业内大模型领域的最新进展并推进相关研究,探寻将最新技术应用到AI小快的可能性。
• Be a subject matter expert on databases, particularly on Relational Databases, able to discuss with customer on database modelling, migration, performance testing and day to day operations • Have wide ranging experience with open source and commercial databases such as MySQL, PostgreSQL, Oracle & SQL Server…etc • Be familiar with major cloud service providers in using it for deployment of workloads (AWS, Alicloud, GCP, Azure…etc) • Be a technical expert on all aspects of OceanBase (compatibility assessment, deployment, administration, development, migration…etc) • Facilitate introduction, discussion and demonstration of OceanBase’s technology, vision and value proposition either via individual or group sessions with customers • Engage and discover prospects’ pain points, business/technical challenges and identify how OceanBase can add value • Run Proof of Concept projects with customers to validate OceanBase’s capabilities Support post sale OceanBase implementation with OceanBase delivery team • Maintain deep understanding of competitive as well as complementary technologies and how to position OceanBase DB in relation to them • Provide guidance on how to resolve customer-specific technical challenges • Collaborate with Product Management, Engineering, and Marketing to continuously improve OceanBase product and position in the market