AMDSoftware Development Engineer
任职要求
Skilled engineer with strong technical and analytical expertise in C++ development within Linux environments. The ideal candidate will thrive in both collaborative team settings and independent work, with the ability to define goals, manage development efforts, and deliver high-quality solutions. Strong problem-solving skills, a proactive approach, and a keen understanding of software engineering best practices are essential. KEY RESPONSIBILITIES: Deep Learning & LLM Framework Optimization: Optimize major DL/LLM frameworks (TensorFlow, PyTorch, vLLM, SGLang) for AMD GPUs and contribute improvements upstream. GPU Kernel & Operator Optimization: Develop and tune GPU kernels and performance-critical operators to maximize throughput and minimize latency. Model & Architecture Optimization: Adapt and optimize LLM architectures (e.g., Llama, Qwen, DeepSeek) and apply advanced techniques like FlashAttention, PagedAttention, and quantization. End-to-End Performance Engineering: Perform comprehensive profiling to identify bottlenecks and implement system, memory, and communication optimizations across multi-GPU and multi-node setups. Compiler & Pipeline Acceleration: Leverage advanced compiler technologies and graph compilers to enhance the full deep learning and inference pipeline. Research & Advanced Techniques: Prototype and integrate emerging optimization methods such as speculative decoding and weight-only quantization into production systems. Cross-Team & Open-Source Collaboration: Collaborate with internal GPU library teams and open-source maintainers to align improvements and ensure seamless upstream integration. Software Engineering Excellence: Apply robust engineering practices to deliver maintainable, reliable, and production-quality performance optimizations. MANDATORY EXPERIENCE: Inference Frameworks, Model Architectures & Optimization Expertise: Strong deep practical experience with vLLM or SGLang, mastery of modern LLMs (e.g., DeepSeek, Qwen), strong theoretical grounding in Transformer/Attention/MoE/KV Cac…
工作职责
THE ROLE: As a core member of the team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. Your experience will be critical in enhancing GPU kernels, deep learning models, and training/inference performance across multi-GPU and multi-node systems. You will engage with both internal GPU library teams and open-source maintainers to ensure seamless integration of optimizations, utilizing cutting-edge compiler technologies and advanced engineering principles to drive continuous improvement.
Headquartered in Singapore, Ant International powers the future of global commerce with digital innovation for everyone and every business to thrive. In close collaboration with partners, we support merchants of all sizes worldwide to realize their growth aspirations through a comprehensive range of tech-driven digital payment and financial services solutions. We are seeking for Java Software Engineers for our Malaysia Tech Center, work on end-to-end solutions for cross-border payments for our global merchants and globalization business. Key Responsibilities: 1. Design solutions involving integration with multiple systems and services. 2. Develop high volume, high performance, low latency and reliable mission critical applications. 3. Write maintainable, robust, and testable code. 4. Perform code and test case review. 5. Implement processes, solutions or tools to improve software delivery and quality. 6. Able to adopt latest software development trends and industry best practices.
We are aiming to leverage AI and other leading technology and dedicated to provide safe and reliable risk control capabilities behind payments. The core technologies include rule engines, model engines, intelligent algorithm models, etc., We are the leading platform with capabilities of high concurrent real-time risk calculations and massive big data analysis and processing. And as the core risk management tech platform for global payment business, we adopt a multi-center deployment architecture around the world. Here you may have the opportunity to learn more about and participate in the design and development of the following aspects: 1. Ultimate computing optimization at the millisecond level. 2. Behavior analysis and risk mining under massive data. 3. Global multi-center system architecture planning and high-availability solution design. 4. Participated in the design of R&D of risk control systems and big data platforms. You will also have the opportunity to explore the architectural design and implementation of cutting-edge technologies such as privacy computing and large models in risk control systems.
Design and implement user lifecycle management strategies for the e-commerce platform, responsible for enhancing overall user engagement, activity, and purchase depth. Analyze buyer data and behavior to identify trends and opportunities for increasing engagement and satisfaction. Collaborate with cross-functional teams, including marketing and customer service, to design and execute targeted campaigns and initiatives. Monitor and evaluate the performance of CLM projects, providing regular reports and suggestions for improvement. Utilize CRM tools to manage buyer interactions and personalize communication strategies. Ensure alignment of CLM activities with overall business objectives and customer experience goals.
- Lead the development and execution/develop of the company’s Enterprise Risk Management (ERM) strategy, ensuring it is aligned with the overall business strategy and risk appetite, in accordance with business conditions, the regulatory environment, and industry trends. - Develop and maintain a comprehensive ERM framework, including risk identification, assessment, mitigation, and monitoring processes. - Collaborate with the Executive Team and department heads to integrate risk management practices across all functions and business units. Collaborate to ensure the effective execution of risk controls and advise on risk reduction opportunities & best practices for risk management. - Provide thought leadership and guidance on emerging risks, including operational, strategic, compliance, and reputation - Oversee the development and testing of business continuity plans and disaster recovery procedures to ensure organizational resilience in the event of disruptions, and oversee the management of crisis situations, ensuring that risk mitigation measures are implemented swiftly and effectively. - Foster a risk-aware culture across the organization by ensuring effective communication of risk management policies, guidelines, and procedures.