
哈啰 [量天尺] Agent AI Engineer - Shanghai
Experienced hire (社招) | Full-time | 量天尺 Program | Location: Shanghai | Status: Hiring
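Requirements
1. Master's degree or above in computer science, artificial intelligence, or a related field; PhD preferred. Publications at top conferences or in top journals (CVPR, ICML, etc.) are a plus.
2. Familiarity with distributed training and parallel computing, with hands-on experience on large-scale projects and experience in accelerating AI model training and inference.
3. Proficiency in mainstream deep learning algorithms, including but not limited to Transformer, Diffusion, GNN, and reinforcement learning, and familiarity with their computation patterns and key optimization points.
4. Solid programming fundamentals, proficiency in Python, C/C++, and similar languages, and good architecture design skills and coding standards.
5. Familiarity with mainstream deep learning frameworks (such as TensorFlow and PyTorch), distributed training frameworks (such as DeepSpeed and NeMo Megatron), and inference frameworks (such as vLLM and TensorRT), plus a deep understanding of multi-process, multi-thread, MPI, and other forms of parallel computing.
6. Familiarity with model acceleration methods such as model compression, quantization, and pruning, with relevant project experience.
7. Familiarity with model performance analysis tools (such as PyTorch Profiler and TensorBoard); performance optimization experience is a plus.
8. Strong results in relevant technical competitions (such as Kaggle, ACM, MLPerf) are a plus.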
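Responsibilities
We hope you are a PhD or master's student graduating between November 2024 and October 2026, and also:
Academic pioneer: you have published significant papers in top journals or at top conferences, domestic or international (including but not limited to NeurIPS, ICML, CVPR, ICCV, ECCV, or core journals in the IEEE Transactions series).
Competition ace: you have achieved strong results in top domestic or international contests (including but not limited to RoboMaster, Topcoder, Codeforces, ACM-ICPC, RoboCup).
Hands-on expert: you have research project or internship experience in autonomous driving, robotics, foundation large models, or complex Agents (including but not limited to perception algorithm optimization, decision model development, and building complex multi-Agent systems).
On the same wavelength: rational and pragmatic, bold in thought and action, hungry for success, optimistic and aggressive, smart and self-reflective.
Job content:
1. Deeply optimize AI model training and inference pipelines, including multi-node, multi-GPU distributed training schemes, to ensure fast, stable training and strong inference performance. Know and apply distributed and parallel optimization strategies such as TP/PP/DP/EP/ZeRO to make full use of the hardware.
2. Optimize parallel training strategies and distributed training frameworks to improve model scalability and cluster resource utilization, and resolve load balancing, synchronization, and communication bottleneck issues in distributed training.
3. Research and apply acceleration techniques such as model compression, quantization (including KV cache quantization), pruning, and FlashAttention to shorten inference latency and reduce deployment cost. Work with the algorithm team to trim and customize model architectures for specific application scenarios.
4. Work with the infrastructure team to improve the scheduling and utilization efficiency of cluster compute, GPU memory, bandwidth, and other resources, and analyze and continuously reduce the total compute cost of training and inference.
5. Use tools such as NVIDIA Nsight Compute and PyTorch Profiler to analyze model performance bottlenecks in depth and unlock the potential of hardware and algorithms.
6. Follow the latest research and industry developments in AI acceleration, evaluate the feasibility of new technologies and introduce them, and proactively explore and deploy new training optimization strategies or acceleration engines.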
Includes English-language materials
CVPR
https://cvpr.thecvf.com/
ICML
https://icml.cc/
Deep learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Transformer
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
GNN
https://distill.pub/2021/gnn-intro/
Neural networks have been adapted to leverage the structure and properties of graphs.
https://gnn.seas.upenn.edu/
Graph Neural Networks (GNNs) are information processing architectures for signals supported on graphs.
https://www.ibm.com/think/topics/graph-neural-network
Graph neural networks (GNNs) are a deep neural network architecture that is popular both in practical applications and cutting-edge machine learning research.
Reinforcement learning
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, for complete beginners, with full examples, based on the latest Python 3 version.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
This handbook follows the 80/20 rule: you will learn 80% of the C programming language in 20% of the time.
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
System design
https://roadmap.sh/system-design
Everything you need to know about designing large scale systems.
https://www.youtube.com/watch?v=F2FmTdLtb_4
This complete system design tutorial covers scalability, reliability, data handling, and high-level architecture with clear explanations, real-world examples, and practical strategies.
TensorFlow
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
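PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.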
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Megatron
https://www.youtube.com/watch?v=hc0u4avAkuM
vLLM
https://www.newline.co/@zaoyang/ultimate-guide-to-vllm--aad8b65d
vLLM is a framework designed to make large language models faster, more efficient, and better suited for production environments.
https://www.youtube.com/watch?v=Ju2FrqIrdx0
vLLM is a cutting-edge serving engine designed for large language models (LLMs), offering unparalleled performance and efficiency for AI-driven applications.
TensorRT
https://docs.nvidia.com/deeplearning/tensorrt/latest/getting-started/quick-start-guide.html
This TensorRT Quick Start Guide is a starting point for developers who want to try out the TensorRT SDK; specifically, it demonstrates how to quickly construct an application to run inference on a TensorRT engine.
Multithreading
https://liaoxuefeng.com/books/java/threading/basic/index.html
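Compared with single-threaded programming, multithreaded programming is distinguished by the fact that multiple threads often need to read and write shared data and must synchronize.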
https://www.youtube.com/watch?v=_uQgGS_VIXM&list=PLsc-VaxfZl4do3Etp_xQ0aQBoC-x5BIgJ
https://www.youtube.com/watch?v=IEEhzQoKtQU
https://www.youtube.com/watch?v=mTGdtC9f4EU&list=PLL8woMHwr36EDxjUoCzboZjedsnhLP1j4
https://www.youtube.com/watch?v=TPVH_coGAQs&list=PLk6CEY9XxSIAeK-EAh3hB4fgNvYkYmghp
https://www.youtube.com/watch?v=xPqnoB2hjjA
This video is an introduction to multithreading in modern C++.
https://www.youtube.com/watch?v=YKBwKy5PrpQ
Rust threading is easy to implement and improves the efficiency of your applications on multi-core systems!
Message Passing Interface
https://www.youtube.com/watch?v=7huftuXExV0
Parallel programming and MPI are crucial tools for achieving high performance computing.
[English] 📺 Basics of the Message Passing Interface (MPI) to program distributed memory parallel computers
https://www.youtube.com/watch?v=tm8M5H1OZmw
The Message Passing Interface (MPI) is a widely used standard to program distributed memory parallel computers.
TensorBoard
https://www.tensorflow.org/tensorboard/get_started
In machine learning, to improve something you often need to be able to measure it.
https://www.youtube.com/watch?v=k7KfYXXrOj0
In this video we learn how to use various parts of TensorBoard to for example obtain loss plots, accuracy plots, visualize image data, confusion matrices.
Kaggle
[English] Kaggle Learn
https://www.kaggle.com/learn
Gain the skills you need to do independent data science projects.