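Ant Financial | Ant Technology Research Institute Long-Term Research Intern - Large Model Algorithms
Internship | Part-time | Ant Technology Research Institute Long-Term Research Intern Program
Location: Beijing | Shanghai | Hangzhou
Status: Hiring
Requirements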
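1. Education: Current master's or PhD student in computer science, artificial intelligence, data science, mathematics, or a related field.
2. Technical skills:
   * Proficient in Python programming, with a solid foundation in algorithms, data structures, and machine learning.
   * Proficient in deep learning and large-model frameworks (e.g., PyTorch, Hugging Face Transformers, LangChain, vLLM, DeepSpeed, Megatron-LM).
   * Understanding of the basic principles of large models and of their training and evaluation workflows.
3. Research experience:
   * Research experience on the data side of large models is preferred.
   * Experience with large-scale data processing (e.g., Common Crawl) is preferred.
4. Paper reading and implementation: Able to quickly read and understand papers from top conferences (e.g., NeurIPS, ICLR, ICML, ACL) and to reproduce their algorithms.
5. Learning and innovation: Strong interest in cutting-edge technology, with the ability to think independently and solve problems.
6. Teamwork: Good communication skills and a collaborative spirit; able to work efficiently with team members.
7. Bonus points:
   * Relevant publications in top conferences or journals are preferred.
   * Experience with distributed data processing frameworks (e.g., Ray, Spark) is preferred.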
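Responsibilities
Participate in frontier research on the data side of large models. You will work with a top-tier research team to explore the central role of data in large-model training, optimization, and application, and to advance innovation in data intelligence for large models.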
Includes English-language materials
Data Science
https://roadmap.sh/ai-data-scientist
Step by step roadmap guide to becoming an AI and Data Scientist
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese; free, beginner-friendly, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Data Structures
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
Machine Learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Deep Learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
Large Models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
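PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.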
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Transformer
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
LangChain
https://python.langchain.com/docs/tutorials/
New to LangChain or LLM app development in general? Read this material to quickly get up and running building your first applications.
https://www.freecodecamp.org/news/beginners-guide-to-langchain/
LangChain is a popular framework for creating LLM-powered apps.
vLLM
https://www.newline.co/@zaoyang/ultimate-guide-to-vllm--aad8b65d
vLLM is a framework designed to make large language models faster, more efficient, and better suited for production environments.
https://www.youtube.com/watch?v=Ju2FrqIrdx0
vLLM is a cutting-edge serving engine designed for large language models (LLMs), offering unparalleled performance and efficiency for AI-driven applications.
DeepSpeed
https://www.youtube.com/watch?v=pDGI668pNg0
Megatron
https://www.youtube.com/watch?v=hc0u4avAkuM
NeurIPS
https://neurips.cc/
ICLR
https://iclr.cc/
ICML
https://icml.cc/
Ray
https://github.com/ray-project/ray
Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
https://www.youtube.com/watch?v=FhXfEXUUQp0
In this video, I'll teach you everything you need to know about Apache Ray!
https://www.youtube.com/watch?v=fMiAyj2kgac
Using powerful machine learning algorithms is easy using Ray.io and Python.
https://www.youtube.com/watch?v=q_aTbb7XeL4
Parallel and Distributed computing sounds scary until you try this fantastic Python library.
Spark
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.