蚂蚁金服蚂蚁技术研究院长期研究型实习生-强化学习方向
实习兼职蚂蚁技术研究院长期研究型实习生项目地点:北京 | 上海 | 杭州状态:招聘
任职要求
1.有相关研究背景的博士生,特别优秀的硕士生亦可; 2.有丰富的自然语言处理或强化学习方向的科研经验,基础扎实,对至少其中一个子领域的算法在深度/广度两个维度都有深入的理解和思考; 3.学习能力强,热爱思考问题,有独立的研究风格,工作原创性高; 4.动手能力强,工作效率高,并有独立编写和调试完整算法的丰富经验; 5.逻辑能力强,能够清晰表达自己的想法,善于主动交流讨论、倾听建议、以及分享; 6.对算法研究抱有热情,有独立在国际顶刊/顶会发表论文的能力; 7.有在ACL/EMNLP/NeurIPS/ICLR/ICML等国际顶会顶刊发表论文经验者优先; 8.至少3个月的实习工作。
工作职责
职位描述:负责强化学习方向算法的创新研究
包括英文材料
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
NeurIPS+
https://neurips.cc/
ICLR+
https://iclr.cc/
ICML+
https://icml.cc/
相关职位