阿里巴巴晓天衡宇-AI开发工程师-数据方向
社招全职3年以上地点:杭州状态:招聘
任职要求
1. 具备高效的 Vibe Coding 能力:能够熟练运用 Cursor、Claude Code、GitHub Copilot 等 AI 辅助编程工具,进行高质量的代码生成、调试与重构,极大提升开发与迭代效率。 2. 扎实的数据处理与采集能力: * 熟练掌握Python及常用数据处理库(如Pandas, NumPy),对ETL流程有较为深入理解。 3. 丰富的智能体(Agent)开发经验: * 熟悉LangChain、LlamaIndex等主流智能体开发框架,具备基于大语言模型(LLM)的智能体应用开发实战经验。 * 深刻理解智能体的核心组件,包括任务规划(Planning)、记忆机制(Memory)、工具调用(Tool Calling/Function Calling)等,并能将其应用于…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1. 负责行业研究数据的自动化发现与获取:设计并实现自动化工具和流程,持续、高效地抓取和整合公开的行业信息,包括但不限于行业动态、技术发展趋势、市场报告、科研文献、专利信息等。 2. 搭建多模态数据预处理流程:负责对采集到的文本、图像、表格等多模态数据进行清洗、去重、格式标准化、关键信息提取及数据对齐等预处理工作,构建稳定可靠的数据处理管道(Pipeline)。 3. 建设与管理行业研究信息平台:主导行业研究数据质量管理平台的规划、设计与开发,确保数据的高质量存储、高效检索、便捷应用和持续更新,为行业研究团队提供强大的数据支撑。
包括英文材料
GitHub+
[英文] GitHub Learn
https://learn.github.com/
Discover a wide range of beginner-friendly tutorials, hands-on learning, and expert-led lessons.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Pandas+
[英文] 10 minutes to pandas
https://pandas.pydata.org/docs/user_guide/10min.html
This is a short introduction to pandas, geared mainly for new users.
[英文] Cookbook - pandas
https://pandas.pydata.org/docs/user_guide/cookbook.html#cookbook
This is a repository for short and sweet examples and links for useful pandas recipes.
https://www.kaggle.com/learn/pandas
Solve short hands-on challenges to perfect your data manipulation skills.
https://www.youtube.com/watch?v=2uvysYbKdjM
I'm super excited for this one. We're doing another complete Python Pandas tutorial walkthrough.
https://www.youtube.com/watch?v=Mdq1WWSdUtw
Filtering, Joins, Indexing, Data Cleaning, Visualizations
NumPy+
https://numpy.org/doc/stable/user/absolute_beginners.html
NumPy (Numerical Python) is an open source Python library that’s widely used in science and engineering.
[英文] NumPy - Learn
https://numpy.org/learn/
Below is a curated collection of educational resources, both for self-learning and teaching others, developed by NumPy contributors and vetted by the community.
https://www.kaggle.com/code/themlphdstudent/learn-numpy-numpy-50-exercises-and-solution
This kernel uses exercises of NumPy from the Machine Learning Plus webpage
https://www.youtube.com/watch?v=KHoEbRH46Zk
If you've heard of Pandas and NumPy, you may think one is simply a superset of the other.
https://www.youtube.com/watch?v=QUT1VHiLmmI
Learn the basics of the NumPy library in this tutorial for beginners.
https://www.youtube.com/watch?v=VXU4LSAQDSc
This video serves as an introduction to the NumPy Python library.
ETL+
https://www.ibm.com/think/topics/etl
ETL—meaning extract, transform, load—is a data integration process that combines, cleans and organizes data from multiple sources into a single, consistent data set for storage in a data warehouse, data lake or other target system.
https://www.youtube.com/watch?v=OW5OgsLpDCQ
It explains what ETL is and what it can do for you to improve your data analysis and productivity.
智能体+
https://learn.microsoft.com/en-us/shows/ai-agents-for-beginners/
In this 10-lesson course we take you from concept to code while covering the fundamentals of building AI agents.
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
LangChain+
https://python.langchain.com/docs/tutorials/
New to LangChain or LLM app development in general? Read this material to quickly get up and running building your first applications.
https://www.freecodecamp.org/news/beginners-guide-to-langchain/
LangChain is a popular framework for creating LLM-powered apps.
还有更多 •••