阿里巴巴阿里国际站-研发工程师-AI+工程师
社招全职3年以上技术类-开发地点:杭州状态:招聘
任职要求
1. 本科及以上学历,计算机、人工智能、数学、统计或相关专业;3年以上软件开发经验,具备大模型工程化落地、微调及应用开发的实际项目经验。 2. 工程基础:精通Python,熟悉Java/Go/C++至少一种;熟练掌握主流工程框架(如 Spring Boot、Django、React/Vue 等),具备高并发、分布式系统设计与开发能力;熟悉数据库(MySQL/PostgreSQL)、缓存(Redis)、消息队列(Kafka/RabbitMQ)等中间件的选型、优化与运维。 3. ML与数据基础:熟练使用 Python 进行数据处理,掌握 Pandas、NumPy、SQL 等常用工具;理解常见机器学习与深度学习算法原理;熟练使用至少一种主流深度学习框架(如:PyTorch),并具备模型训练、评估与调优经验。 4. 大模型工程化:掌握大模型微调技术(如 SFT、LoRA、P-Tuning、RLHF),能完成模型适配与优化;深入理解并实践 RAG、Prompt Engine…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1. 负责大模型在业务场景中的架构设计与工程化落地,综合运用RAG、AI Agent、Function Calling、Prompt Engineering等技术,构建端到端AI应用,包括知识库构建、向量检索集成、AI工作流编排及与业务系统的深度对接。 2. 负责大模型的生命周期工程管理,基于百炼、ModelScope、Hugging Face 或 LLaMA-Factory 等平台,开展模型微调(SFT/LoRA/P-Tuning/RLHF)、部署、监控与持续迭代; 3. 设计并实现面向业务目标的自动化评测体系,结合人工评估与自动指标(如F1、BLEU、ROUGE及业务定制指标),驱动数据闭环与模型迭代。 4. 开发高可用、高并发的应用服务,通过优化推理API性能、缓存策略与系统架构,保障服务稳定性与可扩展性。 5. 能够跨职能协同产出,与产品、UI/UX、测试及运维等团队紧密协作,推动AI功能从原型验证到规模化上线的全链路交付。
包括英文材料
学历+
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Spring Boot+
https://spring.io/guides/gs/spring-boot
his guide provides a sampling of how Spring Boot helps you accelerate application development.
https://www.youtube.com/watch?v=Nv2DERaMx-4&list=PLzUMQwCOrQTksiYqoumAQxuhPNa3HqasL
The author teaches you how to use Spring Boot from a complete beginner, to building a REST API with a real database, Dockerising it and deploying it to the cloud.
Gin+
https://gin-gonic.com/en/docs/quickstart/
This guide walks you through installing Gin, setting up a project, and running your first API—so you can start building web services with confidence.
https://go.dev/doc/tutorial/web-service-gin
This tutorial introduces the basics of writing a RESTful web service API with Go and the Gin Web Framework (Gin).
Django+
https://www.youtube.com/watch?v=nGIg40xs9e4
Learn how to build a simple Django application in as fast as 20 minutes!
https://www.youtube.com/watch?v=rHux0gMZ3Eg
Learn Django and start building amazing back-ends!
MySQL+
https://juejin.cn/post/7190306988939542585
这是一篇 MySQL 通关一篇过硬核经验学习路线,包括数据库相关知识,SQL语句的使用,数据库约束,设计等。
[英文] MySQL Tutorial
https://www.mysqltutorial.org/
your go-to resource for mastering MySQL in a fast, easy, and enjoyable way.
https://www.youtube.com/watch?v=5OdVJbNCSso
MySQL SQL tutorial for beginners
https://www.youtube.com/watch?v=7S_tz1z_5bA
This beginner-friendly course teaches you SQL from scratch.
PostgreSQL+
[英文] PostgreSQL Tutorial
https://neon.com/postgresql/tutorial
This PostgreSQL tutorial helps you quickly understand PostgreSQL.
[英文] PostgreSQL Tutorial
https://www.pgtutorial.com/
This PostgreSQL tutorial will teach you about PostgreSQL from beginner to advanced.
https://www.youtube.com/watch?v=qw--VYLpxG4
It is the most advanced open source database system widely used to build back-end systems.
https://www.youtube.com/watch?v=SpfIwlAYaKk
Learn PostgreSQL, one of the world's most advanced and robust open-source relational database systems.
Redis+
[英文] Developer Hub
https://redis.io/dev/
Get all the tutorials, learning paths, and more you need to start building—fast.
https://www.runoob.com/redis/redis-tutorial.html
REmote DIctionary Server(Redis) 是一个由 Salvatore Sanfilippo 写的 key-value 存储系统,是跨平台的非关系型数据库。
https://www.youtube.com/watch?v=jgpVdJB2sKQ
In this video I will be covering Redis in depth from how to install it, what commands you can use, all the way to how to use it in a real world project.
缓存+
https://hackernoon.com/the-system-design-cheat-sheet-cache
The cache is a layer that stores a subset of data, typically the most frequently accessed or essential information, in a location quicker to access than its primary storage location.
https://www.youtube.com/watch?v=bP4BeUjNkXc
Caching strategies, Distributed Caching, Eviction Policies, Write-Through Cache and Least Recently Used (LRU) cache are all important terms when it comes to designing an efficient system with a caching layer.
https://www.youtube.com/watch?v=dGAgxozNWFE
高并发+
https://www.baeldung.com/concurrency-principles-patterns
In this tutorial, we’ll discuss some of the design principles and patterns that have been established over time to build highly concurrent applications.
https://www.baeldung.com/java-concurrency
Handling concurrency in an application can be a tricky process with many potential pitfalls. A solid grasp of the fundamentals will go a long way to help minimize these issues.
https://www.oreilly.com/library/view/concurrency-in-go/9781491941294/
You’ll understand how Go chooses to model concurrency, what issues arise from this model, and how you can compose primitives within this model to solve problems.
https://www.oreilly.com/library/view/modern-concurrency-in/9781098165406/
With this book, you'll explore the transformative world of Java 21's key feature: virtual threads.
https://www.youtube.com/watch?v=qyM8Pi1KiiM
https://www.youtube.com/watch?v=wEsPL50Uiyo
Prompt+
https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/introduction-prompt-design
A prompt is a natural language request submitted to a language model to receive a response back.
https://learn.microsoft.com/en-us/azure/ai-foundry/openai/concepts/prompt-engineering
These techniques aren't recommended for reasoning models like gpt-5 and o-series models.
https://www.youtube.com/watch?v=LWiMwhDZ9as
Learn and master the fundamentals of Prompt Engineering and LLMs with this 5-HOUR Prompt Engineering Crash Course!
RAG+
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
Prometheus+
https://grafana.com/docs/grafana/latest/getting-started/get-started-grafana-prometheus/
Prometheus is an open source monitoring system for which Grafana provides out-of-the-box support.
https://prometheus.io/docs/tutorials/getting_started/
Prometheus is a system monitoring and alerting system.
Grafana+
CI+
https://www.ibm.com/cn-zh/think/topics/continuous-integration
持续集成 (CI) 是一种软件开发实践,开发人员在整个开发周期中会定期将新的代码和代码变更集成到中央代码存储库中。它是 DevOps 和敏捷方法的关键组成部分。
https://www.youtube.com/watch?v=42UP1fxi2SY
CD+
https://www.redhat.com/zh-cn/topics/devops/what-is-ci-cd
CI/CD 是持续集成和持续交付/部署的缩写,旨在简化并加快软件开发生命周期。
https://www.youtube.com/watch?v=R8_veQiYBjI&list=PLy7NrYWoggjzSIlwxeBbcgfAdYoxCIrM2
TensorRT+
https://docs.nvidia.com/deeplearning/tensorrt/latest/getting-started/quick-start-guide.html
This TensorRT Quick Start Guide is a starting point for developers who want to try out the TensorRT SDK; specifically, it demonstrates how to quickly construct an application to run inference on a TensorRT engine.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
还有更多 •••
相关职位
社招A129339
1、负责AI智能化工作流引擎的架构设计与研发,构建复杂业务场景的自动化系统,持续优化执行效率及稳定性; 2、开发基于LLM的智能决策节点组件,探索AI能力与工作流引擎的深度集成方案,提升业务流程自动化水平; 3、设计可视化工作流编排工具,开发支持动态配置、实时监控、异常自愈的智能工作流管理平台; 4、构建支持多模态交互(CLI/低代码/自然语言/音视频/图像)的工作流开发体系; 5、针对行业场景设计AI增强型工作流解决方案,如智能客服流程编排、自动化决策引擎等创新应用。
更新于 2023-11-27北京
社招3年以上A127336
1、负责工程效率相关应用技术研发、开发和维护,包括但不限于质量、流程、自动化等方向,确保产品的高质量和稳定性; 2、探索工程效率后端领域最优实践,寻找打破传统后端的工作方式; 3、探索并打造基于LLM的下一代质量服务,重点通过LLM提升软件测试质量与效率,包括但不限于文本/自动化用例生成、智能接口测试、智能评估、风险引擎等方向。
更新于 2024-10-23深圳
社招3年以上技术类-开发
1. 驱动阿里国际数字商业板块业务增长,构建 AIDC 分国家数据运营资产体系; 2. 建设本地化市场洞察和机会发现能力,帮助业务全方位了解行业动态,挖掘商业机会 3. 承担设计和建立产品/业务的指标体系,能够科学地跟踪和指导业务的发展 4. 参与构建海外电商用增、商品、商家、流量等行业的数据智能化应用解决方案,驱动业务获得增量价值; 5. 参与构建企业级 Data Agent,融合 LLM 与业务数据打造智能决策大脑;
更新于 2026-01-14杭州