夸克智能信息-图像生成&编辑算法专家-杭州
社招全职1年以上技术类-算法地点:北京 | 杭州状态:招聘
任职要求
1.1年以上计算机视觉的实践经验,有以下研究方向优先: -多模态生成和理解:如文本/图像/视频/3D生成和编辑,以及其他相关的多模态经验; -熟悉前言多模态大模型技术,包括但不限于LLaVA、QwenVL、InternVL等; -熟练掌握SFT和RL训练策略,熟悉ms-swift、LLaMA-Factory等代码框架; -熟悉扩散模型,GAN,等用于生成任务的转换器; -有大规模训练经验、AIGC, LLM和RLHF等; 2、动手能力强, 具有熟练的算法和编程能力,熟悉C/C++和Python编程; 3、工作积极主动, 能与团队融洽合作相处,同时能够独立完成研究工作; 4、具有行业影响力高质量论文, 或者顶尖竞赛经历的优先(e.g., ACM)。
工作职责
1、利用SD、VLLM、LLM等AIGC相关技术参与图文生成、视频生成、智能化编辑,包括但不限于海报生成、动态海报、数字人等; 2、负责AI算法的架构设计与优化,针对不同业务场景提出通用性或定制化的解决方案; 3、结合实际业务需求,探索和解决新问题,并通过创新和改进推动团队整体能力提升。
包括英文材料
OpenCV+
https://learnopencv.com/getting-started-with-opencv/
At LearnOpenCV we are on a mission to educate the global workforce in computer vision and AI.
https://opencv.org/university/free-opencv-course/
This free OpenCV course will teach you how to manipulate images and videos, and detect objects and faces, among other exciting topics in just about 3 hours.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
SFT+
https://cameronrwolfe.substack.com/p/understanding-and-using-supervised
Understanding how SFT works from the idea to a working implementation...
Swift+
[英文] A Swift Tour
https://docs.swift.org/swift-book/documentation/the-swift-programming-language/guidedtour/
Explore the features and syntax of Swift.
https://www.hackingwithswift.com/learn
Free Swift and iOS tutorials
https://www.youtube.com/watch?v=8Xg7E9shq0U
Learn the Swift programming language in this full tutorial for beginners.
LLaMA-Factory+
https://llamafactory.readthedocs.io/en/latest/
LLaMA Factory is an easy-to-use and efficient platform for training and fine-tuning large language models.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
相关职位
社招1年以上运营-产品运营
1、深度参与大模型前沿方向的数据工作,重点负责文生图模型的数据寻源、数据标注与管理,模型效果评估; 2、设计各技术方案下阶段的数据方案与策略,建设文生图大模型的数据生产和质量提升流程,管理高效高质的数据生产pipeline,建设起行业领先的数据生产标准; 3、建立科学的模型效果评估方案与策略,给出模型优化建议,助力模型效果达到业内一流; 4、深入理解业务场景、市场动态和大模型技术趋势,牵引数据团队和算法团队的深度融合。
更新于 2025-09-26
社招1年以上技术类-算法
1.负责基于开源或内部基础大模型,进行文生图、文生视频、图像/视频编辑等AIGC技术能力的精调、优化,持续提升用户体验。 2.深入探索Agent在智能创作等业务场景的应用,负责构建大规模Multi-Agent系统,并对视觉语言模型(VLM)进行高效的定制与微调,以驱动业务创新。 3.进行前沿AI应用方向的技术预研,跟踪并评估最新研究成果,主动探索其在业务场景中的可行性,并负责将有潜力的技术迅速落地为核心业务能力,驱动产品创新与运营效率提升。
更新于 2025-09-26
社招5年以上技术类-开发
1. 负责夸克智能视觉相关业务服务,负责深度学习算法服务的流程设计及研发工作 2. 深入理解业务(扫描滤镜、文字服务、图像编辑、图像生成等),和算法紧密合作,对已有服务进行全链路的改进和优化 3. 技术预研和技术难点攻关,引入业界新技术和系统化方法,提升服务迭代效率,保障服务的稳定性、高性能和可扩展性
更新于 2025-10-16