
奇虎36026秋-AI大模型算法工程师-内容抽取及理解(北京)-4914(J11813)
校招全职算法类地点:北京状态:招聘
任职要求
1.教育背景:拥有计算机科学或相关领域的本科及以上学历。 2.专业经验:精通C++/Java/Python等至少一种编程语言。 3.工程技术专长: 熟悉Linux/Unix操作系统。 精通集合和多线程编程,以实现高并发,低延迟。 掌握常用设计模式,提升产品的扩展性和可维护性,延长产品生命周期,降低工程开发成本。 4.AI技术: 对LLM微调、推理和评估有深入理解。 有模型训练经验者优先。 搜索引擎与NLP:了解搜索引擎和NLP算法,有大规模知识挖掘、表示、推理和建模经验者优先。 领域专长:在知识图谱、智能问答、搜索引擎等领域有实际项目经验者优先。 文件处理:具有PDF、DOC文件解析经验者优先。
工作职责
我们正在寻找一位具有深厚技术背景和丰富经验的高级AI工程师,负责以下关键职责: 1.系统设计:利用最新的LLM、知识图谱、NLP和搜索引擎技术,参与构建和维护智能问答系统架构。 2.技术实现:推动问答技术的工程化,包括文本解析、知识图谱构建、搜索引擎集成和推理逻辑开发。 3.性能优化:参与系统架构和性能优化,确保产品的高效运行和扩展性。 4.技术引领:独立探索前沿技术,为产品方案提供技术支撑和验证。
包括英文材料
学历+
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Unix+
[英文] The UNIX® Standard
https://www.opengroup.org/membership/forums/platform/unix
https://www.youtube.com/watch?v=IrDUcdpPmdI
UNIX is an operating system which was first developed in the 1970s, and has been under constant development ever since.
多线程+
https://liaoxuefeng.com/books/java/threading/basic/index.html
和单线程相比,多线程编程的特点在于:多线程经常需要读写共享数据,并且需要同步。
https://www.youtube.com/watch?v=_uQgGS_VIXM&list=PLsc-VaxfZl4do3Etp_xQ0aQBoC-x5BIgJ
https://www.youtube.com/watch?v=IEEhzQoKtQU
https://www.youtube.com/watch?v=mTGdtC9f4EU&list=PLL8woMHwr36EDxjUoCzboZjedsnhLP1j4
https://www.youtube.com/watch?v=TPVH_coGAQs&list=PLk6CEY9XxSIAeK-EAh3hB4fgNvYkYmghp
https://www.youtube.com/watch?v=xPqnoB2hjjA
This video is an introduction to multithreading in modern C++.
https://www.youtube.com/watch?v=YKBwKy5PrpQ
Rust threading is easy to implement and improves the efficiency of your applications on multi-core systems!
高并发+
https://www.baeldung.com/concurrency-principles-patterns
In this tutorial, we’ll discuss some of the design principles and patterns that have been established over time to build highly concurrent applications.
https://www.baeldung.com/java-concurrency
Handling concurrency in an application can be a tricky process with many potential pitfalls. A solid grasp of the fundamentals will go a long way to help minimize these issues.
https://www.oreilly.com/library/view/concurrency-in-go/9781491941294/
You’ll understand how Go chooses to model concurrency, what issues arise from this model, and how you can compose primitives within this model to solve problems.
https://www.oreilly.com/library/view/modern-concurrency-in/9781098165406/
With this book, you'll explore the transformative world of Java 21's key feature: virtual threads.
https://www.youtube.com/watch?v=qyM8Pi1KiiM
https://www.youtube.com/watch?v=wEsPL50Uiyo
设计模式+
https://liaoxuefeng.com/books/java/design-patterns/index.html
设计模式,即Design Patterns,是指在软件设计中,被反复使用的一种代码设计经验。使用设计模式的目的是为了可重用代码,提高代码的可扩展性和可维护性。
[英文] Design Patterns
https://refactoring.guru/design-patterns
Design patterns are typical solutions to common problems in software design. Each pattern is like a blueprint that you can customize to solve a particular design problem in your code.
https://www.youtube.com/watch?v=NU_1StN5Tkk
Design Patterns tutorial explained in simple words using real-world examples.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
相关职位

校招算法类
1、负责PDF/DOC文档OCR相关算法/多模态解析的研发和工程实现,并将算法应用到业务场景中; 2、跟进OCR/多模态前沿技术,包括但不限于图像文字检测、识别,多语种识别,视频文本识别,版面分析,文本属性,语义理解等相关工作,进行技术难点攻关与前瞻研究; 3、通过持续优化人工智能识别算法和机器学习模型,提升光学识别的准确度和效率,提升应用的客户感知; 4、熟悉opencv。熟悉目标检测、跟踪、识别、分割、特征点等常见的任务。熟悉常见图像操作。 5、有PDF/DOC文档识别或者多模态文档经验优先;比如有OCR或者多模态解析经验,解决过财报,报表数字识别,和表格数字识别问题经验优先。
更新于 2025-09-02

校招算法类
我们正在寻找对AI Agent技术充满热情的应届毕业生加入我们的算法团队。你将参与设计和开发下一代智能Agent系统,致力于构建能够自主决策、多轮交互和复杂任务执行的AI应用。 主要工作内容: 1.参与Agent框架的设计与优化,包括规划、记忆、工具使用等核心模块 2.开发多模态Agent系统,支持文本、图像、语音等多种输入输出形式 3.研究和实现Agent的推理链优化,提升复杂任务的执行效率 4.构建Agent评测体系,设计自动化测试和性能监控方案 5.参与Agent在垂直领域的落地应用,如代码生成、数据分析、客服等场景 6.跟踪前沿研究,将最新理论成果快速转化为产品创新
更新于 2025-09-02

校招算法类
1. 面向业务场景:互联网图文内容业务、视频内容业务及集团AI创新业务 2. 结合业务需求,在可控图像生成方向、可控图像编辑方向、可控视频编辑方向进行前沿工作的跟踪、研究及落地,并对业务进行技术引领和落地支撑
更新于 2025-09-02