
同花顺大模型算法工程师
社招全职地点:杭州状态:招聘
任职要求
1. 具备完备且扎实的计算机基础知识体系,在深度学习和自然语言处理领域有较深入的研究或开发经验 2. 熟悉大语言模型的基本原理与训练方法,能够熟练进行大语言模型训练数据的构建、清洗和结构化 3. 精通数据处理和分析工具(如Python、Pandas、NumPy等),有处理大规模金融数…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
基于k线、图表、财报等金融数据训练多模态大模型,利用多模态技术开拓大模型在金融领域的应用 1. 跟进最新多模态大语言模型(MM-LLM)预训练、指令微调、RLHF-V等前沿技术,探索相关技术在金融领域的落地应用 2. 构建和优化金融数据处理pipeline,包括数据收集、清洗、标注和特征工程 3. 建立多种模态数据的统一标准和处理流程,包括文本、语音、图片、视频、代码等多模态数据 4. 搭建和打造性能最优的end-to-end multi-modal pipeline,实现从原始数据到模型输出的全流程优化
包括英文材料
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Pandas+
[英文] 10 minutes to pandas
https://pandas.pydata.org/docs/user_guide/10min.html
This is a short introduction to pandas, geared mainly for new users.
[英文] Cookbook - pandas
https://pandas.pydata.org/docs/user_guide/cookbook.html#cookbook
This is a repository for short and sweet examples and links for useful pandas recipes.
https://www.kaggle.com/learn/pandas
Solve short hands-on challenges to perfect your data manipulation skills.
https://www.youtube.com/watch?v=2uvysYbKdjM
I'm super excited for this one. We're doing another complete Python Pandas tutorial walkthrough.
https://www.youtube.com/watch?v=Mdq1WWSdUtw
Filtering, Joins, Indexing, Data Cleaning, Visualizations
NumPy+
https://numpy.org/doc/stable/user/absolute_beginners.html
NumPy (Numerical Python) is an open source Python library that’s widely used in science and engineering.
[英文] NumPy - Learn
https://numpy.org/learn/
Below is a curated collection of educational resources, both for self-learning and teaching others, developed by NumPy contributors and vetted by the community.
https://www.kaggle.com/code/themlphdstudent/learn-numpy-numpy-50-exercises-and-solution
This kernel uses exercises of NumPy from the Machine Learning Plus webpage
https://www.youtube.com/watch?v=KHoEbRH46Zk
If you've heard of Pandas and NumPy, you may think one is simply a superset of the other.
https://www.youtube.com/watch?v=QUT1VHiLmmI
Learn the basics of the NumPy library in this tutorial for beginners.
https://www.youtube.com/watch?v=VXU4LSAQDSc
This video serves as an introduction to the NumPy Python library.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
NoSQL+
https://nosql-database.org/
Everything about NoSQL Systems – Types, Benefits, and Real-World Uses
https://piaosanlang.gitbooks.io/mongodb/content/section1.1.html
NoSQL(NoSQL = Not Only SQL ),即"不仅仅是SQL",指的是非关系型的数据库。是对不同于传统的关系型数据库管理系统的统称。
https://www.youtube.com/watch?v=0buKQHokLK8
NoSQL databases can operate in multiple modes: as key-value store, document store or wide column store.
还有更多 •••
相关职位
社招1年以上算法开发岗
1、参与生成式大模型能力构建;不局限于模型设计、prompt优化、预训练、模型推理加速、其他能力建设等; 2、采用最先进的并行处理和分布式学习技术,制定并执行性能优化策略,显著提升大型语言模型的训练速度和推理能力,例如跟进DeepSeek R1技术架构等,确保技术行业领先; 3、推进大模型技术在京东物流各个业务场景落地,包括不限于智能问答、智能数据分析、智能决策以及Computer Use等,助力业务流程优化,增质提效; 4、深度探索大语言模型方向,保持技术领先优势,推动京东物流在行业内树立高效、精准的大模型/多模态大模型应用标杆,并取得业务收益。
更新于 2025-06-09北京
社招大模型
1、探索新一代大语言模型基座架构,完成扩散模型(diffusion model)在大语言模型的重塑,突破逐个token预测的方式,实现高效的推理模式,探索全新scaling law; 2、实现大模型训练的数据清洗、合成和评估;设计和实现大模型训练的AI Infra框架。
更新于 2025-11-20北京|上海