希音后台开发工程师
社招全职信息技术类地点:广州状态:招聘
任职要求
• 本科及以上,计算机、数学、统计、通信等相关专业 • 精通 Python(或 Java/C++),熟悉 PyTorch/TensorFlow、Pandas/NumPy • 理解常见 ML/DL 模型(决策树、XGBoost、CNN、Transformer 等),能独立训练与调优 • 具备大数据处理经验(Spark、Hive)及向量检索实战(FAISS、HNSW)优先 • 加分项: • 熟悉大语言模型(GPT、LLaMA、ChatGLM)Fine-tuning 或 Prompt 工程 • 掌握视觉语言模型(CLIP、BLIP2 等)多模态特征融合
工作职责
• 参与机器学习/深度学习算法(推荐、预测、分类等)的设计、实现与优化 • 数据清洗、特征工程与探索性分析,构建高质量训练/测试集 • 设计并执行实验(A/B Test、交叉验证等),分析结果并持续迭代 • 协助模型上线部署与监控,保障服务稳定性
包括英文材料
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
Pandas+
[英文] 10 minutes to pandas
https://pandas.pydata.org/docs/user_guide/10min.html
This is a short introduction to pandas, geared mainly for new users.
[英文] Cookbook - pandas
https://pandas.pydata.org/docs/user_guide/cookbook.html#cookbook
This is a repository for short and sweet examples and links for useful pandas recipes.
https://www.kaggle.com/learn/pandas
Solve short hands-on challenges to perfect your data manipulation skills.
https://www.youtube.com/watch?v=2uvysYbKdjM
I'm super excited for this one. We're doing another complete Python Pandas tutorial walkthrough.
https://www.youtube.com/watch?v=Mdq1WWSdUtw
Filtering, Joins, Indexing, Data Cleaning, Visualizations
XGBoost+
[英文] What is XGBoost?
https://www.ibm.com/think/topics/xgboost
XGBoost (eXtreme Gradient Boosting) is a distributed, open-source machine learning library that uses gradient boosted decision trees, a supervised learning boosting algorithm that makes use of gradient descent.
https://www.youtube.com/watch?v=BJXt-WdeJJo
takes a deep dive into one of the most powerful machine learning algorithm, eXtreme Gradient Boosting, using a Jupyter notebook with Python.
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
GPT+
https://www.youtube.com/watch?v=kCc8FmEb1nY
We build a Generatively Pretrained Transformer (GPT), following the paper "Attention is All You Need" and OpenAI's GPT-2 / GPT-3.
ChatGLM+
https://www.youtube.com/watch?v=EXUX0MjBzI0
In this step-by-step tutorial, you'll learn how to use ChatGLM, one of the most powerful and completely free AI video generators available today.
https://www.youtube.com/watch?v=fGpXj4bl5LI
Exploring the concept of a GLM (General Language Model) and working with ChatGLM6B.
Prompt+
https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/introduction-prompt-design
A prompt is a natural language request submitted to a language model to receive a response back.
https://learn.microsoft.com/en-us/azure/ai-foundry/openai/concepts/prompt-engineering
These techniques aren't recommended for reasoning models like gpt-5 and o-series models.
https://www.youtube.com/watch?v=LWiMwhDZ9as
Learn and master the fundamentals of Prompt Engineering and LLMs with this 5-HOUR Prompt Engineering Crash Course!
相关职位
社招1年以上核心本地商业-业
1.主导业务大项目的方案设计和开发; 2.参与系统的高可用建设,做好系统日常运维,确保系统稳定; 3.参与设计并完成架构演进的实施; 4.发现并解决当前系统中存在的问题,持续提升系统效率和质量; 5.指导新人,积极输出实践经验,促进共同进步。
更新于 2025-06-16
社招3年以上IEG
1. 负责大模型应用的后端开发与架构设计,支撑高并发、低延迟的AI服务; 2. 参与大模型相关技术的落地,包括但不限于模型部署、API接入、RAG技术、及Agent开发; 3. 优化大模型服务的性能、稳定性及扩展性,解决实际业务场景中的技术挑战; 4. 与算法、产品团队紧密协作,推动技术方案的高效实现与迭代。 Work Location: China-Shenzhen
更新于 2025-06-26