
同程旅行图像算法工程师
社招全职2年以上地点:北京状态:招聘
任职要求
1、学历:硕士及以上学历,专业为模式识别、机器学习、计算机、自动化、数学等相关专业。 2、数学基础:具备扎实的数学基础,熟悉模式识别、机器学习理论基础,掌握领域内常用算法。 3、编程能力:熟练掌握C、C++、Python中的一种,熟悉linux下开发。 4、逻辑思维能力:具备优秀的逻辑思维能力,有强烈的上进心和求知欲,善于接受及学习新技术。 5、团队合作精神:良好的团队合作精神,较强的沟通能力。 6、经验:期望2年以上图像/多模态算法开发经验,并至少对以下领域之一有实际的经验积累。 a. 对深度学习算法有一定的理解,包括不限于文本分类、语义理解、图像/视频理解、检测、分割、人脸文本生成等。 b. 熟悉常见的机器学习和深度学习算法, 熟练掌握至少一种深度学习框架,如Pytorch/TensorRT/Tensorflow/MNN/NCNN等,并能够熟练掌握及理解CNN/RNN/Transformer等常见网络模型。 7、学术前沿关注度:对学术前沿有浓厚兴趣,时刻跟进技术前沿,并善于利用各类技术解决复杂的实际问题。 8、具有AIGC相关项目经验者优先。
工作职责
职位名称: 职位描述(JD): 我们正在寻找一位富有激情和创新精神的图像算法工程师,加入我们充满活力的团队。作为图像算法工程师,你将负责研发和优化图像处理算法,并致力于将最新技术应用于我们的产品中。 岗位职责 1、研发视觉算法,与各类技术专家共同探索计算机视觉、AIGC、多模态、人工智能等前沿技术,并推动其在业务中的应用,实现业务增长。 2、根据业务场景,设计并预先研究简洁的技术方案。 3、负责图像生成、智能编辑算法相关产品的研发,如图像矫正、图像高清增强、图像美化/去噪/修复、图像文本识别等,涉及的技术包括但不限于图像检测、识别分割、超分、合成、多模态等。 4、负责算法的研发、实施及效果优化,解决算法在实际应用中可能遇到的优化问题,包括模型加速、压缩及效果提升等,确保CV算法成功应用于实际场景中。 5、积极研究前沿计算机视觉技术,结合公司未来业务需求,开发领先算法,短期实现技术突破并长期对公司业务产生积极影响。
包括英文材料
学历+
模式识别+
https://www.mathworks.com/discovery/pattern-recognition.html
Pattern recognition is the process of classifying input data into objects, classes, or categories using computer algorithms based on key features or regularities.
https://www.microsoft.com/en-us/research/wp-content/uploads/2006/01/Bishop-Pattern-Recognition-and-Machine-Learning-2006.pdf
Pattern recognition has its origins in engineering, whereas machine learning grew out of computer science.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorRT+
https://docs.nvidia.com/deeplearning/tensorrt/latest/getting-started/quick-start-guide.html
This TensorRT Quick Start Guide is a starting point for developers who want to try out the TensorRT SDK; specifically, it demonstrates how to quickly construct an application to run inference on a TensorRT engine.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
MNN+
https://github.com/alibaba/MNN?tab=readme-ov-file#intro
MNN is a highly efficient and lightweight deep learning framework.
CNN+
https://learnopencv.com/understanding-convolutional-neural-networks-cnn/
Convolutional Neural Network (CNN) forms the basis of computer vision and image processing.
[英文] CNN Explainer
https://poloclub.github.io/cnn-explainer/
Learn Convolutional Neural Network (CNN) in your browser!
https://www.deeplearningbook.org/contents/convnets.html
Convolutional networks(LeCun, 1989), also known as convolutional neuralnetworks, or CNNs, are a specialized kind of neural network for processing data.
https://www.youtube.com/watch?v=2xqkSUhmmXU
MIT Introduction to Deep Learning 6.S191: Lecture 3 Convolutional Neural Networks for Computer Vision
RNN+
https://d2l.ai/chapter_recurrent-neural-networks/rnn.html
A neural network that uses recurrent computation for hidden states is called a recurrent neural network (RNN).
https://www.deeplearningbook.org/contents/rnn.html
Recurrent neural networks, or RNNs (Rumelhart et al., 1986a), are a family of neural networks for processing sequential data.
https://www.ibm.com/think/topics/recurrent-neural-networks
A recurrent neural network or RNN is a deep neural network trained on sequential or time series data to create a machine learning (ML) model that can make sequential predictions or conclusions based on sequential inputs.
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
相关职位
社招技术类
1、负责电商平台商品内容理解所需的相关图像/视频算法研发; 2、优化同款商品/相似图片、图片分类等业务模型; 3、负责检测、分类、分割、AR/slam算法研发和服务。
更新于 2025-03-31

社招
1、负责在交通行业内落地业界前沿的视觉算法:包括但不限于:人脸识别、车票文字OCR、图像理解等方向。推荐预测算法:包括但不限于延误预测、放票规则预测、出行方案推荐等。 2、负责业务环节中图像相关应用问题的需求发掘,设计,技术研发,以及推动上线落地和不断迭代优化。 3、跟踪前沿的算法理论,并且将优秀的算法应用到业务场景中,提升业务效果。
更新于 2023-02-15
社招研发类
1、从事媒体领域,包括:图像/视频相关领域业界先进技术及发展趋势洞察、算法的研究创新、评测及工程化落地,实现公司在媒体领域的技术竞争力领先; 2、负责图像理解、图像增强、视觉等传统或AI系统方案设计、算法开发、工程化落地,构建智能手机、平板等产品在图像、视频等领域的核心竞争力; 3、独立承担图像算法的调研、设计工作,完成关键算法的开发、产品交付及验证; 4、调研和跟踪高校、第三方公司在图像/AI等算法方向的技术发展趋势,推进技术合作。
更新于 2025-07-28