百度机器学习/数据挖掘/自然语言处理工程师(J84110)
社招全职2年以上MEG地点:上海状态:招聘
任职要求
-2年以上工作经验 - 具有以下一个或多个领域的理论背景和实践经验:机器学习/数据挖掘/深度学习/信息检索/自然语言处理/机制设计/博弈论 -至少精通一门编程语言,熟悉网络编程,多线程,分布式编程技术,对数据结构和算法设计有较为深刻的理解 -良好的逻辑思维能力,对数据敏感,能够发现关键数据,抓住核心问题 -较强的沟通能力和逻辑表达能力,具备良好的团队合作精神和主动沟通意识
工作职责
-团队描述 商业品牌策略研发组,隶属于百度移动生态事业群,致力于构建业界领先的品牌广告搜索引擎,负责包括开屏矩阵、品牌专区、信息流GD、品牌智能体等多个公司核心业务线,服务包括零售、美妆、服饰、汽车、3C等行业知名品牌客户。基于百度搜索、信息流亿级别的用户流量,设计实现大规模、高吞吐、低延时的分布式广告检索系统,处理在线、离线、近线等多种复杂业务场景;依托于海量的互联网数据,在触发排序策略、相关性模型、转化优化、流量预估、库存分配等方向,都有雄厚的积累和技术领先性;基于vue、react等业界主流前端框架,打造一站式的品牌样式生产平台,为品牌客户制作丰富、优质、炫酷的广告创意,给予百度用户优秀的视觉和交互体验;良好的团队技术氛围,定期的前沿技术分享与业务探讨,每一位同学在这里都能获得技术深度和业务广度的积累和成长 -研究数据挖掘或统计学习领域的前沿技术,并用于实际问题的解决和优化 -大规模机器学习算法研究及并行化实现,为各种大规模机器学习应用研发核心技术 -通过对数据的敏锐洞察,深入挖掘产品潜在价值和需求,进而提供更有价值的产品和服务,通过技术创新推动产品成长
包括英文材料
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
数据挖掘+
https://www.youtube.com/watch?v=-bSkREem8dM
Database vs Data Warehouse vs Data Lake
https://www.youtube.com/watch?v=7rs0i-9nOjo
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
信息检索+
https://nlp.stanford.edu/IR-book/information-retrieval-book.html
Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008.
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
网络编程+
https://www.youtube.com/watch?v=2HrYIl6GpYg
I will make a simple HTTP web server with the C Programming Language.
https://www.youtube.com/watch?v=8z6okCgdREo
This tutorial is for Gophers who have written a command line or an API application, but have little to no experience in lower-level concepts like reading and writing to sockets, working with channels, and managing multiple goroutines.
https://www.youtube.com/watch?v=bdIiTxtMaKA&list=PL9IEJIKnBJjH_zM5LnovnoaKlXML5qh17
https://www.youtube.com/watch?v=bzja9fQWzdA
Implement the ubiquitous TCP protocol that underlies much of the traffic on the internet!
[英文] 📺Network Programming with Python Course (build a port scanner, mailing client, chat room, DDOS)
https://www.youtube.com/watch?v=FGdiSJakIS4
Learn network programming in Python by building four projects. You will learn to build a mailing client, a DDOS script, a port scanner, and a TCP Chat Room.
https://www.youtube.com/watch?v=gntyAFoZp-E
https://www.youtube.com/watch?v=JiuouCJQzSQ
Explore the fundamentals of networking in Rust by building a simple TCP server.
https://www.youtube.com/watch?v=JRTLSxGf_6w
https://www.youtube.com/watch?v=sFizpxHkIlI
In this video we'll cover SOCKET PROGRAMMING in JAVA.
https://www.youtube.com/watch?v=sXW_sNGvqcU
多线程+
https://liaoxuefeng.com/books/java/threading/basic/index.html
和单线程相比,多线程编程的特点在于:多线程经常需要读写共享数据,并且需要同步。
https://www.youtube.com/watch?v=_uQgGS_VIXM&list=PLsc-VaxfZl4do3Etp_xQ0aQBoC-x5BIgJ
https://www.youtube.com/watch?v=IEEhzQoKtQU
https://www.youtube.com/watch?v=mTGdtC9f4EU&list=PLL8woMHwr36EDxjUoCzboZjedsnhLP1j4
https://www.youtube.com/watch?v=TPVH_coGAQs&list=PLk6CEY9XxSIAeK-EAh3hB4fgNvYkYmghp
https://www.youtube.com/watch?v=xPqnoB2hjjA
This video is an introduction to multithreading in modern C++.
https://www.youtube.com/watch?v=YKBwKy5PrpQ
Rust threading is easy to implement and improves the efficiency of your applications on multi-core systems!
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
相关职位
社招1年以上MEG
-研究数据挖掘或统计学习领域的前沿技术,并用于实际问题的解决和优化 -大规模机器学习算法研究及并行化实现,为各种大规模机器学习应用研发核心技术 -通过对数据的敏锐洞察,深入挖掘产品潜在价值和需求,进而提供更有价值的产品和服务,通过技术创新推动产品成长 -负责对搜索、商业广告、运营活动、电商交易等场景风控策略及模型进行迭代、研究与探索,建设业内领先的反作弊、风控算法系统
更新于 2024-08-14
社招技术类
1.迭代召回及相关性算法能力,深入理解用户意图、挖掘广告内容信息,提升广告匹配效率 2.优化点击率、转化率模型效果,利用丰富的内容和用户行为数据,并结合实际业务场景,提升模型预估准确度 3.优化广告策略算法建设,深入理解广告机制,在智能出价、拍卖机制等方向上迭代策略,提升广告主投放体验 4.跟踪学习相关领域前沿进展,探索新技术在实际业务场景中的落地
更新于 2025-03-31
社招技术类
1. 迭代召回模型,提升个性化能力 2. 迭代相关性模型/Query意图模型,深入理解用户意图,平衡相关性和效率 3. 优化视频内容的多模态模型,深入理解视频内容信息,服务全链路算法模块 4. 优化点击率、各类转化率、时长模型效果,提升模型的个性化能力,优化准度 5. 优化全链路排序策略,更好的平衡多目标,提升搜索结果页质量以及长期目标,如搜索渗透和留存
更新于 2025-03-31