[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"similar-dipanjanS--practical-machine-learning-with-python":3,"tool-dipanjanS--practical-machine-learning-with-python":64},[4,17,27,35,44,52],{"id":5,"name":6,"github_repo":7,"description_zh":8,"stars":9,"difficulty_score":10,"last_commit_at":11,"category_tags":12,"status":16},3808,"stable-diffusion-webui","AUTOMATIC1111\u002Fstable-diffusion-webui","stable-diffusion-webui 是一个基于 Gradio 构建的网页版操作界面，旨在让用户能够轻松地在本地运行和使用强大的 Stable Diffusion 图像生成模型。它解决了原始模型依赖命令行、操作门槛高且功能分散的痛点，将复杂的 AI 绘图流程整合进一个直观易用的图形化平台。\n\n无论是希望快速上手的普通创作者、需要精细控制画面细节的设计师，还是想要深入探索模型潜力的开发者与研究人员，都能从中获益。其核心亮点在于极高的功能丰富度：不仅支持文生图、图生图、局部重绘（Inpainting）和外绘（Outpainting）等基础模式，还独创了注意力机制调整、提示词矩阵、负向提示词以及“高清修复”等高级功能。此外，它内置了 GFPGAN 和 CodeFormer 等人脸修复工具，支持多种神经网络放大算法，并允许用户通过插件系统无限扩展能力。即使是显存有限的设备，stable-diffusion-webui 也提供了相应的优化选项，让高质量的 AI 艺术创作变得触手可及。",162132,3,"2026-04-05T11:01:52",[13,14,15],"开发框架","图像","Agent","ready",{"id":18,"name":19,"github_repo":20,"description_zh":21,"stars":22,"difficulty_score":23,"last_commit_at":24,"category_tags":25,"status":16},1381,"everything-claude-code","affaan-m\u002Feverything-claude-code","everything-claude-code 是一套专为 AI 编程助手（如 Claude Code、Codex、Cursor 等）打造的高性能优化系统。它不仅仅是一组配置文件，而是一个经过长期实战打磨的完整框架，旨在解决 AI 代理在实际开发中面临的效率低下、记忆丢失、安全隐患及缺乏持续学习能力等核心痛点。\n\n通过引入技能模块化、直觉增强、记忆持久化机制以及内置的安全扫描功能，everything-claude-code 能显著提升 AI 在复杂任务中的表现，帮助开发者构建更稳定、更智能的生产级 AI 代理。其独特的“研究优先”开发理念和针对 Token 消耗的优化策略，使得模型响应更快、成本更低，同时有效防御潜在的攻击向量。\n\n这套工具特别适合软件开发者、AI 研究人员以及希望深度定制 AI 工作流的技术团队使用。无论您是在构建大型代码库，还是需要 AI 协助进行安全审计与自动化测试，everything-claude-code 都能提供强大的底层支持。作为一个曾荣获 Anthropic 黑客大奖的开源项目，它融合了多语言支持与丰富的实战钩子（hooks），让 AI 真正成长为懂上",140436,2,"2026-04-05T23:32:43",[13,15,26],"语言模型",{"id":28,"name":29,"github_repo":30,"description_zh":31,"stars":32,"difficulty_score":23,"last_commit_at":33,"category_tags":34,"status":16},2271,"ComfyUI","Comfy-Org\u002FComfyUI","ComfyUI 是一款功能强大且高度模块化的视觉 AI 引擎，专为设计和执行复杂的 Stable Diffusion 图像生成流程而打造。它摒弃了传统的代码编写模式，采用直观的节点式流程图界面，让用户通过连接不同的功能模块即可构建个性化的生成管线。\n\n这一设计巧妙解决了高级 AI 绘图工作流配置复杂、灵活性不足的痛点。用户无需具备编程背景，也能自由组合模型、调整参数并实时预览效果，轻松实现从基础文生图到多步骤高清修复等各类复杂任务。ComfyUI 拥有极佳的兼容性，不仅支持 Windows、macOS 和 Linux 全平台，还广泛适配 NVIDIA、AMD、Intel 及苹果 Silicon 等多种硬件架构，并率先支持 SDXL、Flux、SD3 等前沿模型。\n\n无论是希望深入探索算法潜力的研究人员和开发者，还是追求极致创作自由度的设计师与资深 AI 绘画爱好者，ComfyUI 都能提供强大的支持。其独特的模块化架构允许社区不断扩展新功能，使其成为当前最灵活、生态最丰富的开源扩散模型工具之一，帮助用户将创意高效转化为现实。",107662,"2026-04-03T11:11:01",[13,14,15],{"id":36,"name":37,"github_repo":38,"description_zh":39,"stars":40,"difficulty_score":10,"last_commit_at":41,"category_tags":42,"status":16},4292,"Deep-Live-Cam","hacksider\u002FDeep-Live-Cam","Deep-Live-Cam 是一款专注于实时换脸与视频生成的开源工具，用户仅需一张静态照片，即可通过“一键操作”实现摄像头画面的即时变脸或制作深度伪造视频。它有效解决了传统换脸技术流程繁琐、对硬件配置要求极高以及难以实时预览的痛点，让高质量的数字内容创作变得触手可及。\n\n这款工具不仅适合开发者和技术研究人员探索算法边界，更因其极简的操作逻辑（仅需三步：选脸、选摄像头、启动），广泛适用于普通用户、内容创作者、设计师及直播主播。无论是为了动画角色定制、服装展示模特替换，还是制作趣味短视频和直播互动，Deep-Live-Cam 都能提供流畅的支持。\n\n其核心技术亮点在于强大的实时处理能力，支持口型遮罩（Mouth Mask）以保留使用者原始的嘴部动作，确保表情自然精准；同时具备“人脸映射”功能，可同时对画面中的多个主体应用不同面孔。此外，项目内置了严格的内容安全过滤机制，自动拦截涉及裸露、暴力等不当素材，并倡导用户在获得授权及明确标注的前提下合规使用，体现了技术发展与伦理责任的平衡。",88924,"2026-04-06T03:28:53",[13,14,15,43],"视频",{"id":45,"name":46,"github_repo":47,"description_zh":48,"stars":49,"difficulty_score":23,"last_commit_at":50,"category_tags":51,"status":16},3704,"NextChat","ChatGPTNextWeb\u002FNextChat","NextChat 是一款轻量且极速的 AI 助手，旨在为用户提供流畅、跨平台的大模型交互体验。它完美解决了用户在多设备间切换时难以保持对话连续性，以及面对众多 AI 模型不知如何统一管理的痛点。无论是日常办公、学习辅助还是创意激发，NextChat 都能让用户随时随地通过网页、iOS、Android、Windows、MacOS 或 Linux 端无缝接入智能服务。\n\n这款工具非常适合普通用户、学生、职场人士以及需要私有化部署的企业团队使用。对于开发者而言，它也提供了便捷的自托管方案，支持一键部署到 Vercel 或 Zeabur 等平台。\n\nNextChat 的核心亮点在于其广泛的模型兼容性，原生支持 Claude、DeepSeek、GPT-4 及 Gemini Pro 等主流大模型，让用户在一个界面即可自由切换不同 AI 能力。此外，它还率先支持 MCP（Model Context Protocol）协议，增强了上下文处理能力。针对企业用户，NextChat 提供专业版解决方案，具备品牌定制、细粒度权限控制、内部知识库整合及安全审计等功能，满足公司对数据隐私和个性化管理的高标准要求。",87618,"2026-04-05T07:20:52",[13,26],{"id":53,"name":54,"github_repo":55,"description_zh":56,"stars":57,"difficulty_score":23,"last_commit_at":58,"category_tags":59,"status":16},2268,"ML-For-Beginners","microsoft\u002FML-For-Beginners","ML-For-Beginners 是由微软推出的一套系统化机器学习入门课程，旨在帮助零基础用户轻松掌握经典机器学习知识。这套课程将学习路径规划为 12 周，包含 26 节精炼课程和 52 道配套测验，内容涵盖从基础概念到实际应用的完整流程，有效解决了初学者面对庞大知识体系时无从下手、缺乏结构化指导的痛点。\n\n无论是希望转型的开发者、需要补充算法背景的研究人员，还是对人工智能充满好奇的普通爱好者，都能从中受益。课程不仅提供了清晰的理论讲解，还强调动手实践，让用户在循序渐进中建立扎实的技能基础。其独特的亮点在于强大的多语言支持，通过自动化机制提供了包括简体中文在内的 50 多种语言版本，极大地降低了全球不同背景用户的学习门槛。此外，项目采用开源协作模式，社区活跃且内容持续更新，确保学习者能获取前沿且准确的技术资讯。如果你正寻找一条清晰、友好且专业的机器学习入门之路，ML-For-Beginners 将是理想的起点。",84991,"2026-04-05T10:45:23",[14,60,43,61,15,62,26,13,63],"数据工具","插件","其他","音频",{"id":65,"github_repo":66,"name":67,"description_en":68,"description_zh":69,"ai_summary_zh":69,"readme_en":70,"readme_zh":71,"quickstart_zh":72,"use_case_zh":73,"hero_image_url":74,"owner_login":75,"owner_name":76,"owner_avatar_url":77,"owner_bio":78,"owner_company":79,"owner_location":79,"owner_email":79,"owner_twitter":79,"owner_website":79,"owner_url":80,"languages":81,"stars":94,"forks":95,"last_commit_at":96,"license":97,"difficulty_score":23,"env_os":98,"env_gpu":98,"env_ram":98,"env_deps":99,"category_tags":112,"github_topics":113,"view_count":23,"oss_zip_url":79,"oss_zip_packed_at":79,"status":16,"created_at":128,"updated_at":129,"faqs":130,"releases":161},4203,"dipanjanS\u002Fpractical-machine-learning-with-python","practical-machine-learning-with-python","Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.","practical-machine-learning-with-python 是一套源自同名畅销书的全开源代码库，旨在帮助学习者掌握利用 Python 生态解决现实世界复杂问题的核心技能。面对海量且分散的网络资源，初学者往往难以系统性地入门机器学习与深度学习，该项目通过提供书中所有的代码、笔记和实例，填补了理论与实践之间的鸿沟，让用户能够直接动手构建智能系统。\n\n它非常适合希望从理论走向实战的开发者、数据科学家以及相关专业学生。无论是需要处理数据分析、自然语言处理还是构建深度神经网络，用户都能从中找到对应的解决方案。其独特的技术亮点在于“做中学”的理念，不仅涵盖了 scikit-learn、pandas、TensorFlow、Keras、spaCy 等主流前沿框架，还通过真实的案例研究，将复杂的算法概念抽象化，重点展示如何思考、设计并成功执行完整的机器学习项目。如果你渴望在大数据与人工智能领域建立扎实的专业能力，practical-machine-learning-with-python 将是你不可或缺的实践指南。","# Practical Machine Learning with Python\n### A Problem-Solver's Guide to Building Real-World Intelligent Systems\n\n*\"Data is the new oil\"* is a saying which you must have heard by now along with the huge interest building up around Big Data and Machine Learning in the recent past along with Artificial Intelligence and Deep Learning. Besides this, data scientists have been termed as having *\"The sexiest job in the 21st Century\"* which makes it all the more worthwhile to build up some valuable expertise in these areas. Getting started with machine learning in the real world can be overwhelming with the vast amount of resources out there on the web.\n\n[*__\"Practical Machine Learning with Python\"__*](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python#contents)  follows a structured and comprehensive three-tiered approach packed with concepts, methodologies, hands-on examples, and code. This book is packed with over 500 pages of useful information which helps its readers master the essential skills needed to recognize and solve complex problems with Machine Learning and Deep Learning by following a data-driven mindset. By using real-world case studies that leverage the popular Python Machine Learning ecosystem, this book is your perfect companion for learning the art and science of Machine Learning to become a successful practitioner. The concepts, techniques, tools, frameworks, and methodologies used in this book will teach you how to think, design, build, and execute Machine Learning systems and projects successfully.\n\nThis repository contains all the code, notebooks and examples used in this book. We will also be adding bonus content here from time to time. So keep watching this space!\n\n## Get the book \n\u003Cdiv>\n\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.apress.com\u002Fus\u002Fbook\u002F9781484232064\">\n  \u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_a9fc0e577539.png\" alt=\"apress\" align=\"left\"\u002F>\n\u003C\u002Fa>\n\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fwww.springer.com\u002Fus\u002Fbook\u002F9781484232064\">\n  \u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_9e67285f3e31.png\" alt=\"springer\" align=\"left\"\u002F>\n\u003C\u002Fa>\n\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.amazon.com\u002FPractical-Machine-Learning-Python-Problem-Solvers\u002Fdp\u002F1484232062\u002Fref=sr_1_10?ie=UTF8&qid=1513756537&sr=8-10&keywords=practical+machine+learning+with+python\">\n  \u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_cb984ada4c6b.jpg\" alt=\"amazon\" align=\"left\"\u002F>\n\u003C\u002Fa>\n\u003Cbr>\n\u003C\u002Fdiv>\n\u003Cbr>\n\u003Cdiv>\n\u003C\u002Fdiv>\n\u003Cbr>\u003Cbr>\n\n## About the book \n\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.amazon.com\u002FPractical-Machine-Learning-Python-Problem-Solvers\u002Fdp\u002F1484232062\u002Fref=sr_1_10?ie=UTF8&qid=1513756537&sr=8-10&keywords=practical+machine+learning+with+python\">\n  \u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_d8b750d63e62.jpg\" alt=\"Book Cover\" width=\"250\" align=\"left\"\u002F>\n\u003C\u002Fa>\n\nMaster the essential skills needed to recognize and solve complex problems with machine learning and deep learning. Using real-world examples that leverage the popular Python machine learning ecosystem, this book is your perfect companion for learning the art and science of machine learning to become a successful practitioner. The concepts, techniques, tools, frameworks, and methodologies used in this book will teach you how to think, design, build, and execute machine learning systems and projects successfully. \n\nWe focus on leveraging the latest state-of-the-art data analysis, machine learning and deep learning frameworks including [`scikit-learn`](http:\u002F\u002Fscikit-learn.org\u002Fstable\u002F), [`pandas`](https:\u002F\u002Fpandas.pydata.org\u002F), [`statsmodels`](http:\u002F\u002Fwww.statsmodels.org\u002Fstable\u002Findex.html), [`spaCy`](https:\u002F\u002Fspacy.io\u002F), [`nltk`](http:\u002F\u002Fwww.nltk.org\u002F), [`gensim`](https:\u002F\u002Fradimrehurek.com\u002Fgensim\u002F), [`tensorflow`](https:\u002F\u002Fwww.tensorflow.org\u002F), [`keras`](https:\u002F\u002Fkeras.io\u002F), [`skater`](https:\u002F\u002Fwww.datascience.com\u002Fresources\u002Ftools\u002Fskater) and several others to process, wrangle, analyze, visualize and model on real-world datasets and problems! With a learn-by-doing approach, we try to abstract out complex theory and concepts (while presenting the essentials wherever necessary), which often tends to hold back practitioners from leveraging the true power of machine learning to solve their own problems.\n\n\u003Cdiv style='font-size:0.5em;'>\u003Csup>\nEdition: 1st &emsp; Pages: 532 &emsp; Language: English\u003Cbr\u002F>\n Book Title: Practical Machine Learning with Python &emsp; Publisher: Apress (a part of Springer) &emsp; Copyright: Dipanjan Sarkar, Raghav Bali, Tushar Sharma\u003Cbr\u002F>  \n Print ISBN: 978-1-4842-3206-4 &emsp; Online ISBN: 978-1-4842-3207-1 &emsp; DOI: 10.1007\u002F978-1-4842-3207-1\u003Cbr\u002F>\n\u003C\u002Fdiv>\n\u003Cbr>\n\n[*__Practical Machine Learning with Python__*](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python#contents) follows a structured and comprehensive three-tiered approach packed with hands-on examples and code.\n\n - [__Part 1__](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python#contents) focuses on understanding machine learning concepts and tools. This includes machine learning basics with a broad overview of algorithms, techniques, concepts and applications, followed by a tour of the entire Python machine learning ecosystem. Brief guides for useful machine learning tools, libraries and frameworks are also covered.\n\n - [__Part 2__](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python#contents) details standard machine learning pipelines, with an emphasis on data processing analysis, feature engineering, and modeling. You will learn how to process, wrangle, summarize and visualize data in its various forms. Feature engineering and selection methodologies will be covered in detail with real-world datasets followed by model building, tuning, interpretation and deployment.\n\n - [__Part 3__](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python#contents) explores multiple real-world case studies spanning diverse domains and industries like *retail*, *transportation*, *movies*, *music*, *marketing*, *computer vision* and *finance*. For each case study, you will learn the application of various machine learning techniques and methods. The hands-on examples will help you become familiar with state-of-the-art machine learning tools and techniques and understand what algorithms are best suited for any problem.\n\n[*__Practical Machine Learning with Python__*](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python#contents) will empower you to start solving your own problems with machine learning today!\n\u003Cbr>\n\n## [Contents](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks#book-contents)  \n\n - [__Part I: Understanding Machine Learning__](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks#part-i-understanding-machine-learning)\n    - [Chapter 1: Machine Learning Basics](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh01_Machine_Learning_Basics#chapter-1-machine-learning-basics)\n    - [Chapter 2: The Python Machine Learning Ecosystem](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh02_The_Python_ML_Ecosystem#chapter-2-the-python-machine-learning-ecosystem)\n - [__Part II: The Machine Learning Pipeline__](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks#part-ii-the-machine-learning-pipeline)\n    - [Chapter 3: Processing, Wrangling and Visualizing Data](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh03_Processing_Wrangling_and_Visualizing_Data#chapter-3-processing-wrangling-and-visualizing-data)\n    - [Chapter 4: Feature Engineering and Selection](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh04_Feature_Engineering_and_Selection#chapter-4-feature-engineering-and-selection)\n    - [Chapter 5: Building, Tuning and Deploying Models](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh05_Building_Tuning_and_Deploying_Models#chapter-5-building-tuning-and-deploying-models)\n - [__Part III: Real-World Case Studies__](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks#part-iii-real-world-case-studies)\n    - [Chapter 6: Analyzing Bike Sharing Trends](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh06_Analyzing_Bike_Sharing_Trends#chapter-6-analyzing-bike-sharing-trends)\n    - [Chapter 7: Analyzing Movie Reviews Sentiment](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh07_Analyzing_Movie_Reviews_Sentiment#chapter-7-analyzing-movie-reviews-sentiment)\n    - [Chapter 8: Customer Segmentation and Effective Cross Selling](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh08_Customer_Segmentation_and_Effective_Cross_Selling#chapter-8-customer-segmentation-and-effective-cross-selling)\n    - [Chapter 9: Analyzing Wine Types and Quality](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh09_Analyzing_Wine_Types_and_Quality#chapter-9-analyzing-wine-types-and-quality)\n    - [Chapter 10: Analyzing Music Trends and Recommendations](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh10_Analyzing_Music_Trends_and_Recommendations#chapter-10-analyzing-music-trends-and-recommendations)\n    - [Chapter 11: Forecasting Stock and Commodity Prices](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh11_Forecasting_Stock_and_Commodity_Prices#chapter-11-forecasting-stock-and-commodity-prices)\n    - [Chapter 12: Deep Learning for Computer Vision](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh12_Deep_Learning_for_Computer_Vision#chapter-12-deep-learning-for-computer-vision)\n\n## What You'll Learn\n\n - Execute end-to-end machine learning projects and systems\n - Implement hands-on examples with industry standard, open source, robust machine learning tools and frameworks\n - Review case studies depicting applications of machine learning and deep learning on diverse domains and industries\n - Apply a wide range of machine learning models including regression, classification, and clustering.\n - Understand and apply the latest models and methodologies from deep learning including CNNs, RNNs, LSTMs and transfer learning.\n \n## Powered by the following Frameworks\n\n| | | | | |\n:---:|:---:|:---:|:---:|:---:\n\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fanaconda.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_dfdac1cde43d.jpg\" alt=\"anaconda\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fjupyter.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_01456b676213.jpg\" alt=\"jupyter\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fwww.numpy.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_aeaee91ba255.jpg\" alt=\"numpy\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.scipy.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_6886876371e3.jpg\" alt=\"scipy\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fpandas.pydata.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_aad3731353ed.jpg\" alt=\"pandas\" \u002F>\u003C\u002Fa>\n\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fwww.statsmodels.org\u002Fstable\u002Findex.html\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_ed9fcf8a91d5.jpg\" alt=\"statsmodels\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fdocs.python-requests.org\u002Fen\u002Fmaster\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_9001126803d7.jpg\" alt=\"requests\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fwww.nltk.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_dcc01dcd454a.jpg\" alt=\"nltk\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fradimrehurek.com\u002Fgensim\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_28a624443ec6.jpg\" alt=\"gensim\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fspacy.io\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_3139f7d6a60e.jpg\" alt=\"spacy\" \u002F>\u003C\u002Fa>\n\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fscikit-learn.org\u002Fstable\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_aed3207a2161.jpg\" alt=\"scikit-learn\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.datascience.com\u002Fresources\u002Ftools\u002Fskater\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_3717f53dd46e.png\" alt=\"skater\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Ffacebook.github.io\u002Fprophet\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_4a6794a4177c.jpg\" alt=\"prophet\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fkeras.io\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_0c5af9ce2423.jpg\" alt=\"keras\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.tensorflow.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_782a076fb297.jpg\" alt=\"tensorflow\" \u002F>\u003C\u002Fa>\n\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fmatplotlib.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_226ef5d8b419.jpg\" alt=\"matplotlib\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Forange.biolab.si\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_ebfcac8ec83a.jpg\" alt=\"orange\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fseaborn.pydata.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_61e8bf432e95.jpg\" alt=\"seaborn\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fplot.ly\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_7501c761cc12.jpg\" alt=\"plotly\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.crummy.com\u002Fsoftware\u002FBeautifulSoup\u002Fbs4\u002Fdoc\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_faa699597a91.jpg\" alt=\"beautiful soup\" \u002F>\u003C\u002Fa>\n\u003Cbr>\n\n## Audience\n\nThis book has been specially written for IT professionals, analysts, developers, data scientists, engineers, graduate students and anyone with an interest to analyze and derive insights from data!\n\u003Cbr>\n\n## Acknowledgements\nTBA\n\u003Cbr>\n","# 使用 Python 的实用机器学习\n### 问题解决者指南：构建现实世界中的智能系统\n\n“数据是新的石油”这句名言，你一定早已耳闻。近年来，随着大数据、机器学习、人工智能和深度学习的兴起，相关领域的热度持续攀升。此外，数据科学家还被誉为“21世纪最具吸引力的职业”，这也使得在这些领域积累宝贵的专业知识显得尤为重要。然而，在海量网络资源面前，初入机器学习的世界可能会让人感到无所适从。\n\n[*《使用 Python 的实用机器学习》*](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python#contents) 采用了一种结构化且全面的三层方法，内容涵盖概念、方法论、动手示例和代码。全书超过500页，旨在帮助读者掌握利用数据驱动思维识别并解决复杂问题所需的核心技能。通过运用基于流行 Python 机器学习生态系统的实际案例研究，本书将成为你学习机器学习艺术与科学、成为一名成功实践者的完美伴侣。书中所介绍的概念、技术、工具、框架和方法将教会你如何思考、设计、构建并成功实施机器学习系统和项目。\n\n本仓库包含了本书中使用的所有代码、笔记本及示例。我们还将不时在此添加额外内容，请持续关注！\n\n## 购买本书\n\u003Cdiv>\n\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.apress.com\u002Fus\u002Fbook\u002F9781484232064\">\n  \u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_a9fc0e577539.png\" alt=\"apress\" align=\"left\"\u002F>\n\u003C\u002Fa>\n\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fwww.springer.com\u002Fus\u002Fbook\u002F9781484232064\">\n  \u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_9e67285f3e31.png\" alt=\"springer\" align=\"left\"\u002F>\n\u003C\u002Fa>\n\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.amazon.com\u002FPractical-Machine-Learning-Python-Problem-Solvers\u002Fdp\u002F1484232062\u002Fref=sr_1_10?ie=UTF8&qid=1513756537&sr=8-10&keywords=practical+machine+learning+with+python\">\n  \u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_cb984ada4c6b.jpg\" alt=\"amazon\" align=\"left\"\u002F>\n\u003C\u002Fa>\n\u003Cbr>\n\u003C\u002Fdiv>\n\u003Cbr>\n\u003Cdiv>\n\u003C\u002Fdiv>\n\u003Cbr>\u003Cbr>\n\n## 关于本书\n\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.amazon.com\u002FPractical-Machine-Learning-Python-Problem-Solvers\u002Fdp\u002F1484232062\u002Fref=sr_1_10?ie=UTF8&qid=1513756537&sr=8-10&keywords=practical+machine+learning+with+python\">\n  \u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_d8b750d63e62.jpg\" alt=\"图书封面\" width=\"250\" align=\"left\"\u002F>\n\u003C\u002Fa>\n\n掌握利用机器学习和深度学习识别并解决复杂问题所需的核心技能。本书以真实世界的案例为基础，充分利用流行的 Python 机器学习生态系统，是你学习机器学习艺术与科学、成为成功实践者的理想伙伴。书中所介绍的概念、技术、工具、框架和方法将教你如何思考、设计、构建并成功执行机器学习系统和项目。\n\n我们重点使用最新的前沿数据分析、机器学习和深度学习框架，包括 [`scikit-learn`](http:\u002F\u002Fscikit-learn.org\u002Fstable\u002F)、[`pandas`](https:\u002F\u002Fpandas.pydata.org\u002F)、[`statsmodels`](http:\u002F\u002Fwww.statsmodels.org\u002Fstable\u002Findex.html)、[`spaCy`](https:\u002F\u002Fspacy.io\u002F)、[`nltk`](http:\u002F\u002Fwww.nltk.org\u002F)、[`gensim`](https:\u002F\u002Fradimrehurek.com\u002Fgensim\u002F)、[`tensorflow`](https:\u002F\u002Fwww.tensorflow.org\u002F)、[`keras`](https:\u002F\u002Fkeras.io\u002F)、[`skater`](https:\u002F\u002Fwww.datascience.com\u002Fresources\u002Ftools\u002Fskater) 等，对真实世界的数据集和问题进行处理、清洗、分析、可视化和建模！秉持“边学边做”的理念，我们尽量简化复杂的理论和概念（同时在必要时呈现核心要点），从而避免因理论过于晦涩而阻碍从业者充分发挥机器学习的力量来解决自身问题。\n\n\u003Cdiv style='font-size:0.5em;'>\u003Csup>\n版本：第1版 &emsp; 页数：532 &emsp; 语言：英语\u003Cbr\u002F>\n 书名：使用 Python 的实用机器学习 &emsp; 出版社：Apress（Springer旗下） &emsp; 版权：Dipanjan Sarkar、Raghav Bali、Tushar Sharma\u003Cbr\u002F>  \n 印刷ISBN：978-1-4842-3206-4 &emsp; 在线ISBN：978-1-4842-3207-1 &emsp; DOI：10.1007\u002F978-1-4842-3207-1\u003Cbr\u002F>\n\u003C\u002Fdiv>\n\u003Cbr>\n\n[*《使用 Python 的实用机器学习》*](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python#contents) 采用了结构化且全面的三层方法，包含大量动手示例和代码。\n\n - [__第一部分__](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python#contents) 专注于理解机器学习的概念和工具。内容包括机器学习基础，对算法、技术、概念和应用进行广泛概述，随后深入探讨整个 Python 机器学习生态系统。此外，还提供了常用机器学习工具、库和框架的简要指南。\n\n - [__第二部分__](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python#contents) 详细介绍了标准的机器学习流程，重点在于数据处理与分析、特征工程和建模。你将学习如何处理、清洗、汇总和可视化各种形式的数据。针对真实数据集，我们将深入讲解特征工程与选择方法，随后进行模型构建、调优、解释和部署。\n\n - [__第三部分__](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python#contents) 探讨了多个跨不同领域和行业的实际案例，如零售、交通、电影、音乐、营销、计算机视觉和金融等。每个案例都将展示多种机器学习技术和方法的应用。通过这些动手示例，你将熟悉最先进的机器学习工具和技术，并了解哪些算法最适合解决特定问题。\n\n[*《使用 Python 的实用机器学习》*](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python#contents) 将赋能你，让你即刻开始用机器学习解决自己的问题！\n\u003Cbr>\n\n## [目录](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks#book-contents)  \n\n - [__第一部分：理解机器学习__](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks#part-i-understanding-machine-learning)\n    - [第1章：机器学习基础](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh01_Machine_Learning_Basics#chapter-1-machine-learning-basics)\n    - [第2章：Python机器学习生态系统](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh02_The_Python_ML_Ecosystem#chapter-2-the-python-machine-learning-ecosystem)\n - [__第二部分：机器学习流水线__](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks#part-ii-the-machine-learning-pipeline)\n    - [第3章：数据处理、清洗与可视化](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh03_Processing_Wrangling_and_Visualizing_Data#chapter-3-processing-wrangling-and-visualizing-data)\n    - [第4章：特征工程与选择](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh04_Feature_Engineering_and_Selection#chapter-4-feature-engineering-and-selection)\n    - [第5章：模型构建、调优与部署](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh05_Building_Tuning_and_Deploying_Models#chapter-5-building-tuning-and-deploying-models)\n - [__第三部分：真实世界案例研究__](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks#part-iii-real-world-case-studies)\n    - [第6章：分析共享单车使用趋势](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh06_Analyzing_Bike_Sharing_Trends#chapter-6-analyzing-bike-sharing-trends)\n    - [第7章：分析电影评论情感](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh07_Analyzing_Movie_Reviews_Sentiment#chapter-7-analyzing-movie-reviews-sentiment)\n    - [第8章：客户细分与高效交叉销售](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh08_Customer_Segmentation_and_Effective_Cross_Selling#chapter-8-customer-segmentation-and-effective-cross-selling)\n    - [第9章：分析葡萄酒类型与品质](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh09_Analyzing_Wine_Types_and_Quality#chapter-9-analyzing-wine-types-and-quality)\n    - [第10章：分析音乐趋势与推荐](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh10_Analyzing_Music_Trends_and_Recommendations#chapter-10-analyzing-music-trends-and-recommendations)\n    - [第11章：预测股票与大宗商品价格](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh11_Forecasting_Stock_and_Commodity_Prices#chapter-11-forecasting-stock-and-commodity-prices)\n    - [第12章：用于计算机视觉的深度学习](https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Ftree\u002Fmaster\u002Fnotebooks\u002FCh12_Deep_Learning_for_Computer_Vision#chapter-12-deep-learning-for-computer-vision)\n\n## 您将学到的内容\n\n - 执行端到端的机器学习项目和系统\n - 使用行业标准、开源且稳健的机器学习工具和框架进行实践\n - 复习展示机器学习和深度学习在不同领域和行业中应用的案例研究\n - 应用广泛的机器学习模型，包括回归、分类和聚类\n - 理解并应用来自深度学习的最新模型和方法，包括卷积神经网络、循环神经网络、长短期记忆网络以及迁移学习\n\n## 由以下框架支持\n\n| | | | | |\n:---:|:---:|:---:|:---:|:---:\n\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fanaconda.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_dfdac1cde43d.jpg\" alt=\"anaconda\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fjupyter.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_01456b676213.jpg\" alt=\"jupyter\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fwww.numpy.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_aeaee91ba255.jpg\" alt=\"numpy\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.scipy.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_6886876371e3.jpg\" alt=\"scipy\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fpandas.pydata.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_aad3731353ed.jpg\" alt=\"pandas\" \u002F>\u003C\u002Fa>\n\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fwww.statsmodels.org\u002Fstable\u002Findex.html\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_ed9fcf8a91d5.jpg\" alt=\"statsmodels\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fdocs.python-requests.org\u002Fen\u002Fmaster\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_9001126803d7.jpg\" alt=\"requests\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fwww.nltk.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_dcc01dcd454a.jpg\" alt=\"nltk\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fradimrehurek.com\u002Fgensim\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_28a624443ec6.jpg\" alt=\"gensim\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fspacy.io\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_3139f7d6a60e.jpg\" alt=\"spacy\" \u002F>\u003C\u002Fa>\n\u003Ca target=\"_blank\" href=\"http:\u002F\u002Fscikit-learn.org\u002Fstable\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_aed3207a2161.jpg\" alt=\"scikit-learn\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.datascience.com\u002Fresources\u002Ftools\u002Fskater\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_3717f53dd46e.png\" alt=\"skater\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Ffacebook.github.io\u002Fprophet\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_4a6794a4177c.jpg\" alt=\"prophet\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fkeras.io\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_0c5af9ce2423.jpg\" alt=\"keras\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.tensorflow.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_782a076fb297.jpg\" alt=\"tensorflow\" \u002F>\u003C\u002Fa>\n\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fmatplotlib.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_226ef5d8b419.jpg\" alt=\"matplotlib\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Forange.biolab.si\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_ebfcac8ec83a.jpg\" alt=\"orange\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fseaborn.pydata.org\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_61e8bf432e95.jpg\" alt=\"seaborn\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fplot.ly\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_7501c761cc12.jpg\" alt=\"plotly\" \u002F>\u003C\u002Fa>|\u003Ca target=\"_blank\" href=\"https:\u002F\u002Fwww.crummy.com\u002Fsoftware\u002FBeautifulSoup\u002Fbs4\u002Fdoc\u002F\">\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_readme_faa699597a91.jpg\" alt=\"beautiful soup\" \u002F>\u003C\u002Fa>\n\u003Cbr>\n\n## 目标读者\n\n本书专为IT专业人士、分析师、开发者、数据科学家、工程师、研究生以及任何对数据分析和洞察感兴趣的人士而撰写！\n\u003Cbr>\n\n## 致谢\n待定\n\u003Cbr>","# Practical Machine Learning with Python 快速上手指南\n\n本指南基于《Practical Machine Learning with Python》开源项目，旨在帮助开发者快速搭建环境并运行书中的实战案例。该项目涵盖了从数据清洗、特征工程到模型构建与部署的全流程，涉及零售、金融、计算机视觉等多个领域的真实案例。\n\n## 环境准备\n\n在开始之前，请确保您的系统满足以下要求：\n\n*   **操作系统**：Windows, macOS 或 Linux\n*   **Python 版本**：推荐 Python 3.6 及以上版本（书中代码主要基于 Python 3 生态）\n*   **包管理器**：推荐使用 `conda` (Anaconda\u002FMiniconda) 或 `pip`\n*   **开发工具**：Jupyter Notebook 或 JupyterLab（书中示例主要以 Notebook 形式呈现）\n\n**核心依赖库**：\n本项目依赖强大的 Python 数据科学生态，主要包括：\n`pandas`, `numpy`, `scikit-learn`, `matplotlib`, `seaborn`, `statsmodels`, `nltk`, `gensim`, `spacy`, `tensorflow`, `keras` 等。\n\n## 安装步骤\n\n### 1. 克隆项目仓库\n首先，将包含所有代码、数据集和 Notebooks 的仓库克隆到本地。\n\n```bash\ngit clone https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python.git\ncd practical-machine-learning-with-python\n```\n\n### 2. 创建虚拟环境（推荐）\n为了避免依赖冲突，建议创建一个独立的虚拟环境。\n\n**使用 Conda (推荐，尤其适合国内用户配置镜像源):**\n```bash\n# 创建名为 ml-book 的环境，指定 Python 版本\nconda create -n ml-book python=3.8\n\n# 激活环境\nconda activate ml-book\n\n# 【可选】配置清华镜像源以加速下载\nconda config --add channels https:\u002F\u002Fmirrors.tuna.tsinghua.edu.cn\u002Fanaconda\u002Fpkgs\u002Fmain\u002F\nconda config --add channels https:\u002F\u002Fmirrors.tuna.tsinghua.edu.cn\u002Fanaconda\u002Fpkgs\u002Ffree\u002F\nconda config --set show_channel_urls yes\n```\n\n**使用 Pip:**\n```bash\npython -m venv ml-book-env\n# Windows:\nml-book-env\\Scripts\\activate\n# macOS\u002FLinux:\nsource ml-book-env\u002Fbin\u002Factivate\n```\n\n### 3. 安装依赖\n进入项目目录后，安装所需的第三方库。由于依赖较多，建议分步安装或使用 requirements 文件（如果项目中存在）。若项目中未提供完整的 `requirements.txt`，可安装核心科学计算栈：\n\n**使用 Conda 安装核心库（速度快，编译好二进制包）：**\n```bash\nconda install pandas numpy scikit-learn matplotlib seaborn statsmodels jupyter notebook\nconda install -c conda-forge nltk gensim spacy\n# 深度学习框架可根据需求单独安装，例如：\nconda install tensorflow keras\n```\n\n**使用 Pip 安装（如需特定最新版本）：**\n```bash\npip install pandas numpy scikit-learn matplotlib seaborn statsmodels jupyterlab\npip install nltk gensim spacy requests\npip install tensorflow keras\n# 初始化 spaCy 模型（示例：英文模型）\npython -m spacy download en_core_web_sm\n```\n\n> **注意**：部分 NLP 库（如 `nltk`）首次使用时需要在代码中下载数据包，或在命令行运行 `python -m nltk.downloader all`。\n\n## 基本使用\n\n本项目的主要内容位于 `notebooks` 文件夹下，按章节组织了完整的机器学习流程案例。\n\n### 1. 启动 Jupyter Notebook\n在项目根目录下启动服务：\n\n```bash\njupyter notebook\n```\n或者使用 JupyterLab：\n```bash\njupyter lab\n```\n\n### 2. 运行第一个示例\n浏览器会自动打开界面。导航至 `notebooks` 目录，选择任意章节的 Notebook 文件开始学习。\n\n**示例：运行第一章基础概念 (Ch01)**\n1.  点击 `Ch01_Machine_Learning_Basics` 文件夹。\n2.  打开对应的 `.ipynb` 文件。\n3.  按顺序执行单元格（Cell），观察数据加载、预处理及简单模型的输出。\n\n**代码示例片段（源自书中典型流程）：**\n以下是一个典型的数据加载与简单线性回归的演示逻辑，您可以在新的 Notebook 单元格中尝试：\n\n```python\nimport pandas as pd\nfrom sklearn.model_selection import train_test_split\nfrom sklearn.linear_model import LinearRegression\nfrom sklearn.metrics import mean_squared_error\n\n# 1. 加载数据 (以书中常见的自行车共享数据为例，需确保数据文件在对应路径)\n# df = pd.read_csv('data\u002Fhour.csv') \n\n# 此处仅为演示结构，实际运行请参照具体章节 Notebook\ndata = {'feature': [1, 2, 3, 4, 5], 'target': [2, 4, 5, 4, 5]}\ndf = pd.DataFrame(data)\n\n# 2. 划分特征与目标\nX = df[['feature']]\ny = df['target']\n\n# 3. 拆分训练集和测试集\nX_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)\n\n# 4. 构建并训练模型\nmodel = LinearRegression()\nmodel.fit(X_train, y_train)\n\n# 5. 预测与评估\npredictions = model.predict(X_test)\nmse = mean_squared_error(y_test, predictions)\n\nprint(f\"Model Coefficients: {model.coef_}\")\nprint(f\"Mean Squared Error: {mse}\")\n```\n\n### 3. 探索实战案例\n完成基础学习后，建议直接进入 **Part III: Real-World Case Studies**：\n*   **情感分析**：查看 `Ch07_Analyzing_Movie_Reviews_Sentiment` 学习 NLP 处理。\n*   **客户细分**：查看 `Ch08_Customer_Segmentation_and_Effective_Cross_Selling` 学习聚类算法。\n*   **计算机视觉**：查看 `Ch12_Deep_Learning_for_Computer_Vision` 学习 CNN 与迁移学习。\n\n通过直接运行这些 Notebooks，您可以复现书中从数据清洗到模型部署的完整工业级流程。","某电商数据团队正试图构建一个智能推荐系统，以解决用户流失率高和商品转化率低的难题。\n\n### 没有 practical-machine-learning-with-python 时\n- **学习路径混乱**：面对网络上碎片化的教程，团队成员难以区分理论概念与实战代码，导致在环境配置和基础库选择上浪费大量时间。\n- **缺乏端到端案例**：手头只有孤立的算法片段，缺乏从数据清洗、特征工程到模型部署的完整真实业务场景参考，无法直接复用于电商数据。\n- **黑盒模型难解释**：构建了深度学习模型却无法向业务部门解释预测逻辑，缺乏如 `skater` 等模型解释工具的实际应用指导，导致项目难以落地。\n- **技术栈整合困难**：难以将 `pandas`、`scikit-learn`、`tensorflow` 等多个流行框架有机串联，代码结构松散，维护成本极高。\n\n### 使用 practical-machine-learning-with-python 后\n- **结构化技能掌握**：依托书中三层递进式方法论，团队快速掌握了利用 Python 生态系统解决复杂问题的核心技能，显著缩短了上手周期。\n- **真实案例直接复用**：直接参考书中基于真实数据集的案例研究，快速搭建起包含数据预处理、建模及评估的完整推荐系统流水线。\n- **模型透明可解释**：运用书中介绍的 `skater` 等框架对模型进行归因分析，清晰展示推荐逻辑，成功获得业务部门的信任与支持。\n- **全栈工具链打通**：学会了如何协同使用 `spaCy` 处理文本评论、`keras` 构建深度网络及 `pandas` 进行数据清洗，形成了规范高效的工程化代码体系。\n\npractical-machine-learning-with-python 通过提供系统的思维框架与丰富的实战代码，帮助开发者从“纸上谈兵”迅速转型为能独立构建并落地智能系统的实战专家。","https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002FdipanjanS_practical-machine-learning-with-python_2017377a.png","dipanjanS","Dipanjan (DJ) Sarkar","https:\u002F\u002Foss.gittoolsai.com\u002Favatars\u002FdipanjanS_e04e813c.jpg","Data Science Lead, Google Dev Expert - ML, Author",null,"https:\u002F\u002Fgithub.com\u002FdipanjanS",[82,86,90],{"name":83,"color":84,"percentage":85},"Jupyter Notebook","#DA5B0B",99.3,{"name":87,"color":88,"percentage":89},"Python","#3572A5",0.7,{"name":91,"color":92,"percentage":93},"HTML","#e34c26",0,2374,1655,"2026-04-02T08:35:46","Apache-2.0","未说明",{"notes":100,"python":98,"dependencies":101},"本项目为配套书籍《Practical Machine Learning with Python》的代码仓库，涵盖机器学习基础、数据预处理、特征工程及深度学习（计算机视觉等）案例。文中提及使用 Anaconda 和 Jupyter，建议通过 Anaconda 管理环境。由于涉及 TensorFlow\u002FKeras 深度学习内容，实际运行可能需要 GPU 支持，但 README 未明确具体硬件指标。",[102,103,104,105,106,107,108,109,110,111],"scikit-learn","pandas","statsmodels","spaCy","nltk","gensim","tensorflow","keras","skater","numpy",[13,26,14],[114,115,116,117,118,119,120,121,106,102,122,123,124,108,109,104,103,125,126,127],"machine-learning","deep-learning","python","classification","clustering","natural-language-processing","computer-vision","spacy","prophet","time-series-analysis","convolutional-neural-networks","jupyter","notebook","jupyter-notebook","2026-03-27T02:49:30.150509","2026-04-06T14:01:29.846293",[131,136,141,146,151,156],{"id":132,"question_zh":133,"answer_zh":134,"source_url":135},19143,"为什么下载 CSV 文件时得到的是 Git LFS 指针信息而不是实际文件内容？","这通常是因为仓库的 Git LFS 带宽或存储配额已用尽。维护者已购买新计划解决此问题，如果再次遇到，请尝试重新拉取代码或检查网络连接。错误信息通常包含 'oid sha256' 和 'size' 等字段，表明下载的是指针而非二进制大文件。","https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Fissues\u002F1",{"id":137,"question_zh":138,"answer_zh":139,"source_url":140},19144,"运行 pyLDAvis 可视化主题模型时没有任何显示或报错，如何解决？","在 Jupyter Notebook 中使用时，需要先执行 `pyLDAvis.enable_notebook()` 启用交互式支持。如果仍然无法显示，可以尝试显式调用 `pyLDAvis.show(data)`。完整代码如下：\n```python\nimport pyLDAvis\npyLDAvis.enable_notebook()\ndata = pyLDAvis.sklearn.prepare(pos_nmf, ptvf_features, ptvf, R=15)\npyLDAvis.show(data)\n```\n如果遇到 'invalid value encountered in multiply' 警告，通常是数据预处理问题，请确保输入矩阵没有无效值。","https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Fissues\u002F2",{"id":142,"question_zh":143,"answer_zh":144,"source_url":145},19145,"在使用 sklearn 的 scaler 进行时间序列数据处理时，遇到 'ValueError: Expected 2D array, got 1D array instead' 错误怎么办？","这是因为不同版本的 pandas 或 sklearn 对数组维度要求不同。需要将一维数组重塑为二维数组。解决方法是在调用 `fit_transform` 或 `inverse_transform` 前使用 `.reshape(-1, 1)`。例如：\n```python\nscaled_stock_series = scaler.fit_transform(time_series.reshape(-1, 1))\n# 逆变换时\ntrainPredict = scaler.inverse_transform(trainPredict.reshape(-1, 1))\n```","https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Fissues\u002F5",{"id":147,"question_zh":148,"answer_zh":149,"source_url":150},19146,"导入 arima_utils 或 lstm_utils 模块时出现 'ModuleNotFoundError: No module named ...' 错误，如何修复？","`arima_utils` 和 `lstm_utils` 不是标准的 Python 库，无法通过 pip 安装。它们是本书配套的自定义工具模块。解决方法是下载对应章节的完整代码文件夹（例如第 11 章），确保笔记本文件与这些 `.py` 工具文件在同一目录下，或者将该目录添加到 Python 路径中。下载地址通常在书籍对应的 GitHub 仓库的 notebooks 文件夹中。","https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Fissues\u002F16",{"id":152,"question_zh":153,"answer_zh":154,"source_url":155},19147,"Notebook 中导入 time_series_utils 失败，提示找不到模块，应该导入什么？","部分 Notebook 文件中的导入语句可能需要更新。如果看到 `from time_series_utils import ...` 报错，请尝试将其更改为 `from arima_utils import ...`。具体的函数如 `ad_fuller_test`, `plot_rolling_stats`, `plot_acf_pacf`, `arima_gridsearch_cv` 均位于 `arima_utils` 模块中。","https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Fissues\u002F8",{"id":157,"question_zh":158,"answer_zh":159,"source_url":160},19148,"在使用 normalize_corpus 函数清洗文本后，为什么结果中仍然包含 '\u003Cbr \u002F>' 标签？","这通常是因为针对特定的情感分析模型（如 VADER），代码故意保留了部分原始格式或仅进行了基础预处理，未执行去除 HTML 标签或词形还原等操作。某些无监督词典情感分析框架可以直接处理这些标记，或者在该特定示例中，预处理步骤被简化以展示特定流程。如果需要彻底清洗，需手动添加去除 HTML 标签的步骤。","https:\u002F\u002Fgithub.com\u002FdipanjanS\u002Fpractical-machine-learning-with-python\u002Fissues\u002F4",[]]