[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"similar-camoverride--lit":3,"tool-camoverride--lit":65},[4,18,32,41,49,57],{"id":5,"name":6,"github_repo":7,"description_zh":8,"stars":9,"difficulty_score":10,"last_commit_at":11,"category_tags":12,"status":17},4358,"openclaw","openclaw\u002Fopenclaw","OpenClaw 是一款专为个人打造的本地化 AI 助手，旨在让你在自己的设备上拥有完全可控的智能伙伴。它打破了传统 AI 助手局限于特定网页或应用的束缚，能够直接接入你日常使用的各类通讯渠道，包括微信、WhatsApp、Telegram、Discord、iMessage 等数十种平台。无论你在哪个聊天软件中发送消息，OpenClaw 都能即时响应，甚至支持在 macOS、iOS 和 Android 设备上进行语音交互，并提供实时的画布渲染功能供你操控。\n\n这款工具主要解决了用户对数据隐私、响应速度以及“始终在线”体验的需求。通过将 AI 部署在本地，用户无需依赖云端服务即可享受快速、私密的智能辅助，真正实现了“你的数据，你做主”。其独特的技术亮点在于强大的网关架构，将控制平面与核心助手分离，确保跨平台通信的流畅性与扩展性。\n\nOpenClaw 非常适合希望构建个性化工作流的技术爱好者、开发者，以及注重隐私保护且不愿被单一生态绑定的普通用户。只要具备基础的终端操作能力（支持 macOS、Linux 及 Windows WSL2），即可通过简单的命令行引导完成部署。如果你渴望拥有一个懂你",349277,3,"2026-04-06T06:32:30",[13,14,15,16],"Agent","开发框架","图像","数据工具","ready",{"id":19,"name":20,"github_repo":21,"description_zh":22,"stars":23,"difficulty_score":24,"last_commit_at":25,"category_tags":26,"status":17},2268,"ML-For-Beginners","microsoft\u002FML-For-Beginners","ML-For-Beginners 是由微软推出的一套系统化机器学习入门课程，旨在帮助零基础用户轻松掌握经典机器学习知识。这套课程将学习路径规划为 12 周，包含 26 节精炼课程和 52 道配套测验，内容涵盖从基础概念到实际应用的完整流程，有效解决了初学者面对庞大知识体系时无从下手、缺乏结构化指导的痛点。\n\n无论是希望转型的开发者、需要补充算法背景的研究人员，还是对人工智能充满好奇的普通爱好者，都能从中受益。课程不仅提供了清晰的理论讲解，还强调动手实践，让用户在循序渐进中建立扎实的技能基础。其独特的亮点在于强大的多语言支持，通过自动化机制提供了包括简体中文在内的 50 多种语言版本，极大地降低了全球不同背景用户的学习门槛。此外，项目采用开源协作模式，社区活跃且内容持续更新，确保学习者能获取前沿且准确的技术资讯。如果你正寻找一条清晰、友好且专业的机器学习入门之路，ML-For-Beginners 将是理想的起点。",85092,2,"2026-04-10T11:13:16",[15,16,27,28,13,29,30,14,31],"视频","插件","其他","语言模型","音频",{"id":33,"name":34,"github_repo":35,"description_zh":36,"stars":37,"difficulty_score":38,"last_commit_at":39,"category_tags":40,"status":17},5784,"funNLP","fighting41love\u002FfunNLP","funNLP 是一个专为中文自然语言处理（NLP）打造的超级资源库，被誉为\"NLP 民工的乐园”。它并非单一的软件工具，而是一个汇集了海量开源项目、数据集、预训练模型和实用代码的综合性平台。\n\n面对中文 NLP 领域资源分散、入门门槛高以及特定场景数据匮乏的痛点，funNLP 提供了“一站式”解决方案。这里不仅涵盖了分词、命名实体识别、情感分析、文本摘要等基础任务的标准工具，还独特地收录了丰富的垂直领域资源，如法律、医疗、金融行业的专用词库与数据集，甚至包含古诗词生成、歌词创作等趣味应用。其核心亮点在于极高的全面性与实用性，从基础的字典词典到前沿的 BERT、GPT-2 模型代码，再到高质量的标注数据和竞赛方案，应有尽有。\n\n无论是刚刚踏入 NLP 领域的学生、需要快速验证想法的算法工程师，还是从事人工智能研究的学者，都能在这里找到急需的“武器弹药”。对于开发者而言，它能大幅减少寻找数据和复现模型的时间；对于研究者，它提供了丰富的基准测试资源和前沿技术参考。funNLP 以开放共享的精神，极大地降低了中文自然语言处理的开发与研究成本，是中文 AI 社区不可或缺的宝藏仓库。",79857,1,"2026-04-08T20:11:31",[30,16,29],{"id":42,"name":43,"github_repo":44,"description_zh":45,"stars":46,"difficulty_score":38,"last_commit_at":47,"category_tags":48,"status":17},5773,"cs-video-courses","Developer-Y\u002Fcs-video-courses","cs-video-courses 是一个精心整理的计算机科学视频课程清单，旨在为自学者提供系统化的学习路径。它汇集了全球知名高校（如加州大学伯克利分校、新南威尔士大学等）的完整课程录像，涵盖从编程基础、数据结构与算法，到操作系统、分布式系统、数据库等核心领域，并深入延伸至人工智能、机器学习、量子计算及区块链等前沿方向。\n\n面对网络上零散且质量参差不齐的教学资源，cs-video-courses 解决了学习者难以找到成体系、高难度大学级别课程的痛点。该项目严格筛选内容，仅收录真正的大学层级课程，排除了碎片化的简短教程或商业广告，确保用户能接触到严谨的学术内容。\n\n这份清单特别适合希望夯实计算机基础的开发者、需要补充特定领域知识的研究人员，以及渴望像在校生一样系统学习计算机科学的自学者。其独特的技术亮点在于分类极其详尽，不仅包含传统的软件工程与网络安全，还细分了生成式 AI、大语言模型、计算生物学等新兴学科，并直接链接至官方视频播放列表，让用户能一站式获取高质量的教育资源，免费享受世界顶尖大学的课堂体验。",79792,"2026-04-08T22:03:59",[29,15,16,14],{"id":50,"name":51,"github_repo":52,"description_zh":53,"stars":54,"difficulty_score":10,"last_commit_at":55,"category_tags":56,"status":17},3128,"ragflow","infiniflow\u002Fragflow","RAGFlow 是一款领先的开源检索增强生成（RAG）引擎，旨在为大语言模型构建更精准、可靠的上下文层。它巧妙地将前沿的 RAG 技术与智能体（Agent）能力相结合，不仅支持从各类文档中高效提取知识，还能让模型基于这些知识进行逻辑推理和任务执行。\n\n在大模型应用中，幻觉问题和知识滞后是常见痛点。RAGFlow 通过深度解析复杂文档结构（如表格、图表及混合排版），显著提升了信息检索的准确度，从而有效减少模型“胡编乱造”的现象，确保回答既有据可依又具备时效性。其内置的智能体机制更进一步，使系统不仅能回答问题，还能自主规划步骤解决复杂问题。\n\n这款工具特别适合开发者、企业技术团队以及 AI 研究人员使用。无论是希望快速搭建私有知识库问答系统，还是致力于探索大模型在垂直领域落地的创新者，都能从中受益。RAGFlow 提供了可视化的工作流编排界面和灵活的 API 接口，既降低了非算法背景用户的上手门槛，也满足了专业开发者对系统深度定制的需求。作为基于 Apache 2.0 协议开源的项目，它正成为连接通用大模型与行业专有知识之间的重要桥梁。",77062,"2026-04-04T04:44:48",[13,15,14,30,29],{"id":58,"name":59,"github_repo":60,"description_zh":61,"stars":62,"difficulty_score":10,"last_commit_at":63,"category_tags":64,"status":17},519,"PaddleOCR","PaddlePaddle\u002FPaddleOCR","PaddleOCR 是一款基于百度飞桨框架开发的高性能开源光学字符识别工具包。它的核心能力是将图片、PDF 等文档中的文字提取出来，转换成计算机可读取的结构化数据，让机器真正“看懂”图文内容。\n\n面对海量纸质或电子文档，PaddleOCR 解决了人工录入效率低、数字化成本高的问题。尤其在人工智能领域，它扮演着连接图像与大型语言模型（LLM）的桥梁角色，能将视觉信息直接转化为文本输入，助力智能问答、文档分析等应用场景落地。\n\nPaddleOCR 适合开发者、算法研究人员以及有文档自动化需求的普通用户。其技术优势十分明显：不仅支持全球 100 多种语言的识别，还能在 Windows、Linux、macOS 等多个系统上运行，并灵活适配 CPU、GPU、NPU 等各类硬件。作为一个轻量级且社区活跃的开源项目，PaddleOCR 既能满足快速集成的需求，也能支撑前沿的视觉语言研究，是处理文字识别任务的理想选择。",75489,"2026-04-13T11:13:28",[30,15,14,29],{"id":66,"github_repo":67,"name":68,"description_en":69,"description_zh":70,"ai_summary_zh":70,"readme_en":71,"readme_zh":72,"quickstart_zh":73,"use_case_zh":74,"hero_image_url":75,"owner_login":76,"owner_name":77,"owner_avatar_url":78,"owner_bio":79,"owner_company":80,"owner_location":81,"owner_email":82,"owner_twitter":80,"owner_website":83,"owner_url":84,"languages":80,"stars":85,"forks":86,"last_commit_at":87,"license":80,"difficulty_score":38,"env_os":88,"env_gpu":89,"env_ram":89,"env_deps":90,"category_tags":93,"github_topics":94,"view_count":24,"oss_zip_url":80,"oss_zip_packed_at":80,"status":17,"created_at":102,"updated_at":103,"faqs":104,"releases":105},7262,"camoverride\u002Flit","lit","Literature for the self-taught AI practitioner! 📚","lit 是一个专为自学人工智能从业者精心整理的技术书单库。它解决了初学者在面对海量学习资料时难以筛选优质资源、缺乏系统学习路径的痛点。通过收录自然语言处理、深度学习、统计学及数据科学等领域的经典著作，lit 将书籍按主题分类，并大致按照从入门通识到专业深化的顺序排列，帮助用户构建清晰的知识体系。\n\n这份资源特别适合希望转行或自我提升的开发者、数据科学家以及 AI 研究者，尤其是那些没有正式计算机科班背景但渴望掌握核心技能的学习者。书单中既包含像《Speech and Language Processing》和《Deep Learning Book》这样侧重数学推导与理论深度的“圣经”级教材，也涵盖了《Hands on Machine Learning》等强调代码实战的指南，兼顾了学术严谨性与工程落地性。此外，项目还贴心地为完全零基础的用户提供了起步指南，指引他们如何根据自身情况选择第一本书。lit 不仅是一份资源列表，更是一位经验丰富的前辈为你绘制的成长地图，助你在 AI 领域稳步前行。","# lit\n\nLiterally just literature. Books for the self-taught AI practitioner.\n\nThis is a curated collection of technical books that I've found useful in my career as a data scientist. I'm mostly self-taught, though I have a formal background in psychology, linguistics, and laboratory science. My career has been based around NLP (specifically speech) with forays into other regions of the ML, AI, and the engineering world.\n\nThese books are sorted by topic; the books under each heading are vaguely ordered from introductory\u002Fgeneral to technical\u002Fspecific, with some of the \"behemoths\" listed last.\n\nI can't guarantee that all of these books have been acquired in a lawful way.\n\nHave fun 😉\n\nBut WAIT, *camoverride*, I'm totally new and don't even know where to get started! [Read this](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fwiki\u002FRead-This-to-Know-What-Books-to-Read!) for more details about how to begin your journey.\n\n\n## artificial intelligence\n\n### NLTK Book\n\n[link](https:\u002F\u002Fwww.nltk.org\u002Fbook\u002F)\n\nA quick and easy tour-de-force of all the major topics in the field of natural language processing. This book is very code-focused, and mostly relies on native Python and the eponymous _NLTK_ library, which in practice serves mostly as a learning library. You should read this book in part because it's easy and useful, and in part because _everyone_ in NLP seems to have read it at some point.\n\n\n### Speech and Language Processing\n\n[link](https:\u002F\u002Fweb.stanford.edu\u002F~jurafsky\u002Fslp3\u002F)\n\nThis is the bible of natural language processing - it covers every major topic and treats them all with depth. Well written and very up-to-date. This book is formal and math-focused, with academic-style pseudocode and no Python in its chapters. If you're able to fully grok this book, you'll be cut out for any job in the NLP world.\n\n\n### Deep Learning Book\n\n[link](https:\u002F\u002Fwww.deeplearningbook.org\u002F)\n\nThe bible of deep learning - it has broad coverage of all the major topics in deep learning as well as lots of depth in each topic. Formal, mathy, and academic - in places this book is impenetrable, and in other places it reads like a good novel. If you want to take your knowledge of DL to the next level, you need to read this book. Also: read it because everyone else has read it.\n\n\n### Zero to Deep Learning\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FZero-to-Deep-Learning.pdf)\n\nThis book is extremely hands-on: little formal math but lots of code examples. The purpose of this book is _not_ to give the reader an exhaustive knowledge of deep learning; rather, it's to give an engineer the conceptual framework and tools (keras, tensorflow, etc) to start solving real-world problems fast. The author of this book [plagiarized a ML app](https:\u002F\u002Fcamtsmith.com\u002Farticles\u002F2021-02\u002Fhow-to-build-ml-app-from-scratch) that I created so I don't feel guilty putting his book online.\n\n\n### The Elements of Statistical Learning\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FThe-Elements-of-Statistical-Learning.pdf)\n\nA strong, mathy overview of machine learning, including deep learning. A book that most AI practitioners will encounter at some point.\n\n\n### Artificial Intelligence: A Modern Approach\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FArtificial-Intelligence-A-Modern-Approach-3rd-ed.pdf)\n\nThe bible of artificial intelligence: all the background ideas, theories, and debates you'll encounter in the AI field will be presented here. This book can feel a little archaic as some of the techniques presented have fallen out of use, but understanding the history of the field is just as important as being a theoretician or a hacker.\n\n\n## data science\n\n### Hands on Machine Learning\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FHands-On-Machine-Learning.pdf)\n\nA solid introduction to the practice of data science. Good mixture of code and theory. This book covers all the basics that an entry-level data scientist should be familiar with: descriptive statistics, modeling, data visualizations, deep learning, etc. Emphasis is placed on _sklearn_ and _tensorflow_. This was the first \"data scientist\" book that I read and it was enough to get me a job in the industry.\n\n\n### SQL Cookbook\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FSQL-Cookbook.pdf)\n\nAnother massive O'Reilley textbook. I used this to learn the basics of SQL and still return to it from time to time with questions, but I've found the website [SQL Zoo](https:\u002F\u002Fsqlzoo.net\u002F) much more useful for getting up and running with SQL, so I'd start there before moving on to this book.\n\n\n## computer science\n\n### Code\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FCode.pdf)\n\nIf you're self-taught like me, you probably developed a scattered knowledge of the useful bits of programming\u002Fengineering without knowing the fundamentals. However, like me, you probably had lurking doubts about \"how this stuff actually works\" - this book will calm your worried mind. Starting with the basics of electricity and circuits, this book moves slowly and carefully through the fundamental hardware pieces, logic gates, and assembly so that by the end you'll finally understand how code __actually__ works.\n\n\n### Cracking the Coding Interview\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FCracking-the-Coding-Interview.pdf)\n\nThis book is the standard text recommended for the dreaded algorithms interview that software engineers encounter. It's written by a Google hiring manager and has practical interview advice as well as algorithm tips. There might be better books out there nowadays, but despite being written in _Java_ it's still good. Also, you should read it because everyone else reads it. This book is more advanced than _Grokking Algorithms_.\n\n\n### Introduction to Algorithms\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FIntroduction-to-Algorithms.pdf)\n\nThis is the bible of computer science algorithms. Totally comprehensive, stuffed with terrifying equations, and too thick to fit in a backpack - if you memorize this book, you're guaranteed a SWE job at the FAANG company of your choice... That being said, I only consulted portions of this book, and that seems to be how it's generally used: everyone has encountered this behemoth, but only the truly brave have made it from cover to cover...\n\n\n### The Structure and Interpretation of Computer Programs\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FThe-Structure-and-Interpretation-of-Computer-Programs.pdf)\n\nA big, broad, biblical overview of the field of computer science. A book that all the \"old guru's\" have read. Definitely archaic (code is written in _Lisp_) but worth a non-close-reading.\n\n\n## programming\n\n### Learn Python the Hard Way\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FLearn-Python-3-The-Hard-Way.pdf)\n\nThis book uses the _throw the baby in the pool_ approach to teaching: you look at the code, you see how it works, and you learn. Simple, in your face, and a good place for a total beginner to start. This was the first programming book I ever read (I used the Python 2 version) but I recently opened it again and the Python 3 version (this one) is still great.\n\n\n### Effective Python\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FEffective-Python.pdf)\n\nThis is a really solid intermediate Python text. Once you've mastered the basics of the language and done a few little projects but feel like you're ready to learn how professionals really do things, this is a good book to turn to. In addition to demonstrating concepts you probably haven't encountered in an introductory text, it emphasizes what __not__ to do.\n\n\n### JavaScript: the Definitive Guide\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FJavascript-The-Definitive-Guide.pdf)\n\nI'm mostly a Python guy, but I dip into JS every once in a while. This is a pretty standard O'Reilly book, but it's useful for getting a feel for the language. When I write in JavaScript, I still find myself thinking in Python and translating to JS, but it's useful to have a reference text like this to know how things _should_ be written.\n\n\n### Mastering Python Regular Expressions\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FMastering-Python-Regular-Expressions.pdf)\n\nKind of specific, but regexes are useful and you shouldn't just ignore them when you can sit down, read a short book like this one, and leave feeling totally comfortable with them. There were periods in my career when I used regexes every day.\n\n\n## operating systems\n\n### Unix for Poets\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FUnix-for-Poets.pdf)\n\nLiterally everyone who ever plans to use the command line should read this short monograph - even experienced users will learn something. Short little haikus that allow you to unlock the potential of the unix terminal and do some cool stuff. The focus is on NLP, but you'll find uses elsewhere. This was recommended by one of the [genius ancient computer scientists](https:\u002F\u002Fscholar.google.com\u002Fcitations?user=bufdhOkAAAAJ&hl=en) on the Siri team when I worked at Apple.\n\n\n### How Linux Works\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FHow-Linux-Works-What-Every-Superuser-Should-Know.pdf)\n\nIf you've just started exploring the world of the command line (peering beneath the surface of the Graphical User Interface) and feel frightened by what you see, this book will demystify things. Lots of details about the shell, the file system, and other things you encounter when exploring Linux - most of the knowledge here is transferrable to a MacOS environment as well.\n\n\n## math\u002Fstats\n\n### Doing Math with Python\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FDoing-Math-With-Python.pdf)\n\nA cute re-introduction to math from the perspective of programming. No super advanced or surprising concepts make it into this book (it's all just high school math), but it's a nice reality check if you're used to thinking of math and programming as two totally different domains.\n\n\n### Introduction to Mathematical Statistics\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FIntroduction-to-Mathematical-Statistics.pdf)\n\nA great introduction to the fundamentals of mathematical statistics. Although this is a math book, the problems and proofs are convincing and practical, and it's pleasant to read. A similar but slightly inferior book is [Mathematical Statistics and Data Analysis](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FMathematical-Statistics-and-Data-Analysis.pdf) which I decided to include here too.\n\n\n### Mathematics for Machine Learning\n\n[link](https:\u002F\u002Fmml-book.github.io\u002F)\n\nIf you've started a career in AI but have some nagging doubts that your math knowledge might be incomplete, this is the book that'll plug up all the holes in your knowledge. The title says it all: it's a tour de force of all the math you need to understand ML and feel confident.\n\n\n### Elements of Information Theory\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FElements-of-Information-Theory.pdf)\n\nInformation theory is not often talked about in the corporate world, but it has many parallels with the \"standard suite\" of mathematical techniques you'll encounter. For someone wanting a truly well-rounded AI background, information theory is worth looking into.\n\n\n### Understanding Digital Signal Processing\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FUnderstanding-Digital-Signal-Processing.pdf)\n\nLike information theory, DSP is another area of mathematics that AI people will extract tools and ideas from (like fourier analysis) but usually not go deeply into - if you want to go deep, this is the book to read!\n\n\n### The Princeton Companion to Mathematics\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FThe-Princeton-Companion-to-Mathematics.pdf)\n\nThere's a lot else going on in math apart from linear algebra, information theory, and DSP. This cinderblock-sized book is comprehensive - read it and you'll be able to have a real conversation with a mathematician without your eyes glazing over.\n\n\n## networks\n\n### Flask Web Development\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FFlask-Web-Development.pdf)\n\nIf you're using Python and want to start doing web development, this is a good place to start. Flask is ubiquitous in the Python world and if you want to make interesting projects, you should get confortable with it. This book is hands-on and not at all concerned with theory. If you've never heard of HTML before, this might be a little bit too advanced for you, but if [you've made a static website](https:\u002F\u002Fdocs.github.com\u002Fen\u002Fpages\u002Fsetting-up-a-github-pages-site-with-jekyll) and want to get to the next level, start here.\n\n\n### Internet Routing Architectures\n\n[link](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FInternet-Routing-Architectures.pdf)\n\nA painfully in-depth book about how the internet works. Unlike the Flask book, this one is theoretical and won't be super relevant to your job as an AI practitioner. But if you ever want a boost of confidence - the feeling that you _get_ how the internet works - this is a great resource.\n","# 文学\n\n字面意思就是文学。为自学成才的 AI 从业者准备的书籍。\n\n这是一份精心挑选的技术书籍合集，它们在我作为数据科学家的职业生涯中非常有用。我主要是自学成才，不过我也接受过心理学、语言学和实验室科学的正规教育。我的职业生涯主要围绕自然语言处理（特别是语音）展开，并涉足机器学习、人工智能和工程领域的其他方向。\n\n这些书按主题分类；每个标题下的书籍大致按照从入门\u002F通用到技术\u002F具体的顺序排列，其中一些“巨著”被列在最后。\n\n我无法保证这些书籍都是合法获取的。\n\n祝你阅读愉快 😉\n\n等等，*camoverride*，我还是个新手呢，根本不知道从哪儿开始！[请阅读这篇](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fwiki\u002FRead-This-to-Know-What-Books-to-Read!)，了解更多关于如何开启你的学习之旅的信息。\n\n\n## 人工智能\n\n### NLTK 书籍\n\n[链接](https:\u002F\u002Fwww.nltk.org\u002Fbook\u002F)\n\n这是一本快速而简洁的自然语言处理领域主要话题的概览性著作。本书以代码为主，主要使用原生 Python 和同名的 _NLTK_ 库，实际上它更多地充当一个学习工具。你应该读这本书，一方面因为它简单易懂且实用，另一方面因为 NLP 领域的几乎每个人都曾在某个时候读过它。\n\n\n### 语音与语言处理\n\n[链接](https:\u002F\u002Fweb.stanford.edu\u002F~jurafsky\u002Fslp3\u002F)\n\n这是自然语言处理的圣经——它涵盖了所有主要话题，并对每个话题都进行了深入探讨。文笔流畅，内容也非常前沿。这本书风格正式，注重数学推导，采用学术风格的伪代码，章节中完全没有 Python 代码。如果你能完全理解这本书的内容，那么你在 NLP 领域将如鱼得水。\n\n\n### 深度学习书籍\n\n[链接](https:\u002F\u002Fwww.deeplearningbook.org\u002F)\n\n深度学习领域的圣经——它广泛覆盖了深度学习的所有主要话题，并且对每个话题都有相当深入的探讨。风格正式、数学化且学术性强，有些部分晦涩难懂，而另一些部分则像一本好小说一样引人入胜。如果你想把你的深度学习知识提升到一个新的水平，就必须读这本书。另外，大家都读过它，你也应该读一读。\n\n\n### 从零开始学习深度学习\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FZero-to-Deep-Learning.pdf)\n\n这本书非常注重实践：几乎没有正式的数学推导，但有大量的代码示例。本书的目的并不是让读者全面掌握深度学习的知识，而是为工程师提供概念框架和工具（如 Keras、TensorFlow 等），以便快速解决实际问题。这本书的作者 [剽窃了我的一个机器学习应用](https:\u002F\u002Fcamtsmith.com\u002Farticles\u002F2021-02\u002Fhow-to-build-ml-app-from-scratch)，所以我并不觉得把他这本书放在网上有什么负罪感。\n\n\n### 统计学习要素\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FThe-Elements-of-Statistical-Learning.pdf)\n\n这是一本强大而充满数学推导的机器学习概述，其中包括深度学习的内容。大多数 AI 从业者都会在某个阶段接触到这本书。\n\n\n### 人工智能：一种现代方法\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FArtificial-Intelligence-A-Modern-Approach-3rd-ed.pdf)\n\n人工智能领域的圣经：你在 AI 领域会遇到的所有背景思想、理论和争论都会在这里呈现出来。这本书可能会显得有些过时，因为其中介绍的一些技术已经不再流行，但了解该领域的发展历史，与成为理论家或黑客同样重要。\n\n\n## 数据科学\n\n### 动手学机器学习\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FHands-On-Machine-Learning.pdf)\n\n这是一本扎实的数据科学实践入门书。代码与理论结合得很好。本书涵盖了初级数据科学家应掌握的所有基础知识：描述性统计、建模、数据可视化、深度学习等。重点介绍了 _scikit-learn_ 和 _tensorflow_。这是我读的第一本“数据科学家”书籍，它足以让我在行业内找到一份工作。\n\n\n### SQL 烹饪书\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FSQL-Cookbook.pdf)\n\n又一本 O'Reilly 出版的大型教材。我用它来学习 SQL 的基础知识，至今仍时不时翻阅以解答疑问。不过，我发现 [SQL Zoo](https:\u002F\u002Fsqlzoo.net\u002F) 这个网站在快速上手 SQL 方面更加实用，所以我会建议先从那里开始，再过渡到这本书。\n\n\n## 计算机科学\n\n### 代码\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FCode.pdf)\n\n如果你像我一样是自学成才的，那你很可能对编程和工程中的有用部分有一些零散的认识，却缺乏扎实的基础知识。然而，就像我一样，你可能一直心存疑惑：“这些东西到底是怎么工作的？”这本书将会消除你的疑虑。从电学和电路的基础讲起，本书循序渐进地介绍了计算机硬件的基本组件、逻辑门和汇编语言，直到最后让你真正理解代码是如何运作的。\n\n\n### 破解编码面试\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FCracking-the-Coding-Interview.pdf)\n\n这本书是软件工程师们 dreaded 算法面试的标准参考书。它由一位 Google 的招聘经理撰写，不仅提供了实用的面试建议，还有算法技巧。如今或许有更好的书籍，但尽管它是用 _Java_ 写成的，仍然值得一读。此外，大家都读过它，你也应该读一读。这本书比 _Grokking Algorithms_ 更高级。\n\n\n### 算法导论\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FIntroduction-to-Algorithms.pdf)\n\n这是计算机科学算法领域的圣经。内容极其全面，充斥着令人望而生畏的公式，厚得连背包都装不下——如果你能把这本书背下来，就一定能进入你心仪的 FAANG 公司担任 SWE 职位……不过，我个人只是偶尔查阅其中的部分内容，这也正是它的常见用法：每个人都听说过这部巨著，但真正从头到尾读完的人却寥寥无几。\n\n\n### 计算机程序的构造和解释\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FThe-Structure-and-Interpretation-of-Computer-Programs.pdf)\n\n这是一本庞大而全面的计算机科学领域概述性著作。所有“老前辈”都读过的经典。虽然有些过时（代码是用 _Lisp_ 编写的），但仍然值得粗略地浏览一下。\n\n\n## 编程\n\n### 用艰难的方式学习 Python\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FLearn-Python-3-The-Hard-Way.pdf)\n\n这本书采用了“直接把孩子扔进泳池”的教学方法：你只需查看代码、理解其工作原理，便能学会。简单直接，非常适合完全的新手入门。这是我读过的第一本编程书（当时用的是 Python 2 版本），但最近我又重新翻阅了它，发现 Python 3 版本（也就是这本）依然非常出色。\n\n\n### 有效 Python\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FEffective-Python.pdf)\n\n这是一本非常扎实的 Python 中级教材。当你已经掌握了语言的基础知识，并完成了一些小项目，觉得自己准备好学习专业开发者的工作方式时，这本书便是不错的选择。除了展示你在入门教材中可能尚未接触的概念之外，它还特别强调哪些做法是__不应该__采用的。\n\n\n### JavaScript 权威指南\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FJavascript-The-Definitive-Guide.pdf)\n\n我主要是 Python 程序员，不过偶尔也会用到 JavaScript。这本书属于典型的 O’Reilly 出版社风格，对于熟悉这门语言很有帮助。当我用 JavaScript 编写代码时，仍然会不自觉地以 Python 的思维方式思考，然后再转换成 JavaScript；而像这样一本参考书，则能让我清楚地知道代码究竟应该如何编写。\n\n\n### 掌握 Python 正则表达式\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FMastering-Python-Regular-Expressions.pdf)\n\n内容较为具体，但正则表达式非常实用，不应被忽视。只要静下心来阅读这样一本书，就能轻松掌握正则表达式的使用技巧。在我的职业生涯中，曾有很长一段时间每天都需要用到正则表达式。\n\n\n## 操作系统\n\n### 诗人的 Unix\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FUnix-for-Poets.pdf)\n\n从字面上讲，任何计划使用命令行的人都应该阅读这篇短文——即便是经验丰富的用户也能从中获益。书中以简短的俳句形式，帮助你发掘 Unix 终端的潜力，完成一些有趣的操作。虽然重点在于自然语言处理，但这些技巧在其他领域也同样适用。这份推荐来自我在苹果公司工作时，Siri 团队中一位[天才般的古代计算机科学家](https:\u002F\u002Fscholar.google.com\u002Fcitations?user=bufdhOkAAAAJ&hl=en)。\n\n\n### Linux 是如何工作的\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FHow-Linux-Works-What-Every-Superuser-Should-Know.pdf)\n\n如果你刚刚开始探索命令行的世界（试图揭开图形用户界面背后的真相），并对所见景象感到畏惧，那么这本书将为你揭开其中的神秘面纱。书中详细介绍了 shell、文件系统以及在 Linux 系统中会遇到的其他内容；其中许多知识同样适用于 macOS 环境。\n\n\n## 数学\u002F统计学\n\n### 用 Python 做数学\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FDoing-Math-With-Python.pdf)\n\n这是一本从编程视角出发，重新介绍数学的趣味书籍。书中并未涉及过于高深或令人惊讶的概念（全部都是高中数学的内容），但对于那些习惯于将数学和编程视为两个完全不同领域的读者来说，它却是一个很好的提醒，帮助你认识到这两者之间的联系。\n\n\n### 数理统计导论\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FIntroduction-to-Mathematical-Statistics.pdf)\n\n这是一本优秀的数理统计基础入门书。尽管这是一本数学书籍，但其中的例题和证明都极具说服力且贴近实际，阅读起来也十分流畅。另一本类似但稍逊一筹的书籍是《数理统计与数据分析》（[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FMathematical-Statistics-and-Data-Analysis.pdf)），我也将其收录在此。\n\n\n### 机器学习中的数学\n\n[链接](https:\u002F\u002Fmml-book.github.io\u002F)\n\n如果你已经开始从事人工智能相关工作，却又隐隐担心自己的数学基础不够扎实，那么这本书将帮你填补所有知识漏洞。正如书名所示，它全面涵盖了理解机器学习所需的所有数学知识，让你从此充满信心。\n\n\n### 信息论要素\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FElements-of-Information-Theory.pdf)\n\n在企业界，信息论并不常被提及，但它与你通常接触到的“标准数学工具集”有着诸多相似之处。对于希望拥有真正全面的人工智能背景的人来说，信息论值得深入研究。\n\n\n### 数字信号处理的理解\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FUnderstanding-Digital-Signal-Processing.pdf)\n\n与信息论类似，数字信号处理也是人工智能从业者经常借鉴其工具和思想（例如傅里叶分析）的一个领域，但很少有人会深入钻研。如果你想深入了解这一主题，那么这本书便是最佳选择！\n\n\n### 普林斯顿数学伴侣\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FThe-Princeton-Companion-to-Mathematics.pdf)\n\n数学的世界远不止线性代数、信息论和数字信号处理。这本厚达数百页的书籍内容极为全面——读完之后，你便能够与数学家进行一场真正的对话，而不会因为听不懂对方的话而茫然失措。\n\n\n## 网络\n\n### Flask Web 开发\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FFlask-Web-Development.pdf)\n\n如果你正在使用 Python，并希望开始进行 Web 开发，那么 Flask 就是一个不错的起点。Flask 在 Python 社区中非常普及，如果你想做出一些有趣的项目，就应该熟练掌握它。这本书注重实践，几乎不涉及理论部分。如果你从未听说过 HTML，那么这本书可能会有些难度；但如果你已经搭建过一个静态网站（参见 [GitHub Pages 使用 Jekyll 搭建站点](https:\u002F\u002Fdocs.github.com\u002Fen\u002Fpages\u002Fsetting-up-a-github-pages-site-with-jekyll)），并希望更进一步，那么就从这里开始吧。\n\n\n### Internet 路由架构\n\n[链接](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fblob\u002Fmaster\u002FInternet-Routing-Architectures.pdf)\n\n这是一本极其深入的书籍，详细阐述了互联网的工作原理。与 Flask 书籍不同，这本书偏重理论，对你的 AI 工作可能并没有直接的帮助。然而，如果你渴望增强自信心——确切地理解互联网究竟是如何运作的——那么这本书无疑是一个绝佳的资源。","# lit 快速上手指南\n\n**工具简介**：`lit` 并非一个需要编译或安装运行的软件库，而是一个由社区维护的**精选 AI 与技术书籍开源清单**。它专为自学型 AI 从业者设计，涵盖了从人工智能、数据科学、计算机科学到数学统计等领域的经典著作。本指南将帮助你快速获取这些学习资源。\n\n## 环境准备\n\n由于 `lit` 本质上是托管在 GitHub 上的文档集合，因此无需特定的操作系统或复杂的依赖环境。\n\n*   **系统要求**：任意支持现代浏览器的操作系统（Windows, macOS, Linux）。\n*   **前置依赖**：\n    *   稳定的网络连接（用于访问 GitHub 或下载 PDF\u002F在线书籍）。\n    *   PDF 阅读器（如 Adobe Acrobat, Chrome 浏览器内置阅读器等），因为大部分书籍以 PDF 格式提供。\n    *   （可选）Git 客户端：如果你希望将整个书单克隆到本地离线阅读。\n\n## 获取与安装步骤\n\n你可以通过以下两种方式获取书单内容：\n\n### 方式一：直接在线阅读或下载（推荐）\n\n直接访问 GitHub 仓库页面，根据分类点击链接在线阅读或下载对应的 PDF 文件。\n\n1.  访问项目主页：\n    ```text\n    https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\n    ```\n2.  在 `README` 中找到你感兴趣的类别（如 `artificial intelligence`, `data science` 等）。\n3.  点击具体书籍名称后的 `[link]` 即可跳转。\n\n> **国内加速建议**：如果直接访问 GitHub 速度较慢，可使用国内代码托管平台镜像（如 Gitee 搜索同名项目）或使用 GitHub 加速服务进行访问。\n\n### 方式二：克隆到本地（适合离线整理）\n\n如果你希望将所有资源下载到本地管理，可以使用 `git` 命令：\n\n```bash\ngit clone https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit.git\n```\n\n进入目录查看文件结构：\n\n```bash\ncd lit\nls\n```\n\n## 基本使用\n\n`lit` 的使用核心在于**按图索骥**，根据你的当前水平选择书籍开始阅读。\n\n### 1. 新手入门路径\n如果你是完全的新手，不确定从哪里开始，请优先阅读作者推荐的入门指引：\n*   **行动**：访问 [Read This to Know What Books to Read!](https:\u002F\u002Fgithub.com\u002Fcamoverride\u002Flit\u002Fwiki\u002FRead-This-to-Know-What-Books-to-Read!)\n*   **目的**：获取针对初学者的阅读顺序建议。\n\n### 2. 按主题选书示例\n假设你想学习 **自然语言处理 (NLP)**，请参考 `artificial intelligence` 章节：\n\n*   **入门实战**：阅读 *NLTK Book*\n    *   特点：代码导向，基于 Python 和 NLTK 库，适合快速上手。\n    *   链接：`https:\u002F\u002Fwww.nltk.org\u002Fbook\u002F`\n*   **理论深造**：阅读 *Speech and Language Processing*\n    *   特点：NLP 领域的“圣经”，数学公式丰富，学术性强，涵盖深度理论。\n    *   链接：`https:\u002F\u002Fweb.stanford.edu\u002F~jurafsky\u002Fslp3\u002F`\n\n假设你想夯实 **Python 编程基础**，请参考 `programming` 章节：\n\n*   **零基础**：阅读 *Learn Python the Hard Way*\n    *   特点：通过大量代码练习教学，适合从未写过代码的人。\n*   **进阶提升**：阅读 *Effective Python*\n    *   特点：面向已掌握基础的开发者，讲解专业写法与避坑指南。\n\n### 3. 学习建议\n书单中的书籍大致按**从入门\u002F通用**到**技术\u002F专精**的顺序排列，部分大部头著作列在最后。建议结合自身背景（如是否有数学基础、是否熟悉编程）按需选取，不必强求从头读到尾。","一位自学转行的 NLP 工程师正试图从基础统计模型进阶到深度学习，以解决复杂的语音识别问题。\n\n### 没有 lit 时\n- **选书迷茫**：面对海量技术书籍，无法分辨哪些是行业公认的“圣经”，哪些已过时，导致在低质量资料上浪费数周时间。\n- **路径断层**：缺乏系统性的学习路线，直接从代码实战跳入高深数学理论，因基础不牢而难以理解核心算法原理。\n- **资源分散**：需要在全网搜索各类 PDF 和链接，不仅效率低下，还面临获取渠道不安全或版本混乱的风险。\n- **视野狭窄**：仅关注热门框架教程，忽视了语言学、心理学等跨学科背景知识，限制了处理复杂语音场景的能力。\n\n### 使用 lit 后\n- **精准导航**：直接依据 lit 整理的清单，按“入门通用”到“技术深潜”的顺序阅读，快速锁定《Speech and Language Processing》等必读经典。\n- **平滑进阶**：遵循推荐路径，先通过《NLTK Book》掌握代码实战，再逐步攻克《Deep Learning Book》中的数学难点，建立稳固的知识体系。\n- **一站获取**：直接在仓库中找到按主题分类的高质量书籍集合，省去了繁琐的检索过程，确保学习资料的权威性与完整性。\n- **跨界融合**：接触到涵盖语言学及实验室科学背景的精选书目，拓宽了思维维度，能够更灵活地设计针对特定语音问题的解决方案。\n\nlit 将零散的行业智慧浓缩为一张清晰的自学地图，让自学者能以最低试错成本构建起专业级的 AI 知识大厦。","https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Fcamoverride_lit_b0e07911.png","camoverride","Cameron Smith","https:\u002F\u002Foss.gittoolsai.com\u002Favatars\u002Fcamoverride_723c8b18.png","Seattle-based artist. ex data scientist 📈 ",null,"Seattle","camoverride@gmail.com","smith.cam","https:\u002F\u002Fgithub.com\u002Fcamoverride",1768,533,"2026-04-13T08:54:16","","未说明",{"notes":91,"python":89,"dependencies":92},"该项目并非软件工具或代码库，而是一个 curated（精选）的技术书籍和文档集合（PDF、网页链接等），旨在为自学 AI 的从业者提供学习资源。因此，它没有特定的操作系统、GPU、内存、Python 版本或依赖库要求。用户只需具备阅读电子书或访问网页的能力即可使用。部分书籍内容涉及 Python、NLTK、TensorFlow、Keras、Sklearn 等编程知识，但本项目本身不包含可执行的代码环境需求。",[],[29,16],[95,96,97,98,99,100,101],"bookshelf","books","textbook","textbooks","artificial-intelligence","data-science","math","2026-03-27T02:49:30.150509","2026-04-14T05:04:06.359211",[],[]]