[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"similar-kaixxx--noScribe":3,"tool-kaixxx--noScribe":65},[4,23,32,40,49,57],{"id":5,"name":6,"github_repo":7,"description_zh":8,"stars":9,"difficulty_score":10,"last_commit_at":11,"category_tags":12,"status":22},2268,"ML-For-Beginners","microsoft\u002FML-For-Beginners","ML-For-Beginners 是由微软推出的一套系统化机器学习入门课程，旨在帮助零基础用户轻松掌握经典机器学习知识。这套课程将学习路径规划为 12 周，包含 26 节精炼课程和 52 道配套测验，内容涵盖从基础概念到实际应用的完整流程，有效解决了初学者面对庞大知识体系时无从下手、缺乏结构化指导的痛点。\n\n无论是希望转型的开发者、需要补充算法背景的研究人员，还是对人工智能充满好奇的普通爱好者，都能从中受益。课程不仅提供了清晰的理论讲解，还强调动手实践，让用户在循序渐进中建立扎实的技能基础。其独特的亮点在于强大的多语言支持，通过自动化机制提供了包括简体中文在内的 50 多种语言版本，极大地降低了全球不同背景用户的学习门槛。此外，项目采用开源协作模式，社区活跃且内容持续更新，确保学习者能获取前沿且准确的技术资讯。如果你正寻找一条清晰、友好且专业的机器学习入门之路，ML-For-Beginners 将是理想的起点。",85267,2,"2026-04-18T11:00:28",[13,14,15,16,17,18,19,20,21],"图像","数据工具","视频","插件","Agent","其他","语言模型","开发框架","音频","ready",{"id":24,"name":25,"github_repo":26,"description_zh":27,"stars":28,"difficulty_score":29,"last_commit_at":30,"category_tags":31,"status":22},5784,"funNLP","fighting41love\u002FfunNLP","funNLP 是一个专为中文自然语言处理（NLP）打造的超级资源库，被誉为\"NLP 民工的乐园”。它并非单一的软件工具，而是一个汇集了海量开源项目、数据集、预训练模型和实用代码的综合性平台。\n\n面对中文 NLP 领域资源分散、入门门槛高以及特定场景数据匮乏的痛点，funNLP 提供了“一站式”解决方案。这里不仅涵盖了分词、命名实体识别、情感分析、文本摘要等基础任务的标准工具，还独特地收录了丰富的垂直领域资源，如法律、医疗、金融行业的专用词库与数据集，甚至包含古诗词生成、歌词创作等趣味应用。其核心亮点在于极高的全面性与实用性，从基础的字典词典到前沿的 BERT、GPT-2 模型代码，再到高质量的标注数据和竞赛方案，应有尽有。\n\n无论是刚刚踏入 NLP 领域的学生、需要快速验证想法的算法工程师，还是从事人工智能研究的学者，都能在这里找到急需的“武器弹药”。对于开发者而言，它能大幅减少寻找数据和复现模型的时间；对于研究者，它提供了丰富的基准测试资源和前沿技术参考。funNLP 以开放共享的精神，极大地降低了中文自然语言处理的开发与研究成本，是中文 AI 社区不可或缺的宝藏仓库。",79857,1,"2026-04-08T20:11:31",[19,14,18],{"id":33,"name":34,"github_repo":35,"description_zh":36,"stars":37,"difficulty_score":29,"last_commit_at":38,"category_tags":39,"status":22},5773,"cs-video-courses","Developer-Y\u002Fcs-video-courses","cs-video-courses 是一个精心整理的计算机科学视频课程清单，旨在为自学者提供系统化的学习路径。它汇集了全球知名高校（如加州大学伯克利分校、新南威尔士大学等）的完整课程录像，涵盖从编程基础、数据结构与算法，到操作系统、分布式系统、数据库等核心领域，并深入延伸至人工智能、机器学习、量子计算及区块链等前沿方向。\n\n面对网络上零散且质量参差不齐的教学资源，cs-video-courses 解决了学习者难以找到成体系、高难度大学级别课程的痛点。该项目严格筛选内容，仅收录真正的大学层级课程，排除了碎片化的简短教程或商业广告，确保用户能接触到严谨的学术内容。\n\n这份清单特别适合希望夯实计算机基础的开发者、需要补充特定领域知识的研究人员，以及渴望像在校生一样系统学习计算机科学的自学者。其独特的技术亮点在于分类极其详尽，不仅包含传统的软件工程与网络安全，还细分了生成式 AI、大语言模型、计算生物学等新兴学科，并直接链接至官方视频播放列表，让用户能一站式获取高质量的教育资源，免费享受世界顶尖大学的课堂体验。",79792,"2026-04-08T22:03:59",[18,13,14,20],{"id":41,"name":42,"github_repo":43,"description_zh":44,"stars":45,"difficulty_score":46,"last_commit_at":47,"category_tags":48,"status":22},3128,"ragflow","infiniflow\u002Fragflow","RAGFlow 是一款领先的开源检索增强生成（RAG）引擎，旨在为大语言模型构建更精准、可靠的上下文层。它巧妙地将前沿的 RAG 技术与智能体（Agent）能力相结合，不仅支持从各类文档中高效提取知识，还能让模型基于这些知识进行逻辑推理和任务执行。\n\n在大模型应用中，幻觉问题和知识滞后是常见痛点。RAGFlow 通过深度解析复杂文档结构（如表格、图表及混合排版），显著提升了信息检索的准确度，从而有效减少模型“胡编乱造”的现象，确保回答既有据可依又具备时效性。其内置的智能体机制更进一步，使系统不仅能回答问题，还能自主规划步骤解决复杂问题。\n\n这款工具特别适合开发者、企业技术团队以及 AI 研究人员使用。无论是希望快速搭建私有知识库问答系统，还是致力于探索大模型在垂直领域落地的创新者，都能从中受益。RAGFlow 提供了可视化的工作流编排界面和灵活的 API 接口，既降低了非算法背景用户的上手门槛，也满足了专业开发者对系统深度定制的需求。作为基于 Apache 2.0 协议开源的项目，它正成为连接通用大模型与行业专有知识之间的重要桥梁。",77062,3,"2026-04-04T04:44:48",[17,13,20,19,18],{"id":50,"name":51,"github_repo":52,"description_zh":53,"stars":54,"difficulty_score":46,"last_commit_at":55,"category_tags":56,"status":22},519,"PaddleOCR","PaddlePaddle\u002FPaddleOCR","PaddleOCR 是一款基于百度飞桨框架开发的高性能开源光学字符识别工具包。它的核心能力是将图片、PDF 等文档中的文字提取出来，转换成计算机可读取的结构化数据，让机器真正“看懂”图文内容。\n\n面对海量纸质或电子文档，PaddleOCR 解决了人工录入效率低、数字化成本高的问题。尤其在人工智能领域，它扮演着连接图像与大型语言模型（LLM）的桥梁角色，能将视觉信息直接转化为文本输入，助力智能问答、文档分析等应用场景落地。\n\nPaddleOCR 适合开发者、算法研究人员以及有文档自动化需求的普通用户。其技术优势十分明显：不仅支持全球 100 多种语言的识别，还能在 Windows、Linux、macOS 等多个系统上运行，并灵活适配 CPU、GPU、NPU 等各类硬件。作为一个轻量级且社区活跃的开源项目，PaddleOCR 既能满足快速集成的需求，也能支撑前沿的视觉语言研究，是处理文字识别任务的理想选择。",75872,"2026-04-18T10:54:57",[19,13,20,18],{"id":58,"name":59,"github_repo":60,"description_zh":61,"stars":62,"difficulty_score":29,"last_commit_at":63,"category_tags":64,"status":22},3215,"awesome-machine-learning","josephmisiti\u002Fawesome-machine-learning","awesome-machine-learning 是一份精心整理的机器学习资源清单，汇集了全球优秀的机器学习框架、库和软件工具。面对机器学习领域技术迭代快、资源分散且难以甄选的痛点，这份清单按编程语言（如 Python、C++、Go 等）和应用场景（如计算机视觉、自然语言处理、深度学习等）进行了系统化分类，帮助使用者快速定位高质量项目。\n\n它特别适合开发者、数据科学家及研究人员使用。无论是初学者寻找入门库，还是资深工程师对比不同语言的技术选型，都能从中获得极具价值的参考。此外，清单还延伸提供了免费书籍、在线课程、行业会议、技术博客及线下聚会等丰富资源，构建了从学习到实践的全链路支持体系。\n\n其独特亮点在于严格的维护标准：明确标记已停止维护或长期未更新的项目，确保推荐内容的时效性与可靠性。作为机器学习领域的“导航图”，awesome-machine-learning 以开源协作的方式持续更新，旨在降低技术探索门槛，让每一位从业者都能高效地站在巨人的肩膀上创新。",72149,"2026-04-03T21:50:24",[20,18],{"id":66,"github_repo":67,"name":68,"description_en":69,"description_zh":70,"ai_summary_zh":70,"readme_en":71,"readme_zh":72,"quickstart_zh":73,"use_case_zh":74,"hero_image_url":75,"owner_login":76,"owner_name":77,"owner_avatar_url":78,"owner_bio":77,"owner_company":77,"owner_location":77,"owner_email":77,"owner_twitter":77,"owner_website":77,"owner_url":79,"languages":80,"stars":100,"forks":101,"last_commit_at":102,"license":103,"difficulty_score":10,"env_os":104,"env_gpu":105,"env_ram":106,"env_deps":107,"category_tags":116,"github_topics":117,"view_count":10,"oss_zip_url":77,"oss_zip_packed_at":77,"status":22,"created_at":123,"updated_at":124,"faqs":125,"releases":161},9253,"kaixxx\u002FnoScribe","noScribe","Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)","noScribe 是一款专为高质量访谈转录设计的免费开源桌面应用，特别适用于定性社会研究和新闻工作。它巧妙地将 OpenAI 的 Whisper 语音识别技术与 pyannote 说话人分离算法相结合，通过友好的图形界面，让用户能在本地电脑上轻松完成音频转文字任务。\n\n这款工具核心解决了研究人员在处理大量访谈录音时的痛点：它不仅支持约 60 种语言的自动转录，还能智能区分不同说话人，生成带角色标记的文稿。最重要的是，noScribe 完全在本地运行，无需上传数据到云端，彻底保障了敏感访谈内容的机密性。内置的编辑器方便用户对转录结果进行校对和修正，弥补了自动转录可能存在的误差。\n\nnoScribe 非常适合社会科学研究者、记者、学生以及任何需要处理访谈录音的专业人士使用。虽然转录速度受硬件性能影响（一小时音频可能需要数小时处理），且对录音质量有一定要求，但其完全免费、开源透明且保护隐私的特性，使其成为学术和专业领域的可靠选择。开发者 Kai Dröge 凭借社会学与计算机科学的双重背景，打造这款旨在减轻学术工作负担的工具，让繁琐的听写工作变得不再痛苦。","﻿# noScribe\n### Cutting Edge AI Technology for Automated Audio Transcription\n\u003C\u002Fbr>\n\n> [!NOTE]\n> ### 🚀 The new official website for noScribe: https:\u002F\u002Fnoscribe.de\n> Learn how to install and use the software, and find tips to improve transcription quality.\n>\n> 🌐 Available in **English, German, Spanish, Italian, and Dutch**.\n>\n> Please update your links. \n\n---\n\n> [!WARNING]\n> Somebody has registered the domain **noscribe(dot)ai** to sell transcription services. **Stay away from this platform, I have nothing to do with it.** The real noScribe is free and always will be. This is obviously an attempt to profit from the popularity of my software and the reputation it gained over the years. Very sad. \n\n## What is noScribe?\n- An app to produce **high quality transcripts of interviews** for qualitative social research or journalistic use\n- noScribe is **free and open source** ([GPL-3.0](https:\u002F\u002Fwww.gnu.org\u002Flicenses\u002Fgpl-3.0.html)), available for Windows, MacOS and Linux \n- It runs **completely locally** on your computer, protecting the confidentiality of your interviews. No cloud, no worries\n- It can distinguish between different **speakers** and understands around 60 languages (more or less, see below)\n- It includes a **nice editor** to review, verify and correct the resulting transcript\n- It is standing on the shoulders of giants: [Whisper from OpenAI](https:\u002F\u002Fgithub.com\u002Fopenai\u002Fwhisper), [faster-whisper by Guillaume Klein](https:\u002F\u002Fgithub.com\u002Fguillaumekln\u002Ffaster-whisper) and [pyannote from Hervé Bredin](https:\u002F\u002Fgithub.com\u002Fpyannote\u002Fpyannote-audio)\n\n\u003C\u002Fbr>\n\n![Main window](https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Fkaixxx_noScribe_readme_1022ee3fdd91.png)\n(The transcript is from [this interview](https:\u002F\u002Fwww.youtube.com\u002Fwatch?v=vOwajAbvPzQ&t=2018s) which I did in May 2022 with the Russian sociologist Natalia Savelyeva.)\n\n## Limitations\n- The download is quite large (several gigabytes) due to the included AI models. \n- Beware that a one hour interview can take up to three hours to transcribe, depending on your machine. \n- Poor audio and background noise will lead to poor transcription results.\n- No automatic transcription is perfect, there will always be some manual revision necessary. Use the [included Editor](#noscribeedit) to check your transcripts thoroughly. (See also [\"Factors Influencing the Quality\"](#factors-influencing-the-quality-of-the-transcription) and [\"Known Issues\"](#known-issues) below.)\n\nIf you want to know more and can understand German, Rebecca Schmidt from the University of Paderborn wrote a nice [review of noScribe,](https:\u002F\u002Fsozmethode.hypotheses.org\u002F2315) also discussing its limitations. Also the German [computer magazine c't recommended noScribe in a recent review](https:\u002F\u002Fwww.heise.de\u002Fselect\u002Fct\u002F2025\u002F2\u002F2433207582191637980).\n\n## Why the Name \"noScribe\"?\nThe [urban dictionary](https:\u002F\u002Fwww.urbandictionary.com\u002Fdefine.php?term=Scribe) defines **scribe** as *\"a person whose entire miserable existence has been reduced to academic grunge and pain\".* I hope this software will make your academic life a little less painful and grungy, hence the name noScribe :)\n\n## About Me\n**Kai Dröge**, PhD in sociology (with a background in computer science), qualitative researcher and teacher, [Lucerne University for Applied Science (Switzerland)](https:\u002F\u002Fwww.hslu.ch\u002Fde-ch\u002Fhochschule-luzern\u002Fueber-uns\u002Fpersonensuche\u002Fprofile\u002F?pid=823) and [Institute for Social Research, Frankfurt\u002FM. (Germany)](https:\u002F\u002Fwww.ifs.uni-frankfurt.de\u002Fpersonendetails\u002Fkai-droege.html).\n\n## Donate\nNoScribe is free and always will be. However, developing it costs real money. I have purchased hardware for testing and pay Apple annually for a developer ID. If you would like to support this project, you can make a donation on Ko-Fi. Thanks! \n\n[![ko-fi](https:\u002F\u002Fko-fi.com\u002Fimg\u002Fgithubbutton_sm.svg)](https:\u002F\u002Fko-fi.com\u002Fnoscribe)\n\n## Download and Installation\n\n**Current Version Number: 0.7** (see [changelog](CHANGELOG.md))\n> All releases are hosted on SWITCHdrive, a secure data sharing platform for Swiss universities.\n\n### Windows\n\u003Cdetails>\n    \u003Csummary>Click to expand\u003C\u002Fsummary>\n\n- **Download:**\n    - The **general purpose version** for normal PCs without an NVIDIA graphics card: [https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FWindows%2Fnormal](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FWindows%2Fnormal) \n    - A special version using **CUDA acceleration on NVIDIA graphics cards** with at least 6 GB of VRAM: [https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FWindows%2Fcuda](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FWindows%2Fcuda). Make sure that your NVIDIA drivers are on version 570.65 or higher. You must also install the [CUDA toolkit from here](https:\u002F\u002Fdeveloper.nvidia.com\u002Fcuda-downloads?target_os=Windows) (a reboot is required afterwards).\n- **Installation**: \n    - Start the downloaded setup file. This may take a while, be patient.\n    - If you get a warning that \"Windows protected your PC\" and the app comes from an \"Unknown publisher\", you have to trust us and click \"Run anyway\"\n    - To do a silent install on a larger group of computers, start the setup with the argument `\u002FS`.\n- **Known Issues:**\n    - It seems that the RTX\u002FGTX 1XXX generation of cards is no longer supported in CUDA. Use the normal version instead. \n    - If you receive the following error message: \"Transcription worker exited unexpectedly (code 3221226505),\" try forcing the use of the CPU for transcription instead of the graphics card. This method is slower but more reliable. To do so, follow these steps: Close noScribe. Open the file `C:\\Users\\\u003CUSERNAME>\\AppData\\Local\\noScribe\\noScribe\\config.yml` in a text editor. Change the value for `force_whisper_cpu` to `'True'`. Don't forget the quotation marks around 'True'. Now, restart NoScribe. \n\n\u003C\u002Fdetails>\n\n### MacOS\n\u003Cdetails>\n    \u003Csummary>Click to expand\u003C\u002Fsummary>\n\nported by [gernophil](https:\u002F\u002Fgithub.com\u002Fgernophil) \u003C\u002Fbr>\n\n- **Newer Macs with Apple Silicon M1-M4 processors and macOS 14 or newer**\n    - Download: [https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FmacOS%2FApple%20Silicon](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FmacOS%2FApple%20Silicon)\n    - Double-click on the downloaded dmg-file, then drag noScribe and noScribeEdit into the link to your applications folder (labeled \"drag both here to install\").\n    - You will need Apple's Rosetta2 Intel emulator since one component (ffmpeg) is still made for Intel CPUs. If you don't have it installed already, do this as follows:\n        - Open the Terminal (located at `\u002FApplications\u002FUtilities\u002FTerminal.app`).\n        - Type `softwareupdate --install-rosetta` or `softwareupdate --install-rosetta --agree-to-license`.\n        - Hit enter and follow the instructions on the screen.\n    - Start noScribe and\u002For noScribeEdit by double-clicking the app within your applications.\n\n- **Older Macs with Intel processors**\n    - **Note: Version 0.7 is currently not available for Intel based Macs** due to incompatibilities with the newest pyannote release. You can, however, use version 0.6: [https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fapps\u002Ffiles\u002F?dir=\u002FnoScribe\u002FnoScribe%20releases\u002FnoScribe%20vers.%200.6\u002FmacOS\u002Fx86_64%20(Intel)&fileid=8266174681](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fapps\u002Ffiles\u002F?dir=\u002FnoScribe\u002FnoScribe%20releases\u002FnoScribe%20vers.%200.6\u002FmacOS\u002Fx86_64%20(Intel)&fileid=8266174681).\n    - Older versions may be available here: [https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j)\n    - Unfortunately, we are not able to sign the x86_64 package correctly, so you will get a warning that noScribe and noScribeEdit are from unregistered developers. You have to manually allow noScribe and noScribeEdit to be executed, if your Gatekeeper is active. Follow these steps:\n    - Double-click the downloaded dmg-file.\n    - Drag noScribe and noScribeEdit into the link to your applications folder (labeled \"drag both here to install\").\n    - Start noScribe by double-clicking the app within your applications folder. You will get an error that noScribe is from an unregistered developer. Do the same with the noScribe Editor.\n    - Go to Settings -> Privacy and Security -> Scroll down until you see a message stating noScribe was prevented from starting and click \"open anyway\". Again, do the same with the noScribe Editor.\n    - From now on, both programs should start without issues.\n\n\u003C\u002Fdetails>\n\n### Linux\n\u003Cdetails>\n    \u003Csummary>Click to expand\u003C\u002Fsummary>\n\nported by [Eckhard Kadasch](https:\u002F\u002Fgithub.com\u002Feckhrd) and [Florian Dobener](https:\u002F\u002Fgithub.com\u002Fdomna); executable generated by [gernophil](https:\u002F\u002Fgithub.com\u002Fgernophil).\n\n#### Executable Installation\n\n  - Download the CUDA or CPU version of noScribe 0.7 for Linux here: [https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FLinux](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FLinux)\n    - Issues with noScribe 0.7 for Linux on some systems have been reported. If noScribe 0.7 for Linux does not work on your system, please use noScribe 0.6.2 while we try to solve these issues: [https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.6%2FLinux](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.6%2FLinux)\n  - Untar the file using the terminal command `tar -xzvf noScribe_0.7.0_cpu_linux_amd64.tar.gz` or `tar -xzvf noScribe_0.7.0_cuda_linux_amd64.tar.gz`.\n  - Execute noScribe using the terminal by `cd`ing into the noScribe folder and executing `.\u002FnoScribe`.\n  - Optionally: Edit the files `noScribe.desktop` and `noScribeEdit.desktop` with a text editor and enter the complete path in the lines starting with `Exec=` and `Icon=`.\n\n#### Manual Installation From Source\n\nSee [this discussion](https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fdiscussions\u002F83) for\nmore information.\n\nIf you want to install from source, `git` and `git-lfs` are necessary to get\nall required pieces. The latest sources are directly fetched from the\nrepository. Please use the installation above (executable installation) if\nyou want to install a specific version.\n\n```bash\ngit clone https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe.git\n\n# After that, everything happens in the `noScribe` directory.\ncd noScribe\n\n# Install noScribeEdit\nrm -rf noScribeEdit\u002F\ngit clone https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribeEditor.git noScribeEdit\n\n# Create a python virtual environment.\npython3 -m venv venv\n\n# After the following command, python from the virtual environment is used. Also\n# package installations land here. Remember to run this command every time, you\n# want to start noScribe.\nsource venv\u002Fbin\u002Factivate\n\n# Install necessary dependencies.\npip install -r environments\u002Frequirements_linux.txt\npip install -r noScribeEdit\u002Fenvironments\u002Frequirements.txt\n\n# Download model files. Here, the precise as well as the fast models are\n# downloaded. If you use only one of them, it is enough to download only the\n# respective models.\nrm -rf models\u002Ffast\nrm -rf models\u002Fprecise\ngit clone https:\u002F\u002Fhuggingface.co\u002Fmukowaty\u002Ffaster-whisper-int8 models\u002Ffast\ngit clone https:\u002F\u002Fhuggingface.co\u002Fmobiuslabsgmbh\u002Ffaster-whisper-large-v3-turbo models\u002Fprecise\n\n# Run noScribe.\npython3 .\u002FnoScribe.py\n```\n\n\u003C\u002Fdetails>\n\n### Old versions:\n- [https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j)\n\n## Citation (APA Style)\nDröge, K. (2025). noScribe. AI-powered Audio Transcription (Version XXX) [Computer software]. https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\n\n## Usage\n### Settings\n\u003Cimg align=\"left\" src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Fkaixxx_noScribe_readme_f3b724080966.png\" width=\"300\">\n\n- Select your **audio file**. NoScribe supports almost any audio or video format. You can also select several files at once for [batch transcription](#batch-transcription).\n- Select the **filename for the transcript.** You can also choose the file type: *.html is the default, supported also by the noScribe editor. *.vtt is a video subtitles format and is especially useful if you want to import your transcript into [EXMARaLDA](https:\u002F\u002Fexmaralda.org\u002F) for further annotation. *.txt exports the transcript as plain text.\n- **Start** and **Stop** accept timestamps in the format hh:mm:ss. Use this to limit the transcription to a particular part of the recording. This is especially helpful for testing your settings with a small sample before committing to transcribing the whole interview, which may take several hours. Leave **Stop** empty if you want to transcribe until the end of the audio file.\n- **Language:** Select the language of your transcript. Set it to \"auto\" to detect the language, or choose \"multilingual\" if your audio contains more than one language (experimental).\n- **Quality:** \"Precise\" is the recommended setting for the most accurate transcript. On slower machines, you may opt for the \"fast\" option. This will be quicker but might necessitate more manual revision later. You can also [install custom models](https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fwiki\u002FAdd-custom-Whisper-models-for-transcription), fine-tuned for specific languages, etc.\n- **Mark Pause**: If enabled, parts of your audio without voice activity will be marked as pauses. Pauses are transcribed as round brackets with one dot per second inside, e.g., \"(..)\" for a two-second pause. Pauses longer than 10 seconds are written out as \"(XX seconds pause)\" or \"(XX minutes pause)\". You have the option to mark either pauses of one second and more (\"1sec+\"), two seconds and more (\"2sec+\"), or only the longer ones of three seconds and more (\"3sec+\"). Choose \"none\" to disable this feature entirely.\n- **Speaker Detection:** This feature uses the Pyannote AI model to identify distinct speakers in your audio and organizes the transcript accordingly. Choose the number of speakers if known, or select \"auto.\" Opting for \"none\" bypasses this step altogether, reducing the processing time by approximately half. However, the resultant transcript will be a continuous block of text without any indicators of speaker transitions.\n- **Overlapping Speech**: If enabled, noScribe attempts to mark instances where two people speak simultaneously. The overlapping section is demarcated with \u002F\u002Fdouble slashes\u002F\u002F. (Note: This is an experimental feature.)\n- **Disfluencies**: If enabled, common speech disfluencies like filler words (\"um\"), unfinished words or sentences, etc. will also be transcribed. Note that this is not a hard on\u002Foff switch, but more of a 'recommendation' for the transcription AI model which only works to some extent.   \n- **Timestamps**: When enabled, noScribe incorporates timestamps in the format [hh:mm:ss] into the transcript either at every change of speaker or every 60 seconds. I find these timestamps somewhat distracting, hence my decision to disable them by default. However, they can be quite useful in certain contexts. Even with timestamps disabled, determining the audio timecode for a specific segment is straightforward: simply open the transcript in the noScribe Editor, navigate through the text, and the corresponding timecode will appear in the bottom right corner of the app.\n\n### Transcription process\n- If you are ready, click the **Start**-button in the bottom left. **Cancel** will abort the process.\n- Be aware that **a one-hour interview can take up to three hours processing time** and will put a heavy load on your machine. Doing this on battery-power is not recommended.\n- A **progress indicator** at the bottom of the app will show how far you are into the whole process.\n- The **main window** (log tab) will show progress-messages and errors. It will also print the text of your interview during the last step of the transcription.\n- The transcript will be auto saved every few seconds under the given filename.\n- By default, noScribe produces an HTML-file. This can be opened in every common word editor (including MS Word, LibreOffice) or QDA-package (MAXQDA, ATLAS.ti, QualCoder...).\n- Before working with the transcript though, you should check it with the included editor. There will always be some errors.\n\n### Batch transcription \n(new in version 0.7)\n\n\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Fkaixxx_noScribe_readme_5dc269d65719.png\" width=\"800\">\n\n- The \"Queue\" tab in the main window shows a list of all jobs as well as their state and progress.\n- If you start a new job while another is still running, the new job will wait in the queue to be processed afterwards.\n- To start multiple jobs at once with the same settings, select as many files as you want in the audio file dialog. The output files will be named automatically. Use the \"Save transcript as\" dialog to select a different output folder if needed. Otherwise, the transcripts will be stored in the same folders as the audio. \n- The job buttons:\n    - `X` Deletes a job from the list or cancels a running one.\n    - `✔` Opens the transcript in the included editor. This also works for unfinished transcripts in case of an error or if the job was canceled by the user.\n    - `⟲` Restarts the job (only available in case of errors or cancelation).       \n\n## noScribeEdit\nThe included editor to check the final transcript.\n\n![The transcript in the noScribe Editor](https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Fkaixxx_noScribe_readme_05eb1f6f12bc.png)\n\nThe noScribe Editor is a separate app. It will open automatically once the transcript is finished, but can also be run independently from noScribe. It contains some handy features to check your finished transcript for errors and correct them:\n- Press **Ctrl + Spacebar** (^Space on Mac) or the **orange button in the toolbar** to hear the audio which corresponds to your current position in the text.\n- The **selection of the text will follow the audio that you hear**. If you want to **make changes,** click anywhere in the text with your mouse or use the arrow keys to move the cursor. The audio will stop, and you can edit the text.\n- You can also **stop the audio** by pressing Ctrl + Spacebar again or clicking the orange button.\n- If you want to **speed up or slow down the audio**, change the \"100%\"-field next to the \"Play\u002FPause Audio\"-Button to the appropriate speed.\n- To change the **speaker names,** use the Search & Replace feature, accessible from the magnifying glass icon or the Edit menu.\n- Use the plus and minus icons in the toolbar to **zoom in or out**\n- You will find the **most common features of a basic text editor** in the toolbar as well as in the menu at the top (basic text formatting, cut, copy & paste, undo & redo).\n- Your typical **hotkeys** will also work (e.g., Ctrl+S for Save, Ctrl+F for Find & Replace). You can see all the hotkeys if you open the menu. As already mentioned, 'Ctrl+Space' is the hotkey you'll use the most as it starts or pauses the audio.\n\n\nThe source code of the editor can be found here: [https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribeEditor](https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribeEditor)\n\n## Factors Influencing the Quality of the Transcription\n- A **good audio recording with clear voices and no ambient noise** is crucial for a high-quality transcription. Investing some effort in the quality of the recording will save you much time in the manual revision process later.\n- Whisper (the AI powering noScribe) understands around 60 different languages, but the quality of the transcription varies widely between them. **Spanish, Italian, English, Portuguese and German** are best supported (see [here for more info]( https:\u002F\u002Fgithub.com\u002Fopenai\u002Fwhisper#available-models-and-languages)).\n- Whisper handles **dialects** fairly well (e.g., Swiss-German), but the transcript might need more manual work in the revision.\n\n## Known Issues\n- The output of this software always needs to be checked for quality, misunderstandings, and wrong speaker diarization. This software is based on [OpenAI's Whisper model](https:\u002F\u002Fgithub.com\u002Fopenai\u002Fwhisper). Typical word error rates can be seen [here](https:\u002F\u002Fgithub.com\u002Fopenai\u002Fwhisper?tab=readme-ov-file#available-models-and-languages). See also [this paper](https:\u002F\u002Fdoi.org\u002F10.1145\u002F3576915.3624380) for a comparison of different transcription services and their errors.\n- Like any other large language model, the whisper model can sometimes **hallucinate**. This is especially prevalent in silent audio passages or when background noise is treated as \"text\" (see [this study from the Cornell University](https:\u002F\u002Ffacctconference.org\u002Fstatic\u002Fpapers24\u002Ffacct24-111.pdf) for more info about the issue). We use voice activity detection (VAD) to filter out sections without speech as best as possible. \n  More severely, users also reported cases where words were hallucinated that would fit syntactically into the context, but were actually not present in the original audio. Such errors are especially hard to catch.\n- **Names of people, places or organizations** are often transcribed with errors.  \n- The whisper AI can sometimes get **stuck in a loop of repeating text,** especially on longer audio files. If this happens, try to transcribe shorter sections (using the \"Start\" and \"Stop\" fields in noScribe), and join them manually.\n- **Multilingual audio** is now supported, but experimental. Sometimes it can happen that words in other languages than the main language are translated.\n- **Nonverbal expressions** like laughter are not included in the transcript and must be added later in the editor if you need them.\n- **Speaker diarization:** In some recordings, the AI used by noScribe may not be able to tell the voices of certain speakers apart, even if they sound quite different to the human ear. Check the results carefully.\n- It can happen that **punctuation and capitalization** are lost over time, especially in longer interviews. If you run into this issue, you can\n  - Try to transcribe shorter sections (using the \"Start\" and \"Stop\" fields in noScribe), and join them manually.\n  - Try to use another model, especially \"faster-whisper-large-v2\", which is less prone to this problem. You have to install this model first as described [in the Wiki](https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fwiki\u002FAdd-custom-Whisper-models-for-transcription).\n\n\n## Advanced Options\n\n- NoScribe now also includes a command line interface, ideal for scripting. Type in `noScribe.exe --help` for more information. You may also want to use the `--no-gui` option in scripting scenarios. \n- Config file: After the app has run for the first time, you will find a file named `config.yml` in the user config directory (on Windows: `C:\\Users\\\u003Cusername>\\AppData\\Local\\noScribe\\noScribe\\config.yml`; on Mac OS: `~\u002FLibrary\u002FApplication Support\u002Fnoscribe\u002Fconfig.yml`; on Linux: `~\u002F.config\u002FnoScribe\u002Fconfig.yml`). Here, you can change a few **extra settings**, e.g., the language of the user interface and model parameters.\n- Also in the user config directory you will find a folder named `log` with detailed log-files for every transcript (also unfinished ones). This can be helpful in the case of any errors. Be aware though that these files also contain the text of your transcripts which might include sensitive information.\n- If you want to use **custom whisper models** with noScribe, follow the [instructions in the Wiki](https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fwiki\u002FAdd-custom-Whisper-models-for-transcription).\n\n## Development and Contribution\n- I developed noScribe in python 3.12\n- I cannot host the whisper-models on GitHub because they are too large. There is a readme in the models-folder with instructions on how to get them.\n- I am happy to review tests, bug reports and pull requests (if my time allows it)\n\n### Translations\n- The noScribe UI has already been translated into many languages (thanks mlynar-czyk).\n- Since most of the translations have been created with ChatGPT, there will be problems. Please report any errors that you’ll find and make – if possible – a pull request with a better translation.\n- You will find the language files in the folder \"trans\".\n- If you change anything in the language files, make sure to follow the conventions of the YAML language.\n- If you want to change the language of the user interface, you have to change the value of the \"locale\" setting in the advanced settings (see above).\n\n## Other Software\nIf you are interested in open source software for the analysis of qualitative data, take a look at my other project [QualCoder](https:\u002F\u002Fgithub.com\u002Fccbogel\u002FQualCoder).\n\n\n\n\n\n\n","﻿# noScribe\n### 用于自动音频转录的尖端人工智能技术\n\u003C\u002Fbr>\n\n> [!NOTE]\n> ### 🚀 noScribe 的新官方网站：https:\u002F\u002Fnoscribe.de\n> 了解如何安装和使用该软件，并获取提升转录质量的技巧。\n>\n> 🌐 支持 **英语、德语、西班牙语、意大利语和荷兰语**。\n>\n> 请更新您的链接。\n\n---\n\n> [!WARNING]\n> 有人注册了域名 **noscribe(dot)ai**，用于销售转录服务。**请远离该平台，我与此事毫无关系。** 真正的 noScribe 是免费的，而且永远都会是免费的。这显然是利用我的软件及其多年来积累的声誉牟利的行为。令人非常遗憾。\n\n## 什么是 noScribe？\n- 一款用于为定性社会研究或新闻报道生成 **高质量访谈记录** 的应用程序\n- noScribe 是 **免费且开源** 的（[GPL-3.0](https:\u002F\u002Fwww.gnu.org\u002Flicenses\u002Fgpl-3.0.html)），适用于 Windows、MacOS 和 Linux\n- 它在您的计算机上 **完全本地运行**，从而保护您访谈的机密性。无需云端，无后顾之忧\n- 它能够区分不同的 **说话人**，并支持约 60 种语言（多寡不一，详见下文）\n- 它内置了一个 **优秀的编辑器**，用于审查、核对和修正生成的文本\n- 它基于多项前沿技术：[OpenAI 的 Whisper](https:\u002F\u002Fgithub.com\u002Fopenai\u002Fwhisper)、[Guillaume Klein 的 faster-whisper](https:\u002F\u002Fgithub.com\u002Fguillaumekln\u002Ffaster-whisper) 以及 [Hervé Bredin 的 pyannote](https:\u002F\u002Fgithub.com\u002Fpyannote\u002Fpyannote-audio)\n\n\u003C\u002Fbr>\n\n![主界面](https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Fkaixxx_noScribe_readme_1022ee3fdd91.png)\n（该转录来自我于 2022 年 5 月与俄罗斯社会学家 Natalia Savelyeva 进行的访谈。）\n\n## 局限性\n- 由于包含大量 AI 模型，下载文件较大（数 GB）。\n- 请注意，根据您的设备性能，转录一小时的访谈可能需要长达三小时。\n- 音质较差或背景噪音过大的录音会导致转录效果不佳。\n- 没有任何自动转录工具能做到完美，通常仍需人工校对。请务必使用 [内置编辑器](#noscribeedit) 对转录内容进行全面检查。（另请参阅下方的“影响转录质量的因素”和“已知问题”。）\n\n如果您想了解更多，且能看懂德语，帕德博恩大学的 Rebecca Schmidt 撰写了一篇关于 noScribe 的精彩 **评测**，其中也讨论了其局限性。此外，德国知名计算机杂志 c't 也在近期的一篇评测中推荐了 noScribe：[https:\u002F\u002Fwww.heise.de\u002Fselect\u002Fct\u002F2025\u002F2\u002F2433207582191637980](https:\u002F\u002Fwww.heise.de\u002Fselect\u002Fct\u002F2025\u002F2\u002F2433207582191637980)。\n\n## 为什么叫“noScribe”？\n根据 [Urban Dictionary](https:\u002F\u002Fwww.urbandictionary.com\u002Fdefine.php?term=Scribe)，**scribe** 被定义为 *“一个将自己悲惨的人生全部简化为学术上的苦闷与痛苦的人”*。我希望这款软件能让您的学术生活少一些痛苦与烦闷，因此得名 noScribe :)\n\n## 关于我\n**Kai Dröge**，社会学博士（兼具计算机科学背景），定性研究者及教师，现任瑞士卢塞恩应用科学大学 [卢塞恩应用科学大学](https:\u002F\u002Fwww.hslu.ch\u002Fde-ch\u002Fhochschule-luzern\u002Fueber-uns\u002Fpersonensuche\u002Fprofile\u002F?pid=823) 教师，同时也在德国法兰克福社会研究所 [法兰克福社会研究所](https:\u002F\u002Fwww.ifs.uni-frankfurt.de\u002Fpersonendetails\u002Fkai-droege.html) 工作。\n\n## 捐赠\nnoScribe 始终保持免费。然而，开发它确实需要资金投入。我曾购买硬件用于测试，并每年向 Apple 支付开发者证书费用。如果您愿意支持该项目，欢迎通过 Ko-Fi 进行捐赠。感谢！\n[![ko-fi](https:\u002F\u002Fko-fi.com\u002Fimg\u002Fgithubbutton_sm.svg)](https:\u002F\u002Fko-fi.com\u002Fnoscribe)\n\n## 下载与安装\n\n**当前版本号：0.7**（参见 [变更日志](CHANGELOG.md)）\n> 所有发布版本均托管于 SWITCHdrive，这是一个面向瑞士高校的安全数据共享平台。\n\n### Windows\n\u003Cdetails>\n    \u003Csummary>点击展开\u003C\u002Fsummary>\n\n- **下载：**\n    - 适用于普通 PC（无 NVIDIA 显卡）的 **通用版本**：[https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FWindows%2Fnormal](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FWindows%2Fnormal) \n    - 使用 **NVIDIA 显卡 CUDA 加速** 的特殊版本（显存至少 6 GB）：[https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FWindows%2Fcuda](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FWindows%2Fcuda)。请确保您的 NVIDIA 驱动程序版本为 570.65 或更高。此外，您还需从 [此处](https:\u002F\u002Fdeveloper.nvidia.com\u002Fcuda-downloads?target_os=Windows) 安装 CUDA 工具包（安装后需重启）。\n- **安装：**\n    - 启动下载的安装程序。此过程可能较慢，请耐心等待。\n    - 如果出现“Windows 保护了您的电脑”的警告，并提示该应用来自“未知发布者”，请信任我们并点击“仍要运行”。\n    - 若要在多台计算机上进行静默安装，可在启动安装程序时添加参数 `\u002FS`。\n- **已知问题：**\n    - 目前看来，RTX\u002FGTX 1XXX 系列显卡已不再受 CUDA 支持。建议改用普通版本。\n    - 如果遇到以下错误信息：“转录工作进程意外退出（代码 3221226505）”，请尝试强制使用 CPU 进行转录，而非显卡。虽然速度较慢，但更为可靠。操作步骤如下：关闭 noScribe；用文本编辑器打开文件 `C:\\Users\\\u003CUSERNAME>\\AppData\\Local\\noScribe\\noScribe\\config.yml`；将 `force_whisper_cpu` 的值改为 `'True'`，注意引号不可省略。保存后重新启动 noScribe。\n\n\u003C\u002Fdetails>\n\n### macOS\n\u003Cdetails>\n    \u003Csummary>点击展开\u003C\u002Fsummary>\n\n由 [gernophil](https:\u002F\u002Fgithub.com\u002Fgernophil) 移植 \u003C\u002Fbr>\n\n- **配备 Apple Silicon M1-M4 处理器且运行 macOS 14 或更高版本的新款 Mac**\n    - 下载地址：[https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FmacOS%2FApple%20Silicon](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FmacOS%2FApple%20Silicon)\n    - 双击下载的 .dmg 文件，然后将 noScribe 和 noScribeEdit 拖拽到应用程序文件夹的快捷方式处（标有“将两者拖至此处以安装”）。\n    - 由于其中一个组件（ffmpeg）仍为 Intel CPU 构建，您需要使用 Apple 的 Rosetta2 Intel 模拟器。如果尚未安装，请按以下步骤操作：\n        - 打开终端（位于 `\u002FApplications\u002FUtilities\u002FTerminal.app`）。\n        - 输入 `softwareupdate --install-rosetta` 或 `softwareupdate --install-rosetta --agree-to-license`。\n        - 按下回车键，并按照屏幕上的指示完成安装。\n    - 双击应用程序文件夹中的 noScribe 和\u002F或 noScribeEdit 图标即可启动。\n\n- **配备 Intel 处理器的老款 Mac**\n    - **注意：目前 0.7 版本尚不适用于基于 Intel 的 Mac**，原因是与最新版 pyannote 不兼容。不过，您可以使用 0.6 版本：[https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fapps\u002Ffiles\u002F?dir=\u002FnoScribe\u002FnoScribe%20releases\u002FnoScribe%20vers.%200.6\u002FmacOS\u002Fx86_64%20(Intel)&fileid=8266174681](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fapps\u002Ffiles\u002F?dir=\u002FnoScribe\u002FnoScribe%20releases\u002FnoScribe%20vers.%200.6\u002FmacOS\u002Fx86_64%20(Intel)&fileid=8266174681)。\n    - 更早的版本也可在此处获取：[https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j)。\n    - 遗憾的是，我们无法正确签名 x86_64 软件包，因此系统会提示 noScribe 和 noScribeEdit 来自未注册开发者。如果您的 Gatekeeper 功能已启用，您必须手动允许运行这些程序。具体步骤如下：\n    - 双击下载的 .dmg 文件。\n    - 将 noScribe 和 noScribeEdit 拖拽到应用程序文件夹的快捷方式处（标有“将两者拖至此处以安装”）。\n    - 双击应用程序文件夹中的 noScribe 图标启动程序，此时会出现错误提示，表明该程序来自未注册开发者。对 noScribe Editor 重复此操作。\n    - 前往“系统设置”→“隐私与安全性”，向下滚动至显示 noScribe 被阻止启动的消息，然后点击“仍要打开”。同样地，对 noScribe Editor 也执行此操作。\n    - 从今以后，这两个程序应能正常启动。\n\n\u003C\u002Fdetails>\n\n### Linux\n\u003Cdetails>\n    \u003Csummary>点击展开\u003C\u002Fsummary>\n\n由 [Eckhard Kadasch](https:\u002F\u002Fgithub.com\u002Feckhrd) 和 [Florian Dobener](https:\u002F\u002Fgithub.com\u002Fdomna) 移植；可执行文件由 [gernophil](https:\u002F\u002Fgithub.com\u002Fgernophil) 生成。\n\n#### 可执行文件安装\n\n  - 在此处下载 noScribe 0.7 的 CUDA 或 CPU 版本 for Linux：[https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FLinux](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FLinux)\n    - 有报告称，某些系统上 noScribe 0.7 for Linux 存在问题。如果您使用的系统无法运行 noScribe 0.7 for Linux，请暂时使用 noScribe 0.6.2，直到我们解决这些问题：[https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.6%2FLinux](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.6%2FLinux)\n  - 使用终端命令 `tar -xzvf noScribe_0.7.0_cpu_linux_amd64.tar.gz` 或 `tar -xzvf noScribe_0.7.0_cuda_linux_amd64.tar.gz` 解压文件。\n  - 进入 noScribe 文件夹并执行 `.\u002FnoScribe` 启动程序。\n  - 可选：使用文本编辑器编辑 `noScribe.desktop` 和 `noScribeEdit.desktop` 文件，在以 `Exec=` 和 `Icon=` 开头的行中填写完整路径。\n\n#### 从源代码手动安装\n\n更多信息请参阅 [此讨论](https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fdiscussions\u002F83)。\n\n如需从源代码安装，您需要 `git` 和 `git-lfs` 以获取所有必要组件。最新的源代码可直接从仓库中拉取。如果您希望安装特定版本，请使用上述可执行文件安装方法。\n\n```bash\ngit clone https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe.git\n\n# 之后的所有操作都在 noScribe 目录中进行。\ncd noScribe\n\n# 安装 noScribeEdit\nrm -rf noScribeEdit\u002F\ngit clone https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribeEditor.git noScribeEdit\n\n# 创建 Python 虚拟环境。\npython3 -m venv venv\n\n# 执行此命令后，将使用虚拟环境中安装的 Python。此外，所有软件包也将安装于此。每次启动 noScribe 时，请务必先运行此命令。\nsource venv\u002Fbin\u002Factivate\n\n# 安装必要的依赖项。\npip install -r environments\u002Frequirements_linux.txt\npip install -r noScribeEdit\u002Fenvironments\u002Frequirements.txt\n\n# 下载模型文件。这里同时下载了精确模型和快速模型。如果您只使用其中一种，则只需下载相应的模型即可。\nrm -rf models\u002Ffast\nrm -rf models\u002Fprecise\ngit clone https:\u002F\u002Fhuggingface.co\u002Fmukowaty\u002Ffaster-whisper-int8 models\u002Ffast\ngit clone https:\u002F\u002Fhuggingface.co\u002Fmobiuslabsgmbh\u002Ffaster-whisper-large-v3-turbo models\u002Fprecise\n\n# 运行 noScribe。\npython3 .\u002FnoScribe.py\n```\n\n\u003C\u002Fdetails>\n\n### 旧版本：\n- [https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j)\n\n## 引用（APA 格式）\nDröge, K. (2025). noScribe. 基于人工智能的音频转录（版本 XXX）[计算机软件]。https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\n\n## 使用\n\n### 设置\n\u003Cimg align=\"left\" src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Fkaixxx_noScribe_readme_f3b724080966.png\" width=\"300\">\n\n- 选择您的 **音频文件**。NoScribe 支持几乎任何音频或视频格式。您也可以一次选择多个文件进行 [批量转录](#batch-transcription)。\n- 选择转录文件的 **文件名**。您还可以选择文件类型：*.html 是默认格式，也受 noScribe 编辑器支持。*.vtt 是一种视频字幕格式，如果您想将转录导入 [EXMARaLDA](https:\u002F\u002Fexmaralda.org\u002F) 进行进一步标注，则尤为有用。*.txt 会将转录导出为纯文本。\n- **开始** 和 **停止** 接受 hh:mm:ss 格式的时间戳。使用此功能可将转录限制在录音的特定部分。这在正式转录整个访谈之前，先用一个小样本测试设置时特别有帮助，因为完整访谈的转录可能需要数小时。如果您想转录到音频文件的结尾，请将 **停止** 留空。\n- **语言**：选择转录的语言。将其设置为“自动”以检测语言，或选择“多语言”，如果您的音频包含多种语言（实验性功能）。\n- **质量**：“精确”是推荐设置，可获得最准确的转录。在较慢的设备上，您可以选择“快速”选项。这样速度更快，但后续可能需要更多手动校对。您还可以 [安装自定义模型](https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fwiki\u002FAdd-custom-Whisper-models-for-transcription)，这些模型针对特定语言等进行了微调。\n- **标记停顿**：启用后，音频中无人声的部分将被标记为停顿。停顿会以圆括号表示，每秒一个点，例如，两秒的停顿写为“(..)”。超过10秒的停顿则会写成“(XX 秒停顿)”或“(XX 分钟停顿)”。您可以选择标记持续1秒及以上的停顿（“1sec+”）、2秒及以上的停顿（“2sec+”），或仅标记3秒及以上的较长停顿（“3sec+”）。选择“无”可完全禁用此功能。\n- **说话人检测**：此功能使用 Pyannote AI 模型来识别音频中的不同说话人，并相应地组织转录内容。如果已知说话人数，请选择相应的数字；否则可选择“自动”。选择“无”则会跳过此步骤，从而将处理时间缩短约一半。不过，生成的转录将是一整段连续文本，不会显示说话人切换的标志。\n- **重叠语音**：启用后，noScribe 会尝试标记两人同时说话的情况。重叠部分会用双斜杠 \u002F\u002F 来分隔。（注：此功能尚处于实验阶段。）\n- **语流不畅**：启用后，常见的语流不畅现象，如填充词（“嗯”）、未完成的词语或句子等，也会被转录出来。请注意，这不是一个简单的开关，而更像是对转录 AI 模型的一种“建议”，其效果有限。\n- **时间戳**：启用后，noScribe 会在转录中加入 [hh:mm:ss] 格式的时间戳，可以是每次说话人切换时，也可以每60秒插入一次。我个人觉得这些时间戳有些分散注意力，因此默认将其关闭。不过，在某些情况下它们确实很有用。即使时间戳被关闭，要确定特定片段的音频时间码也很简单：只需在 noScribe 编辑器中打开转录文件，浏览文本，相应的时间码就会显示在应用程序右下角。\n\n### 转录流程\n- 如果准备就绪，请点击左下角的 **开始** 按钮。**取消** 将终止该进程。\n- 请注意，**一小时的访谈可能需要长达三小时的处理时间**，并且会对您的设备造成较大负担。不建议在电池供电状态下进行此操作。\n- 应用程序底部的 **进度指示器** 会显示当前处理进度。\n- **主窗口**（日志标签）会显示进度信息和错误提示。在转录的最后一步，它还会打印您的访谈文本。\n- 转录文件会每隔几秒钟自动保存一次，保存为指定的文件名。\n- 默认情况下，noScribe 会生成 HTML 文件。该文件可以在任何常用的文字编辑器（包括 MS Word、LibreOffice）或定性数据分析软件包（MAXQDA、ATLAS.ti、QualCoder 等）中打开。\n- 不过，在使用转录文件之前，您应该先用内置编辑器检查一下。通常总会存在一些错误。\n\n### 批量转录\n（版本 0.7 中新增）\n\n\u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Fkaixxx_noScribe_readme_5dc269d65719.png\" width=\"800\">\n\n- 主窗口中的“队列”选项卡会显示所有任务的列表及其状态和进度。\n- 如果您在另一个任务仍在运行时启动新任务，新任务将排队等待后续处理。\n- 若要使用相同设置同时启动多个任务，请在音频文件对话框中选择任意数量的文件。输出文件将自动命名。如有需要，可使用“另存转录为”对话框选择不同的输出文件夹。否则，转录文件将与音频文件存储在同一文件夹中。\n- 任务按钮：\n    - `X` 从列表中删除任务，或取消正在运行的任务。\n    - `✔` 在内置编辑器中打开转录文件。即使任务因错误或用户取消而未完成，也可通过此按钮查看转录内容。\n    - `⟲` 重新启动任务（仅在出现错误或任务被取消时可用）。\n\n## noScribe编辑器\n随附的编辑器用于检查最终的转录文本。\n\n![noScribe编辑器中的转录文本](https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Fkaixxx_noScribe_readme_05eb1f6f12bc.png)\n\nnoScribe编辑器是一个独立的应用程序。转录完成后它会自动打开，但也可以不依赖noScribe单独运行。它包含一些实用功能，可以帮助您检查已完成的转录文本中的错误并进行修正：\n- 按下 **Ctrl + 空格键**（Mac上为^Space）或点击 **工具栏中的橙色按钮**，即可播放与当前文本位置对应的音频。\n- 当前选中的文本会跟随您听到的音频同步移动。如果您想 **进行修改**，可以用鼠标点击文本中的任意位置，或者使用方向键移动光标。此时音频会停止，您可以开始编辑文本。\n- 您也可以再次按下 Ctrl + 空格键或点击橙色按钮来 **停止音频**。\n- 如果您想 **加快或减慢音频速度**，可以将“播放\u002F暂停音频”按钮旁边的“100%”选项调整为合适的速率。\n- 若要更改 **发言者姓名**，请使用搜索和替换功能，可通过放大镜图标或“编辑”菜单访问。\n- 使用工具栏中的加号和减号图标可以 **放大或缩小视图**。\n- 工具栏以及顶部菜单中还提供了 **基本文本编辑器的常用功能**（如基本文本格式化、剪切、复制与粘贴、撤销与重做）。\n- 您常用的 **快捷键** 也适用（例如，Ctrl+S 保存，Ctrl+F 查找与替换）。所有快捷键都可以在菜单中查看。正如之前提到的，‘Ctrl+Space’ 是您最常使用的快捷键，因为它可以启动或暂停音频。\n\n该编辑器的源代码可在以下链接找到：[https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribeEditor](https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribeEditor)\n\n## 影响转录质量的因素\n- 一份 **清晰的人声且无环境噪声的良好录音** 对高质量的转录至关重要。在录音质量上多花些功夫，将为您后续的手动校对节省大量时间。\n- Whisper（驱动noScribe的AI）能够理解约60种不同的语言，但不同语言之间的转录质量差异较大。其中，**西班牙语、意大利语、英语、葡萄牙语和德语** 的支持效果最佳（更多信息请参见 [此处]( https:\u002F\u002Fgithub.com\u002Fopenai\u002Fwhisper#available-models-and-languages)）。\n- Whisper 对 **方言** 的处理能力较好（例如瑞士德语），但在后期校对时可能仍需较多人工干预。\n\n## 已知问题\n- 本软件的输出结果始终需要检查其质量、是否存在误解以及发言者区分是否正确。该软件基于 [OpenAI的Whisper模型](https:\u002F\u002Fgithub.com\u002Fopenai\u002Fwhisper)。典型的词错误率可参考 [这里](https:\u002F\u002Fgithub.com\u002Fopenai\u002Fwhisper?tab=readme-ov-file#available-models-and-languages)。另请参阅 [这篇论文](https:\u002F\u002Fdoi.org\u002F10.1145\u002F3576915.3624380)，其中对比了不同转录服务及其错误情况。\n- 与其他大型语言模型一样，Whisper 模型有时也会出现 **幻觉** 现象。这种情况在静音片段或背景噪音被误认为“文本”时尤为常见（更多信息请参见 [康奈尔大学的一项研究](https:\u002F\u002Ffacctconference.org\u002Fstatic\u002Fpapers24\u002Ffacct24-111.pdf)）。我们使用语音活动检测（VAD）尽可能过滤掉无语音的部分。\n- 更严重的是，用户还报告了一些案例，即模型生成了在语法上符合上下文但实际上并未出现在原始音频中的词语。这类错误尤其难以发现。\n- **人名、地名或组织名称** 往往会被错误转录。\n- Whisper AI 有时可能会陷入 **重复文本的循环**，尤其是在较长的音频文件中。如果发生这种情况，建议将音频分成较短的段落进行转录（使用 noScribe 中的“开始”和“结束”字段），然后手动拼接。\n- **多语言音频** 目前已支持，但仍处于实验阶段。有时可能会出现将主语言之外的其他语言词汇翻译出来的情况。\n- **非语言表达**（如笑声）不会包含在转录文本中，如果您需要这些内容，则必须在编辑器中手动添加。\n- **发言者区分**：在某些录音中，noScribe 所使用的 AI 即使在人类听觉上能明显区分不同说话者的声音，也可能无法准确识别。请仔细检查结果。\n- 长时间后，尤其是较长的访谈中，可能会出现 **标点符号和大小写丢失** 的情况。如果遇到此类问题，您可以：\n  - 将音频分成较短的段落进行转录（使用 noScribe 中的“开始”和“结束”字段），然后手动拼接。\n  - 尝试使用其他模型，特别是“faster-whisper-large-v2”，该模型较少出现此类问题。您需要先按照 [维基中的说明](https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fwiki\u002FAdd-custom-Whisper-models-for-transcription) 安装此模型。\n\n## 高级选项\n\n- noScribe 现在也提供命令行界面，非常适合脚本编写。输入 `noScribe.exe --help` 可获取更多信息。在脚本场景中，您还可以使用 `--no-gui` 选项。\n- 配置文件：应用程序首次运行后，您会在用户配置目录中找到名为 `config.yml` 的文件（Windows 上为 `C:\\Users\\\u003C用户名>\\AppData\\Local\\noScribe\\noScribe\\config.yml`；Mac OS 上为 `~\u002FLibrary\u002FApplication Support\u002Fnoscribe\u002Fconfig.yml`；Linux 上为 `~\u002F.config\u002FnoScribe\u002Fconfig.yml`）。在这里，您可以更改一些 **额外设置**，例如用户界面的语言和模型参数。\n- 在用户配置目录中，您还会找到一个名为 `log` 的文件夹，其中包含每份转录（包括未完成的转录）的详细日志文件。这在出现任何错误时非常有帮助。不过请注意，这些文件中也包含您的转录文本，可能涉及敏感信息。\n- 如果您希望在 noScribe 中使用 **自定义的 Whisper 模型**，请按照 [维基中的说明](https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fwiki\u002FAdd-custom-Whisper-models-for-transcription)操作。\n\n## 开发与贡献\n- 我使用 Python 3.12 开发了 noScribe。\n- 由于 Whisper 模型体积过大，我无法将其托管在 GitHub 上。模型文件夹中有一份 README 文件，其中提供了获取这些模型的说明。\n- 如果时间允许，我很乐意审阅测试、错误报告和拉取请求。\n\n### 翻译\n- noScribe 的用户界面已经被翻译成多种语言（感谢 mlynar-czyk）。\n- 由于大部分翻译都是使用 ChatGPT 生成的，可能会存在一些问题。如果您发现任何错误，请及时报告，并尽可能提交一个包含更佳翻译的拉取请求。\n- 语言文件位于“trans”文件夹中。\n- 如果您对语言文件进行修改，请务必遵循 YAML 语言的规范。\n- 如果您想更改用户界面的语言，需要在高级设置中修改“locale”配置项的值（见上文）。\n\n## 其他软件\n如果您对用于定性数据分析的开源软件感兴趣，可以查看我的另一个项目 [QualCoder](https:\u002F\u002Fgithub.com\u002Fccbogel\u002FQualCoder)。","# noScribe 快速上手指南\n\nnoScribe 是一款免费开源的本地化 AI 音频转录工具，专为定性社会研究和新闻采访设计。它基于 Whisper、faster-whisper 和 pyannote-audio 构建，支持约 60 种语言，能够区分不同说话人，且所有数据均在本地处理，无需上传云端，确保隐私安全。\n\n## 环境准备\n\n### 系统要求\n- **操作系统**：Windows、macOS (Intel\u002FApple Silicon) 或 Linux\n- **硬件建议**：\n  - **通用版**：适用于无独立显卡或显存小于 6GB 的机器（使用 CPU 运算，速度较慢）。\n  - **加速版**：适用于配备 NVIDIA 显卡（显存 ≥ 6GB）的机器。需安装 CUDA Toolkit 并确保驱动版本 ≥ 570.65。\n  - **macOS**：M1-M4 芯片需安装 Rosetta 2（用于兼容 ffmpeg 组件）；Intel 芯片用户建议使用 v0.6 版本。\n- **存储空间**：由于包含大型 AI 模型，安装包体积较大（数 GB），请确保磁盘空间充足。\n\n### 前置依赖\n- **Windows\u002FLinux (NVIDIA 加速版)**：需预先安装 [CUDA Toolkit](https:\u002F\u002Fdeveloper.nvidia.com\u002Fcuda-downloads)。\n- **macOS (Apple Silicon)**：需在终端运行 `softwareupdate --install-rosetta` 安装 Rosetta 2。\n- **源码安装 (Linux)**：需安装 `git`, `git-lfs`, `python3`。\n\n> **注意**：目前官方未提供中国镜像源，下载模型和安装包需访问国际网络。若从源码安装，克隆 HuggingFace 模型时如遇网络问题，建议配置本地代理或使用国内 HuggingFace 镜像站。\n\n## 安装步骤\n\n### 方式一：直接使用预编译包（推荐）\n\n#### Windows\n1. 根据硬件选择下载：\n   - **普通 PC**：[下载链接 (Normal)](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FWindows%2Fnormal)\n   - **NVIDIA 显卡 (≥6GB VRAM)**：[下载链接 (CUDA)](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FWindows%2Fcuda)\n2. 运行下载的 setup 文件。\n   - 若出现\"Windows 保护了你的电脑”提示，点击“更多信息” -> “仍要运行”。\n   - 静默安装参数：`\u002FS`。\n\n#### macOS\n1. **Apple Silicon (M1-M4)**:\n   - [下载链接](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FmacOS%2FApple%20Silicon)\n   - 双击 `.dmg` 文件，将 `noScribe` 和 `noScribeEdit` 拖入 Applications 文件夹。\n2. **Intel 芯片**:\n   - 当前 v0.7 不支持 Intel Mac，请下载 [v0.6 版本](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fapps\u002Ffiles\u002F?dir=\u002FnoScribe\u002FnoScribe%20releases\u002FnoScribe%20vers.%200.6\u002FmacOS\u002Fx86_64%20(Intel)&fileid=8266174681)。\n   - 首次运行若被拦截，请前往 `系统设置` -> `隐私与安全性`，点击“仍要打开”。\n\n#### Linux\n1. 下载对应版本（CPU 或 CUDA）：[Linux 下载页](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.7%2FLinux)\n2. 解压并运行：\n   ```bash\n   # 解压文件 (根据实际文件名调整)\n   tar -xzvf noScribe_0.7.0_cpu_linux_amd64.tar.gz\n   \n   # 进入目录并启动\n   cd noScribe\n   .\u002FnoScribe\n   ```\n\n### 方式二：从源码安装 (仅限 Linux\u002F高级用户)\n\n```bash\n# 1. 克隆主项目\ngit clone https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe.git\ncd noScribe\n\n# 2. 克隆编辑器组件\nrm -rf noScribeEdit\u002F\ngit clone https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribeEditor.git noScribeEdit\n\n# 3. 创建 Python 虚拟环境\npython3 -m venv venv\nsource venv\u002Fbin\u002Factivate\n\n# 4. 安装依赖\npip install -r environments\u002Frequirements_linux.txt\npip install -r noScribeEdit\u002Fenvironments\u002Frequirements.txt\n\n# 5. 下载 AI 模型 (需 git-lfs)\nrm -rf models\u002Ffast models\u002Fprecise\ngit clone https:\u002F\u002Fhuggingface.co\u002Fmukowaty\u002Ffaster-whisper-int8 models\u002Ffast\ngit clone https:\u002F\u002Fhuggingface.co\u002Fmobiuslabsgmbh\u002Ffaster-whisper-large-v3-turbo models\u002Fprecise\n\n# 6. 启动程序\npython3 .\u002FnoScribe.py\n```\n\n## 基本使用\n\n1. **启动程序**：运行安装好的 `noScribe` 应用程序。\n2. **加载音频**：\n   - 在主界面选择音频或视频文件（支持批量选择）。\n   - 支持几乎所有主流音视频格式。\n3. **配置参数**：\n   - **转录文件命名**：设置输出文件名及格式。\n   - **语言选择**：指定音频语言（支持约 60 种），或设为自动检测。\n   - **说话人分离**：启用此功能可自动区分不同发言者（基于 pyannote）。\n4. **开始转录**：\n   - 点击开始按钮。根据硬件性能，1 小时音频可能需要 1-3 小时处理时间。\n   - 进度条会显示当前状态。\n5. **编辑与校对**：\n   - 转录完成后，程序会自动调用 **noScribe Editor**。\n   - 在编辑器中听取音频片段，核对文本，修正识别错误，并调整说话人标签。\n   - 导出最终文本（支持 TXT, DOCX 等格式）。\n\n> **提示**：自动转录无法达到 100% 准确，尤其是背景噪音较大时。务必使用内置编辑器进行人工复核以保证质量。","社会学研究员李博士正在处理一项关于移民社区的质性研究，需要将 20 小时的多语言深度访谈录音转化为可分析的文字稿。\n\n### 没有 noScribe 时\n- **隐私泄露风险高**：不得不将敏感的访谈录音上传至云端转录服务，担心受访者隐私数据被第三方留存或滥用。\n- **人工听写效率极低**：依靠人工逐字听写，每小时录音需耗费 4-5 小时，且难以区分重叠发言的不同受访者。\n- **多语言处理困难**：面对混合了德语、英语和阿拉伯语的对话，通用工具识别率骤降，需手动查阅词典修正大量错误。\n- **后期整理繁琐**：获得的纯文本缺乏说话人标签，重新梳理“谁说了什么”需要反复回听录音，极易出错。\n\n### 使用 noScribe 后\n- **数据完全本地化**：noScribe 在李博士的笔记本电脑上离线运行，所有音频与转录稿均保留在本地，彻底消除隐私顾虑。\n- **自动区分说话人**：内置的 pyannote 模型自动识别并标记不同受访者（如“说话人 A\"、“说话人 B\"），大幅减少人工分拣时间。\n- **高精度多语言支持**：基于 Whisper 模型，noScribe 准确捕捉了三种语言的混合对话，即使带有口音也能保持高识别率。\n- **一体化编辑体验**：直接在 noScribe 自带的编辑器中对照波形图校对，快速修正个别误差，即刻生成可用于编码分析的标准文稿。\n\nnoScribe 通过本地化的高精度自动转录与说话人分离技术，将研究员从繁重的听写工作中解放出来，同时严守了质性研究最核心的数据伦理底线。","https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Fkaixxx_noScribe_1022ee3f.png","kaixxx",null,"https:\u002F\u002Foss.gittoolsai.com\u002Favatars\u002Fkaixxx_bc732b71.png","https:\u002F\u002Fgithub.com\u002Fkaixxx",[81,85,89,93,97],{"name":82,"color":83,"percentage":84},"Python","#3572A5",89.1,{"name":86,"color":87,"percentage":88},"HTML","#e34c26",10.5,{"name":90,"color":91,"percentage":92},"Batchfile","#C1F12E",0.2,{"name":94,"color":95,"percentage":96},"Shell","#89e051",0.1,{"name":98,"color":99,"percentage":96},"Dockerfile","#384d54",1861,319,"2026-04-17T17:09:01","GPL-3.0","Windows, macOS, Linux","非必需。若使用 NVIDIA GPU 加速（CUDA 版本），需显存至少 6GB，驱动版本 570.65 或更高，并安装 CUDA Toolkit。RTX\u002FGTX 1xxx 系列显卡可能不支持 CUDA 加速，建议使用 CPU 版本。Mac Apple Silicon (M1-M4) 支持原生加速。","未说明",{"notes":108,"python":109,"dependencies":110},"软件完全本地运行以保护隐私。安装包体积较大（数 GB），因包含 AI 模型。转录速度较慢，1 小时音频可能需要长达 3 小时处理时间。macOS Apple Silicon 用户需安装 Rosetta2 以兼容 ffmpeg 组件。旧版 Intel Mac 暂不支持 v0.7 版本。若遇到特定错误代码，可强制切换至 CPU 模式运行。","未说明（源码安装需 python3）",[111,112,113,114,115],"faster-whisper","pyannote-audio","ffmpeg","git","git-lfs",[18,21],[118,119,120,121,122,111],"audio-transcription","interview","pyannote","qualitative-research","transcription","2026-03-27T02:49:30.150509","2026-04-19T03:03:46.080793",[126,131,136,141,146,151,156],{"id":127,"question_zh":128,"answer_zh":129,"source_url":130},41536,"如何在 macOS 上运行 noScribe？","目前已有由 gernophil 创建的 macOS Alpha 版本可用。您可以从以下地址下载：https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Ftree\u002FmacOS。注意：该版本的 Gatekeeper 限制可能不如正式版严格，如果遇到启动问题，可能需要调整安全设置。","https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fissues\u002F14",{"id":132,"question_zh":133,"answer_zh":134,"source_url":135},41537,"遇到'Torch not compiled with CUDA enabled'错误怎么办？","这通常意味着配置未正确启用 GPU。请尝试以下步骤：\n1. 下载并安装最新的安装包（支持 CUDA 的版本）。\n2. 删除或重命名现有的 config.yml 文件。\n3. 启动 noScribe，执行一次简短的转录后关闭程序。\n4. 检查新生成的 config.yml，确保 'pyannote_xpu' 和 'whisper_xpu' 选项均设置为 'cuda'（如果之前默认为 'cpu'）。\n如果问题依旧，请确认已安装对应版本的 CUDA Toolkit（如 12.9）。","https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fissues\u002F79",{"id":137,"question_zh":138,"answer_zh":139,"source_url":140},41538,"noScribe 默认使用 CPU 而不是 NVIDIA 显卡，如何强制使用 GPU？","需要在配置文件中进行设置。首先确保安装了兼容的 speechbrain 版本（建议使用 0.5.16，因为 1.0.0 版本可能存在兼容性问题）。然后在配置中将处理单元（xpu）设置为 cuda。如果使用的是多显卡系统（如同时有 ATI 和 NVIDIA），请确保驱动程序正确安装且 CUDA 版本匹配（如 12.2 或 12.3）。","https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fissues\u002F34",{"id":142,"question_zh":143,"answer_zh":144,"source_url":145},41539,"在 Ubuntu 22.04 上运行时提示'GLIBC_2.36' not found 或出现核心转储（core dump）如何解决？","noScribe 依赖 glibc 2.36，而 Ubuntu 22.04 默认版本为 2.35。虽然可以尝试手动安装 glibc 2.36，但这可能导致不稳定。建议优先尝试官方提供的预编译版本或更新后的安装包，社区反馈表明相关问题已在后续讨论和版本更新中得到解决。如果必须手动操作，可使用 LD_PRELOAD 指定路径，但需注意浮点异常风险。","https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fissues\u002F185",{"id":147,"question_zh":148,"answer_zh":149,"source_url":150},41540,"转录时提示'Invalid file type'或文件名中缺少扩展名怎么办？","这是一个已知问题，特别是在文件名包含空格或特殊字符时。解决方法包括：\n1. 确保输入的文件名包含正确的扩展名。\n2. 如果保存文件时扩展名未自动显示，可以尝试多次输入文件名直到扩展名出现。\n3. 注意：在 macOS 上无法自定义扩展名，系统会根据选择自动添加 .html、.txt 或 .vtt。如果选择 .doc，可能会生成 .doc.html 文件。该问题已在代码提交 8887293 中修复。","https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fissues\u002F254",{"id":152,"question_zh":153,"answer_zh":154,"source_url":155},41541,"GeForce RTX 系列显卡（如 5060 Ti）上 PyAnnote 报错且 Whisper 无响应，如何处理？","这通常是因为缺少针对特定 CUDA 版本优化的安装包。维护者已发布专门支持 CUDA 的新版本（例如 cuda1 分支）。请下载最新的支持 CUDA 的安装包（链接通常在 Issue 评论中提供），并安装匹配的 CUDA Toolkit（如 12.9）。测试显示，启用 CUDA 后转录速度可从 45 分钟提升至 2 分钟左右（针对 39 分钟音频）。","https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fissues\u002F175",{"id":157,"question_zh":158,"answer_zh":159,"source_url":160},41542,"转录结果中出现字幕持续时间为负数（negative duration）是怎么回事？","这是一个已知的 Bug，在某些版本（如 0.7）中复现。表现为部分字幕的时间戳计算错误导致持续时间为负。维护者正在评估修复方案。如果遇到此问题，建议尝试更新到最新版本或关注官方发布的补丁通知。目前暂无官方确认的稳定变通方法，避免重复转录同一文件可暂时规避完全一致的错误输出。","https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fissues\u002F253",[162,167,172,177,182,187,192],{"id":163,"version":164,"summary_zh":165,"released_at":166},333509,"v0.7","- 新增批量转录：可同时处理多个文件\n- 改进的说话人识别功能\n- 更好的标点符号处理\n- 命令行界面，便于脚本化操作\n- 多项小修复、改进和更新\n\n安装说明：https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Ftree\u002Fmain#download-and-installation\n\nMac 用户请注意：目前仅支持 M1 至 M5 芯片的 Mac 设备，不支持基于 Intel 芯片的机型。","2025-12-08T14:31:27",{"id":168,"version":169,"summary_zh":170,"released_at":171},333510,"v0.6","- 转录速度提升3倍\n- 新增：可安装自定义Whisper模型\n- 新增选项，用于选择是否包含口吃和填充词\n- 编辑器：新增“查找并替换”功能，可快速更改发言者姓名等信息\n- 修复、优化和更新了多项细节\n- 安装说明：https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Ftree\u002Fmain#download-and-installation\n\n面向搭载Intel芯片的Mac用户的Apple用户：\n我们现也为你们提供了实验性版本0.6。请协助我们进行测试：https:\u002F\u002Fgithub.com\u002Fkaixxx\u002FnoScribe\u002Fdiscussions\u002F143","2025-02-12T08:49:32",{"id":173,"version":174,"summary_zh":175,"released_at":176},333511,"v0.5","新增：\n- WebVTT 输出，支持将 noScribe 转录文件导入 [EXMARaLDA](https:\u002F\u002Fexmaralda.org)\n- 纯文本输出 (*.txt)\n- 改进的说话人检测：可预先定义说话人数量\n- 通过添加 VAD 滤波器，减少幻听和循环问题\n- 新版本发布时自动提示更新\n- 编辑器会自动打开（可在高级选项中关闭）\n\n修复：\n- CUDA 支持现已正式版，不再处于测试阶段\n- 修复了希伯来语语言设置、中文界面本地化、Linux 支持等方面的小问题。","2024-06-13T08:55:34",{"id":178,"version":179,"summary_zh":180,"released_at":181},333512,"v0.4.1","0.4 版本存在一个 bug，在 Windows 系统下与 NVIDIA 显卡搭配使用时会导致问题。0.4.1 版本已修复该问题。如果您并未遇到此 bug，则无需更新。","2023-10-21T14:57:48",{"id":183,"version":184,"summary_zh":185,"released_at":186},333513,"v0.4b","**更新日志**：\n- **说话人检测\u002F分离**功能大幅改进\n- 新增标记转录文本中**静音段**（无语音活动的部分）的选项\n- 新增标记**重叠话语**的选项（实验性功能）\n- 新增在转录文本中包含**时间戳**的选项\n- 推出全新的**noScribe Editor**应用程序，用于检查和校对转录文本（不再依赖 MS Word 宏）\n- noScribe 现在会输出一个**HTML 文件**，可在各大主流文字处理软件（如 MS Word、LibreOffice、OpenOffice 等）或定性数据分析软件中打开\n- 进行了多项底层优化，为即将发布的 macOS 版本做准备，并提升转录的可靠性和质量\n- 已将基础框架从“whisper.cpp”切换至“faster-whisper”（主要原因是其时间戳更加精确）","2023-10-17T14:07:42",{"id":188,"version":189,"summary_zh":190,"released_at":191},333514,"v0.3","这是**首个非测试版发布**。  \n我修复了一些问题，并添加了一些新功能和改进。  \n您可以从这里**下载** v0.3：https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j?path=%2FnoScribe%20vers.%200.3\n\n**新增功能：**\n- 用户界面现已支持西班牙语、法语、意大利语、日语、葡萄牙语、俄语和中文翻译。感谢 [mlynar-czyk]( https:\u002F\u002Fgithub.com\u002Fmlynar-czyk) 的贡献！请注意：这些翻译是通过巧妙地利用 ChatGPT 生成的。如果您发现任何错误，请及时报告；如果可能的话，也欢迎提交包含更佳译文的 Pull Request。\n- 主窗口中增加了超链接。现在只需点击日志中的文件名，即可直接打开已完成的转录文本。\n- 改进了说话人识别功能，尤其是在发言者快速切换的情况下（通过将 Whisper 的 `max-len` 参数降低至 30）。\n- 安装程序现在无需管理员权限即可运行。这意味着您可以在没有管理员权限的计算机上安装 noScribe（例如，由学校 IT 部门管理的机器）。感谢 [BabyFnord](https:\u002F\u002Fgithub.com\u002FBabyFnord) 的建议！\n\n**修复内容：**\n- 为解决 GitHub 问题 #2 中描述的问题（转录过程中出现错误代码 3221225794 或 3221225501），我已集成了一个支持较旧硬件（不支持 AVX2 指令集）的 whisper.cpp 版本。noScribe 会自动选择合适的版本使用。不过请注意，使用这类老旧硬件会导致转录速度非常缓慢。\n- 修复了 UTF-8 编码错误，该问题曾导致部分语言（如日语、匈牙利语）的转录失败。感谢两位通过电子邮件反馈此问题的用户！\n- 修复：自动保存功能在转录过程中过于频繁。\n- 修复：Word 中的“同步播放”功能有时无法正确找到转录文本的起始位置。\n- 修复：README 文件中存在一处有趣的笔误（“sensible data” 应为 “sensitive data”）。感谢 [TheOnlyWayUp](https:\u002F\u002Fgithub.com\u002FTheOnlyWayUp)!","2023-06-08T14:36:05",{"id":193,"version":194,"summary_zh":195,"released_at":196},333515,"v0.2-beta","很高兴发布我的第一个版本！\n\n你可以从这里下载 **Windows 安装程序**：\n**[https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j](https:\u002F\u002Fdrive.switch.ch\u002Findex.php\u002Fs\u002FEIVup04qkSHb54j) > 进入“0.2b”文件夹**\n\nSwitch Drive 是瑞士各大学使用的安全数据共享平台。\n请将此版本视为测试版。","2023-05-16T21:24:55"]