A curated guide for LLM-agent-driven scientific research automation
🌐 View Interactive Multilingual README →
🇨🇳 中文 · 🇺🇸 English · 🇰🇷 한국어 · 🇯🇵 日本語 · 🇩🇪 Deutsch · 🇫🇷 Français · 🇪🇸 Español · 🇮🇹 Italiano · 🇵🇹 Português · 🇸🇦 العربية · 🇹🇭 ไทย · 🇻🇳 Tiếng Việt · 🇷🇺 Русский
Automate the research loop with LLM agents: literature review → idea generation → experiment execution → paper writing → peer review.
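As a rough illustration of the loop above, the five stages can be modeled as functions that pass a shared state along. This is a minimal sketch, not the design of any system listed in this guide: `ResearchState`, `make_stage`, and `run_pipeline` are hypothetical names, and each stub body only marks where an LLM agent call would go.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class ResearchState:
    """Hypothetical shared state handed from one stage to the next."""
    topic: str
    notes: List[str] = field(default_factory=list)
    log: List[str] = field(default_factory=list)


Stage = Callable[[ResearchState], ResearchState]


def make_stage(name: str) -> Stage:
    """Build a stub stage; a real stage would call an LLM agent here."""
    def run(state: ResearchState) -> ResearchState:
        state.notes.append(f"{name}: placeholder output for '{state.topic}'")
        state.log.append(name)
        return state
    return run


# Stage order follows the research loop described above.
STAGE_NAMES = [
    "literature review",
    "idea generation",
    "experiment execution",
    "paper writing",
    "peer review",
]
PIPELINE: List[Stage] = [make_stage(name) for name in STAGE_NAMES]


def run_pipeline(topic: str) -> ResearchState:
    """Run every stage in order and return the accumulated state."""
    state = ResearchState(topic=topic)
    for stage in PIPELINE:
        state = stage(state)
    return state


if __name__ == "__main__":
    final = run_pipeline("example topic")
    print(" -> ".join(final.log))
```

Keeping each stage behind the same function signature makes it easy to swap a stub for a real agent call, or to loop back from peer review to an earlier stage.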
This repo is a research-first landing page for the field: use it to choose the right track, then move into the topic pages for detail. The center is still Vibe Research, but the map now also tracks personal assistants, agent-native software layers, and companion apps around that core.
Start here: Getting Started · Tools & Platforms · Claw Park · Vibe Coding
| Core Question | What Changed In 2026 | How To Use This Repo |
|---|---|---|
| How far can AI move from research assistant to research operator? Focus: literature, ideation, experiment, writing, and evaluation. | Research copilots got stronger, learning layers became real, autonomous research systems got more credible, and Vibe Coding became the execution layer. | Treat the README as a map. Treat the topic pages as the actual guide. |
Five trends are now shaping the field:
- Research copilots are getting stronger: Deep Research products, NotebookLM-style source-grounded reading, and scientific workspaces such as Prism are making literature synthesis and report writing much faster.
- Autonomous research systems are maturing: AI Scientist-v2, Agent Laboratory, and EvoScientist push the field from "paper summary bots" toward iterative ideation, execution, and evaluation.
- Research is no longer isolated from the personal-agent wave: OpenClaw, Hermes Agent, Goose, Khoj, and AnythingLLM show how research workflows increasingly sit next to messaging-native, knowledge-native, and workspace-native personal assistants.
- Agent-native software layers are becoming real infrastructure: CLI-Anything, MCP registries, chat bridges, skills, and plugin registries are turning existing tools and software surfaces into agent-operable environments.
- Learning layers and companion UX are both accelerating: self-evolving stacks such as Agent Lightning and AgentEvolver matter more, but so do approval, monitoring, and jump-back interfaces such as Vibe Island and xisland around long-running coding agents.
This guide keeps Vibe Research as the core topic, then adds adjacent sections for the broader agent-native landscape so the repo can expand without losing scope.
Several current signals make the field feel less like a loose collection of demos and more like an emerging stack:
- Research is still the center, but no longer the whole map: the most useful way to read the field now is research core plus adjacent assistant, software-surface, learning, and companion-UX layers.
- Personal agent assistants are converging with research workflows: OpenClaw, Hermes Agent, Goose, Khoj, and AnythingLLM show how long-running assistants, personal memory, and local-first agent shells are becoming normal infrastructure around research and coding work.
- Agent-native software and harness layers are emerging fast: CLI-Anything turns software into agent-operable CLI surfaces, while MCP registries, cc-connect, anthropics/skills, and ClawHub make capabilities more installable and portable.
- Learning and RL remain a first-class layer: Agent Lightning, Agent0, AgentEvolver, EvoAgentX, and Acontext keep pushing the field from tool use toward agent improvement.
- Companion apps are becoming their own category: Vibe Island, xisland, and projects such as Crush show that coding-agent UX is no longer just terminal UI, but also monitoring, approval, and agent-ops surface design.
| Path | Start | Then |
|---|---|---|
| 🟢 New to Vibe Research | Getting Started | Tools & Platforms |
| 🔵 Developer / Builder | Tools & Platforms | Vibe Coding · Systems · Experiment |
| 🔴 Researcher | Surveys | Ideation · Benchmarks |
| 🟣 Creator / Operator | Vibe Anything | Vibe Coding · Tools & Platforms |
Only have 5 minutes? Install InnoClaw and try it out.
Vibe Research remains the center of this repo. But in practice, the field now sits inside a larger agent-native landscape: personal assistants, software-surface layers, self-improving agents, and companion UX around coding agents.
The personal agent assistants below are not all "research agents", but they increasingly shape the environment research agents live in.
| Project | What it is | Why it matters here |
|---|---|---|
| OpenClaw | Gateway-native assistant runtime with chat control, plugins, bundles, and deployment surfaces | Shows how personal assistants, plugins, and research flows can share one substrate instead of staying as separate demos |
| Hermes Agent | General-purpose personal agent stack with gateway, CLI, plugins, skills, and long-running plans | Important signal that "personal assistant" is becoming a real open-source platform layer, not only a chatbot wrapper |
| Goose | Open-source extensible agent that can install, execute, edit, and test with any LLM | Represents the dev-native branch of personal assistants that sits close to repo work and engineering execution |
| Khoj | Self-hostable AI second brain and autonomous personal AI with deep-research and automation hooks | Shows the knowledge-native assistant pattern where long-term memory, personal docs, and web retrieval become part of the assistant surface |
| AnythingLLM | Privacy-first workspace-style AI productivity layer | Useful reference for the workspace-native assistant pattern where teams want one local-first surface for models, docs, and agents |
The agent-native software layer matters because the frontier is no longer only "which agent is best", but also "which software surfaces are now agent-operable".
| Project | What it is | Why it matters here |
|---|---|---|
| CLI-Anything | Turns software into agent-native CLI surfaces and ships many agent-harness adapters | Strong signal that existing tools are being retrofitted into agent-operable interfaces instead of being rebuilt from scratch |
| cc-connect | Chat control plane for Claude Code, Codex, Cursor, Gemini CLI, and related agents | Shows how chat surfaces are becoming remote-control shells for coding and research agents |
| Official MCP Registry | Canonical discovery layer for MCP servers and tools | Makes tool discovery and installation a real infrastructure concern instead of ad hoc glue |
| anthropics/skills | Public reusable skill substrate for Claude Code | Shows how skills are becoming portable capability units rather than only local prompts |
| ClawHub · OpenClaw Plugin Bundles | Registry, bundle compatibility, and install layer around OpenClaw | Good reference for how agent-native ecosystems package and distribute capabilities |
The learning layer is still one of the most important structural shifts in the field: not just tool-using agents, but agents that can be trained, optimized, or improved over time.
| Layer | Representative resources | Why it matters |
|---|---|---|
| Agent training / optimization | Agent Lightning | Brings RL, automatic prompt optimization, and SFT to arbitrary agent systems with near-zero code changes |
| Zero-data self-evolution | Agent0 · AgentEvolver | Shows how agents can generate tasks, feedback, and training signals without human-curated data pipelines |
| Evolving workflows | EvoAgentX · EvoScientist · MetaClaw | Shifts the focus from optimizing one prompt to evolving whole workflows, skill graphs, or scientist loops |
| Skill & memory substrate | Acontext · anthropics/skills | Makes skills, context, and reusable experience part of the learning layer |
| Landscape map | Awesome-Self-Evolving-Agents | Best current GitHub-native overview of the optimization, evolution, and lifelong-agent literature |
Companion apps are a newer branch, but one growing quickly enough to deserve a separate callout.
| Project | What it is | Why it matters here |
|---|---|---|
| Crush | Open-source agentic coding UX from Charm | Good signal that the UI layer around coding agents is now its own product surface, not just a wrapper around a model API |
| Vibe Island | Commercial macOS Dynamic Island companion for many coding agents | Shows the new monitor / approve / ask / jump-back interaction layer around long-running coding-agent sessions |
| xisland | Free macOS notch companion for Claude Code, Codex, Gemini CLI, and OpenCode | Similar signal from an indie distribution angle: approval and session-monitoring UX is becoming a category of its own |
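The monitor / approve pattern in the table above can be sketched as a gate that sits between an agent and the actions it wants to run. This is an illustrative sketch only and does not reflect how Vibe Island, xisland, or Crush are implemented: `Action`, `ApprovalGate`, and the `risky` flag are hypothetical names introduced here.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    """Hypothetical description of something an agent wants to do."""
    command: str
    risky: bool


@dataclass
class ApprovalGate:
    """Ask a human (or a policy function) before running risky actions."""
    approve: Callable[[Action], bool]

    def run(self, actions: List[Action]) -> List[str]:
        """Return an event log; a real gate would execute the commands."""
        events: List[str] = []
        for action in actions:
            if action.risky and not self.approve(action):
                events.append(f"blocked: {action.command}")
                continue
            events.append(f"ran: {action.command}")
        return events


if __name__ == "__main__":
    # Deny every risky action; a companion app would prompt the user instead.
    gate = ApprovalGate(approve=lambda action: False)
    for event in gate.run([Action("ls", False), Action("rm -rf build", True)]):
        print(event)
```

The event log is the part a companion surface would display, which is why the gate records blocked actions rather than silently dropping them.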
The Claw family now spans more than research agents. A cleaner way to read it is by layer:
| Layer | Representative projects | Why it matters |
|---|---|---|
| Gateway / foundation | OpenClaw | Core runtime, control surface, and the base layer other Claw projects increasingly build on |
| Registry / discovery | ClawHub · awesome-openclaw-skills | Skill and plugin discovery are now a separate ecosystem layer |
| Compatibility / bundles | OpenClaw Plugin Bundles | Makes it easier to import Codex, Claude, and Cursor ecosystem formats into OpenClaw-native features |
| Packaging / deployment | nix-openclaw | Gives the ecosystem a reproducible deployment path for serious self-hosting and ops |
| Research surfaces | InnoClaw · ResearchClaw · ResearchClaw Desktop App | Covers grounded workspaces, daily research copilots, and lighter-weight reading surfaces |
| Scientific specialist / evolution | ScienceClaw · MetaClaw | Pushes deeper into scientific specialization, persistent memory, and online learning |
| Autonomous pipeline | AutoResearchClaw | Represents the maximum-autonomy idea-to-paper direction |
Full map: → Claw Park
| Layer | Representative projects | Why it matters |
|---|---|---|
| Research copilots | OpenAI Deep Research · Gemini Deep Research · NotebookLM · Prism | Fast literature synthesis, source-grounded reading, and scientific writing assistance |
| Research systems | InnoClaw · ResearchClaw · FARS · AI Scientist · Agent Laboratory · EvoScientist | End-to-end research assistance, automation, and experiment execution |
| AI scientist platforms | FutureHouse Platform · Robin · Edison Scientific · Kosmos | Shows the field moving from paper demos to persistent web/API platforms and validated scientific workflows |
| Personal agent assistants | OpenClaw · Hermes Agent · Goose · Khoj · AnythingLLM | Shows the wider assistant layer surrounding research: messaging-native, dev-native, knowledge-native, and workspace-native agents |
| Agent-native software / harnesses | CLI-Anything · cc-connect · Official MCP Registry · anthropics/skills | Shows how software surfaces, registries, and skills are becoming agent-operable infrastructure |
| Learning / self-evolving layer | Agent Lightning · Agent0 · AgentEvolver · EvoAgentX · Acontext | Turns agent training, self-generated data, evolving workflows, and persistent skill/context memory into a real stack layer |
| Claw ecosystem | OpenClaw · ClawHub · OpenClaw Plugin Bundles · nix-openclaw · InnoClaw · ResearchClaw · ScienceClaw · MetaClaw · AutoResearchClaw | Gateway, registry, compatibility, deployment, research workspaces, scientific specialization, online learning, and autonomous pipelines |
| Execution layer | Claude Code · Codex · Cursor Background Agents · GitHub Copilot Coding Agent | The coding and repo workflow layer that increasingly powers research execution |
| Companion apps / coding UX | Crush · Vibe Island · xisland | Shows the monitoring, approval, and session-jump UX layer forming around long-running coding agents |
| Adjacent prompt-native tools | v0 · Lovable · Replit Agent | Useful for prototyping, but not the core of Vibe Research |
→ Tools & Platforms · → Claw Park · → Vibe Coding · → Vibe Anything
A new layer is forming between "agent" and "workflow": plugin surfaces, MCP registries, skill catalogs, and chat bridges that make research agents easier to extend, discover, and operate.
| Layer | Representative resources | Why it matters |
|---|---|---|
| Bridge & control surfaces | cc-connect | Runs Claude Code, Cursor, Gemini CLI, Codex, and similar agents from chat surfaces such as Feishu/Lark, Slack, Telegram, and WeCom |
| Plugin / customization layer | CLI-Anything · ClawHub · OpenClaw Plugin Bundles · awesome-claude-code-plugins | Shows how agent ecosystems are moving toward software harnesses, skill registries, plugin marketplaces, bundle compatibility, and installable capability packs |
| Learning / memory substrate | Acontext · anthropics/skills | Shows how context, memory, and reusable skills are turning into persistent substrates for agent improvement |
| Claude Code workflow layer | wshobson/agents · SuperClaude Framework · claude-task-master | Shows how commands, agent teams, skills, and task systems are turning Claude Code into a fuller development environment |
| Routing / agent-ops layer | claude-code-router · Claude Squad · Repomix | Highlights provider routing, multi-agent session management, and codebase packaging as new operational layers around coding agents |
| Registry / discovery layer | Official MCP Registry · awesome-mcp-servers · awesome-openclaw-skills | Makes it easier to find, compare, and install the rapidly growing tool and skill ecosystem |
| Research connectors | OpenAlex Research MCP · Academia MCP · PapersWithCode MCP | Connects agents directly to literature graphs, code artifacts, datasets, and benchmark metadata |
More detailed map: → Tools & Platforms
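To make the registry / discovery idea concrete, here is a minimal in-memory sketch of a capability registry that supports registration and lookup by tag. It is an assumption-laden toy, not the API of the Official MCP Registry, ClawHub, or any project named above: `Capability`, `Registry`, and the example entries are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Capability:
    """Hypothetical record for an installable tool, skill, or plugin."""
    name: str
    kind: str  # for example "mcp-server", "skill", or "plugin"
    tags: Tuple[str, ...]


class Registry:
    """Toy discovery layer: register capabilities, then find them by tag."""

    def __init__(self) -> None:
        self._items: Dict[str, Capability] = {}

    def register(self, cap: Capability) -> None:
        if cap.name in self._items:
            raise ValueError(f"duplicate capability: {cap.name}")
        self._items[cap.name] = cap

    def find(self, tag: str) -> List[Capability]:
        """Return matching capabilities sorted by name for stable output."""
        matches = (c for c in self._items.values() if tag in c.tags)
        return sorted(matches, key=lambda c: c.name)


if __name__ == "__main__":
    registry = Registry()
    # Example entries are made up for illustration.
    registry.register(Capability("example-paper-search", "mcp-server", ("literature",)))
    registry.register(Capability("example-citation-skill", "skill", ("literature", "writing")))
    print([c.name for c in registry.find("literature")])
```

A real registry adds versioning, install metadata, and trust signals on top of this lookup, but the core contract of "describe a capability once, discover it by need" is the same.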
| Topic | Description | Link |
|---|---|---|
| 🚀 Getting Started | 5-min demo → 30-min agent deployment → full automation | → Getting Started |
| 🧰 Tools & Platforms | Core research platforms plus assistant, connector, and software-surface layers | → Tools & Platforms |
| 🦞 Claw Park | Ecosystem map for what each Claw project is building and where it fits | → Claw Park |
| 💻 Vibe Coding | Terminal agents, coding agents, companion apps, and repo guardrails | → Vibe Coding |
| 🎨 Vibe Anything | Adjacent prompt-native workflows for apps, design, writing, slides, and ops | → Vibe Anything |
| Topic | Core Question | Papers | Link |
|---|---|---|---|
| 📄 Surveys | Landscape & evolution of the field | 5 | → Surveys |
| ⚙️ Systems | How to design end-to-end research systems | 6 | → Systems |
| 💡 Ideation | Can LLMs generate novel ideas | 6 | → Ideation |
| 📚 Synthesis | How to synthesize literature at scale | 5 | → Synthesis |
| 🧪 Experiment | How agents automate experiments | 4 | → Experiment |
| ✍️ Writing & Review | LLM-assisted writing & peer review | 4 | → Writing & Review |
| 📊 Benchmarks | How to evaluate research agents | 5 | → Benchmarks |

| Goal | Topic pages |
|---|---|
| Read The Field | Surveys · Systems · Benchmarks |
| Build The Stack | Tools & Platforms · Claw Park · Vibe Coding |
| Prototype Beyond Research | Vibe Anything |
Introductions: AI for Science (Nature) · LLM Agents (Lilian Weng) · Agentic Patterns (Andrew Ng)
Awesome Lists: LLM Agent Survey · AI Agents · Scientific Idea Generation
Research Core: Semantic Scholar · Elicit · Consensus · Connected Papers
AI Scientist Platforms: FutureHouse Platform · Robin · Edison Scientific · Kosmos
Personal Agent Assistants: OpenClaw · Hermes Agent · Goose · Khoj · AnythingLLM
Agent-Native Interfaces & Harnesses: CLI-Anything · cc-connect · Official MCP Registry · anthropics/skills · ClawHub
Claw Ecosystem: OpenClaw · ClawHub · Plugin Bundles · nix-openclaw · Claw Park
Learning / Self-Evolving Agents: Agent Lightning · Agent0 · AgentEvolver · EvoAgentX · Acontext · Awesome-Self-Evolving-Agents
Execution / Vibe Coding: Claude Code · Codex · Cursor Background Agents · GitHub Copilot Coding Agent · Gemini CLI · Crush
Claude Code Ecosystem: anthropics/skills · wshobson/agents · SuperClaude Framework · claude-code-router · Claude Squad · claude-task-master · Repomix
Companion Apps: Vibe Island · xisland
Prototyping: v0 · Lovable · Replit Agent · Figma AI · Canva AI
Conferences: NeurIPS · ICML · ICLR · ACL · AAAI · EMNLP
Submit resources via Resource Suggestion · Contribute via PR · Follow the curation guidelines
Citation
@misc{viberesearch2026,
title = {Vibe Research Guide},
author = {Aaron Wang and Contributors},
year = {2026},
url = {https://github.com/SpectrAI-Initiative/Vibe-Research-Guide},
}
Changelog
- 2026-04-15: Reframed the README as a research-first but broader agent-native map, adding personal assistants, CLI-Anything / harness layers, and companion coding apps while keeping Vibe Research as the center
- 2026-W17: Expanded Claw coverage from a short project list into a fuller family map, including ClawHub, Plugin Bundles, nix-openclaw, ResearchClaw Desktop App, and a clearer stack-layer taxonomy
- 2026-W16: Added a dedicated learning / RL / self-evolving layer to the guide, including Agent Lightning, Agent0, AgentEvolver, EvoAgentX, Acontext, and Awesome-Self-Evolving-Agents
- 2026-W14: Added 2026 Q1 signals for OpenClaw platformization, FutureHouse / Robin / BixBench, and Edison Scientific / Kosmos; refreshed ecosystem framing across the guide
- 2026-W14: Added recent Claude Code ecosystem signals, including anthropics/skills, wshobson/agents, SuperClaude, claude-code-router, Claude Squad, claude-task-master, and Repomix
- 2026-W13: Added a new plugin / bridge / registry layer to the guide, including cc-connect, OpenAlex Research MCP, Academia MCP, PapersWithCode MCP, and more Claw ecosystem positioning
- 2026-W13: Added core tools & platforms (InnoClaw, ResearchClaw, FARS, Orchestra, OpenClaw, EvoScientist); added Deep Research tools, OpenAI Prism, MCP Servers; switched all content to English; expanded to 35+ papers across 9 topic files
- 2026-W12: Redesigned README into a stronger landing page with cleaner hierarchy, card-style path selection, and a more visual ecosystem map
- 2026-W12: Hub-and-spoke architecture reorganization
- 2026-W12: Initial public release
Full history: CHANGELOG.md


