Skip to content

SpectrAI-Initiative/Vibe-Research-Guide

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Vibe Research Guide

A curated guide for LLM-agent-driven scientific research automation

Stars Last Commit Issues MIT

🌐 View Interactive Multilingual README →
🇨🇳 中文 · 🇺🇸 English · 🇰🇷 한국어 · 🇯🇵 日本語 · 🇩🇪 Deutsch · 🇫🇷 Français · 🇪🇸 Español · 🇮🇹 Italiano · 🇵🇹 Português · 🇸🇦 العربية · 🇹🇭 ไทย · 🇻🇳 Tiếng Việt · 🇷🇺 Русский

Vibe Research Guide Overview


Vibe Research At The Center, With The Broader Agent-Native Stack Around It

Automate the research loop with LLM agents: literature review → idea generation → experiment execution → paper writing → peer review.

This repo is a research-first landing page for the field: use it to choose the right track, then move into the topic pages for detail. The center is still Vibe Research, but the map now also tracks personal assistants, agent-native software layers, and companion apps around that core.

Start here: Getting Started · Tools & Platforms · Claw Park · Vibe Coding

Vibe Research: AI assistant workflow (idea → literature → experiment → code → result → paper)

At A Glance

Core Question

How far can AI move from research assistant to research operator?

Focus: literature, ideation, experiment, writing, and evaluation.
What Changed In 2026

Research copilots got stronger, learning layers became real, autonomous research systems got more credible, and Vibe Coding became the execution layer.
How To Use This Repo

Treat the README as a map. Treat the topic pages as the actual guide.

2026 Landscape Snapshot

Five trends are now shaping the field:

  1. Research copilots are getting stronger: Deep Research products, NotebookLM-style source-grounded reading, and scientific workspaces such as Prism are making literature synthesis and report writing much faster.
  2. Autonomous research systems are maturing: AI Scientist-v2, Agent Laboratory, and EvoScientist push the field from "paper summary bots" toward iterative ideation, execution, and evaluation.
  3. Research is no longer isolated from the personal-agent wave: OpenClaw, Hermes Agent, Goose, Khoj, and AnythingLLM show how research workflows increasingly sit next to messaging-native, knowledge-native, and workspace-native personal assistants.
  4. Agent-native software layers are becoming real infrastructure: CLI-Anything, MCP registries, chat bridges, skills, and plugin registries are turning existing tools and software surfaces into agent-operable environments.
  5. Learning layers and companion UX are both accelerating: self-evolving stacks such as Agent Lightning and AgentEvolver matter more, but so do approval, monitoring, and jump-back interfaces such as Vibe Island and xisland around long-running coding agents.

This guide keeps Vibe Research as the core topic, then adds adjacent sections for the broader agent-native landscape so the repo can expand without losing scope.


2026 Spring Signals

Several current signals make the field feel less like a loose collection of demos and more like an emerging stack:

  1. Research is still the center, but no longer the whole map: the most useful way to read the field now is research core plus adjacent assistant, software-surface, learning, and companion-UX layers.
  2. Personal agent assistants are converging with research workflows: OpenClaw, Hermes Agent, Goose, Khoj, and AnythingLLM show how long-running assistants, personal memory, and local-first agent shells are becoming normal infrastructure around research and coding work.
  3. Agent-native software and harness layers are emerging fast: CLI-Anything turns software into agent-operable CLI surfaces, while MCP registries, cc-connect, anthropics/skills, and ClawHub make capabilities more installable and portable.
  4. Learning and RL remain a first-class layer: Agent Lightning, Agent0, AgentEvolver, EvoAgentX, and Acontext keep pushing the field from tool use toward agent improvement.
  5. Companion apps are becoming their own category: Vibe Island, xisland, and projects such as Crush show that coding-agent UX is no longer just terminal UI, but also monitoring, approval, and agent-ops surface design.

Choose a Path

🟢 New to Vibe Research

Start: Getting Started
Then: Tools & Platforms
🔵 Developer / Builder

Start: Tools & Platforms
Then: Vibe Coding · Systems · Experiment
🔴 Researcher

Start: Surveys
Then: Ideation · Benchmarks
🟣 Creator / Operator

Start: Vibe Anything
Then: Vibe Coding · Tools & Platforms

Only have 5 minutes? Install InnoClaw and try it out.


Agent-Native Landscape Beyond Research

Vibe Research remains the center of this repo. But in practice, the field now sits inside a larger agent-native landscape: personal assistants, software-surface layers, self-improving agents, and companion UX around coding agents.

Personal Agent Assistants

These are not all "research agents", but they increasingly shape the environment research agents live in.

Project What it is Why it matters here
OpenClaw Gateway-native assistant runtime with chat control, plugins, bundles, and deployment surfaces Shows how personal assistants, plugins, and research flows can share one substrate instead of staying as separate demos
Hermes Agent General-purpose personal agent stack with gateway, CLI, plugins, skills, and long-running plans Important signal that "personal assistant" is becoming a real open-source platform layer, not only a chatbot wrapper
Goose Open-source extensible agent that can install, execute, edit, and test with any LLM Represents the dev-native branch of personal assistants that sits close to repo work and engineering execution
Khoj Self-hostable AI second brain and autonomous personal AI with deep-research and automation hooks Shows the knowledge-native assistant pattern where long-term memory, personal docs, and web retrieval become part of the assistant surface
AnythingLLM Privacy-first workspace-style AI productivity layer Useful reference for the workspace-native assistant pattern where teams want one local-first surface for models, docs, and agents

Agent-Native Software / CLI-Anything

This layer matters because the frontier is no longer only "which agent is best", but also "which software surfaces are now agent-operable".

Project What it is Why it matters here
CLI-Anything Turns software into agent-native CLI surfaces and ships many agent-harness adapters Strong signal that existing tools are being retrofitted into agent-operable interfaces instead of being rebuilt from scratch
cc-connect Chat control plane for Claude Code, Codex, Cursor, Gemini CLI, and related agents Shows how chat surfaces are becoming remote-control shells for coding and research agents
Official MCP Registry Canonical discovery layer for MCP servers and tools Makes tool discovery and installation a real infrastructure concern instead of ad hoc glue
anthropics/skills Public reusable skill substrate for Claude Code Shows how skills are becoming portable capability units rather than only local prompts
ClawHub · OpenClaw Plugin Bundles Registry, bundle compatibility, and install layer around OpenClaw Good reference for how agent-native ecosystems package and distribute capabilities

Learning, RL & Self-Evolving Agents

This is still one of the most important structural shifts in the field: not just tool-using agents, but agents that can be trained, optimized, or improved over time.

Layer Representative resources Why it matters
Agent training / optimization Agent Lightning Brings RL, automatic prompt optimization, and SFT to arbitrary agent systems with near-zero code changes
Zero-data self-evolution Agent0 · AgentEvolver Shows how agents can generate tasks, feedback, and training signals without human-curated data pipelines
Evolving workflows EvoAgentX · EvoScientist · MetaClaw Shifts the focus from optimizing one prompt to evolving whole workflows, skill graphs, or scientist loops
Skill & memory substrate Acontext · anthropics/skills Makes skills, context, and reusable experience part of the learning layer
Landscape map Awesome-Self-Evolving-Agents Best current GitHub-native overview of the optimization, evolution, and lifelong-agent literature

Vibe Coding Apps & Companion UX

This is a newer branch, but it is growing quickly enough that it deserves to be called out separately.

Project What it is Why it matters here
Crush Open-source agentic coding UX from Charm Good signal that the UI layer around coding agents is now its own product surface, not just a wrapper around a model API
Vibe Island Commercial macOS Dynamic Island companion for many coding agents Shows the new monitor / approve / ask / jump-back interaction layer around long-running coding-agent sessions
xisland Free macOS notch companion for Claude Code, Codex, Gemini CLI, and OpenCode Similar signal from an indie distribution angle: approval and session-monitoring UX is becoming a category of its own

Claw Stack At A Glance

The Claw family now spans more than research agents. The cleaner reading is:

Layer Representative projects Why it matters
Gateway / foundation OpenClaw Core runtime, control surface, and the base layer other Claw projects increasingly build on
Registry / discovery ClawHub · awesome-openclaw-skills Skill and plugin discovery are now a separate ecosystem layer
Compatibility / bundles OpenClaw Plugin Bundles Makes it easier to import Codex, Claude, and Cursor ecosystem formats into OpenClaw-native features
Packaging / deployment nix-openclaw Gives the ecosystem a reproducible deployment path for serious self-hosting and ops
Research surfaces InnoClaw · ResearchClaw · ResearchClaw Desktop App Covers grounded workspaces, daily research copilots, and lighter-weight reading surfaces
Scientific specialist / evolution ScienceClaw · MetaClaw Pushes deeper into scientific specialization, persistent memory, and online learning
Autonomous pipeline AutoResearchClaw Represents the maximum-autonomy idea-to-paper direction

Full map: → Claw Park


Ecosystem Snapshot

CLAW Ecosystem - Vibe Research tools and platforms

Layer Representative projects Why it matters
Research copilots OpenAI Deep Research · Gemini Deep Research · NotebookLM · Prism Fast literature synthesis, source-grounded reading, and scientific writing assistance
Research systems InnoClaw · ResearchClaw · FARS · AI Scientist · Agent Laboratory · EvoScientist End-to-end research assistance, automation, and experiment execution
AI scientist platforms FutureHouse Platform · Robin · Edison Scientific · Kosmos Shows the field moving from paper demos to persistent web/API platforms and validated scientific workflows
Personal agent assistants OpenClaw · Hermes Agent · Goose · Khoj · AnythingLLM Shows the wider assistant layer surrounding research: messaging-native, dev-native, knowledge-native, and workspace-native agents
Agent-native software / harnesses CLI-Anything · cc-connect · Official MCP Registry · anthropics/skills Shows how software surfaces, registries, and skills are becoming agent-operable infrastructure
Learning / self-evolving layer Agent Lightning · Agent0 · AgentEvolver · EvoAgentX · Acontext Turns agent training, self-generated data, evolving workflows, and persistent skill/context memory into a real stack layer
Claw ecosystem OpenClaw · ClawHub · OpenClaw Plugin Bundles · nix-openclaw · InnoClaw · ResearchClaw · ScienceClaw · MetaClaw · AutoResearchClaw Gateway, registry, compatibility, deployment, research workspaces, scientific specialization, online learning, and autonomous pipelines
Execution layer Claude Code · Codex · Cursor Background Agents · GitHub Copilot Coding Agent The coding and repo workflow layer that increasingly powers research execution
Companion apps / coding UX Crush · Vibe Island · xisland Shows the monitoring, approval, and session-jump UX layer forming around long-running coding agents
Adjacent prompt-native tools v0 · Lovable · Replit Agent Useful for prototyping, but not the core of Vibe Research
→ Tools & Platforms → Claw Park → Vibe Coding → Vibe Anything

Plugins, Bridges & Research Connectors

A new layer is forming between "agent" and "workflow": plugin surfaces, MCP registries, skill catalogs, and chat bridges that make research agents easier to extend, discover, and operate.

Layer Representative resources Why it matters
Bridge & control surfaces cc-connect Runs Claude Code, Cursor, Gemini CLI, Codex, and similar agents from chat surfaces such as Feishu/Lark, Slack, Telegram, and WeCom
Plugin / customization layer CLI-Anything · ClawHub · OpenClaw Plugin Bundles · awesome-claude-code-plugins Shows how agent ecosystems are moving toward software harnesses, skill registries, plugin marketplaces, bundle compatibility, and installable capability packs
Learning / memory substrate Acontext · anthropics/skills Shows how context, memory, and reusable skills are turning into persistent substrates for agent improvement
Claude Code workflow layer wshobson/agents · SuperClaude Framework · claude-task-master Shows how commands, agent teams, skills, and task systems are turning Claude Code into a fuller development environment
Routing / agent-ops layer claude-code-router · Claude Squad · Repomix Highlights provider routing, multi-agent session management, and codebase packaging as new operational layers around coding agents
Registry / discovery layer Official MCP Registry · awesome-mcp-servers · awesome-openclaw-skills Makes it easier to find, compare, and install the rapidly growing tool and skill ecosystem
Research connectors OpenAlex Research MCP · Academia MCP · PapersWithCode MCP Connects agents directly to literature graphs, code artifacts, datasets, and benchmark metadata

More detailed map: → Tools & Platforms


Topic Map

Core Guides

Topic Description Link
🚀 Getting Started 5-min demo → 30-min agent deployment → full automation → Getting Started
🧰 Tools & Platforms Core research platforms plus assistant, connector, and software-surface layers → Tools & Platforms
🦞 Claw Park Ecosystem map for what each Claw project is building and where it fits → Claw Park
💻 Vibe Coding Terminal agents, coding agents, companion apps, and repo guardrails → Vibe Coding
🎨 Vibe Anything Adjacent prompt-native workflows for apps, design, writing, slides, and ops → Vibe Anything

Research Topics (35+ papers)

Topic Core Question Papers Link
📄 Surveys Landscape & evolution of the field 5 → Surveys
⚙️ Systems How to design end-to-end research systems 6 → Systems
💡 Ideation Can LLMs generate novel ideas 6 → Ideation
📚 Synthesis How to synthesize literature at scale 5 → Synthesis
🧪 Experiment How agents automate experiments 4 → Experiment
✍️ Writing & Review LLM-assisted writing & peer review 4 → Writing & Review
📊 Benchmarks How to evaluate research agents 5 → Benchmarks

Reading Modes

Read The Field

Surveys · Systems · Benchmarks
Build The Stack

Tools & Platforms · Claw Park · Vibe Coding
Prototype Beyond Research

Vibe Anything

Useful Resources

Introductions: AI for Science (Nature) · LLM Agents (Lilian Weng) · Agentic Patterns (Andrew Ng)

Awesome Lists: LLM Agent Survey · AI Agents · Scientific Idea Generation

Research Core: Semantic Scholar · Elicit · Consensus · Connected Papers

AI Scientist Platforms: FutureHouse Platform · Robin · Edison Scientific · Kosmos

Personal Agent Assistants: OpenClaw · Hermes Agent · Goose · Khoj · AnythingLLM

Agent-Native Interfaces & Harnesses: CLI-Anything · cc-connect · Official MCP Registry · anthropics/skills · ClawHub

Claw Ecosystem: OpenClaw · ClawHub · Plugin Bundles · nix-openclaw · Claw Park

Learning / Self-Evolving Agents: Agent Lightning · Agent0 · AgentEvolver · EvoAgentX · Acontext · Awesome-Self-Evolving-Agents

Execution / Vibe Coding: Claude Code · Codex · Cursor Background Agents · GitHub Copilot Coding Agent · Gemini CLI · Crush

Claude Code Ecosystem: anthropics/skills · wshobson/agents · SuperClaude Framework · claude-code-router · Claude Squad · claude-task-master · Repomix

Companion Apps: Vibe Island · xisland

Prototyping: v0 · Lovable · Replit Agent · Figma AI · Canva AI

Conferences: NeurIPS · ICML · ICLR · ACL · AAAI · EMNLP


Contribute

Submit resources via Resource Suggestion · Contribute via PR · Follow the curation guidelines


Citation
@misc{viberesearch2026,
  title = {Vibe Research Guide},
  author = {Aaron Wang and Contributors},
  year = {2026},
  url = {https://github.com/SpectrAI-Initiative/Vibe-Research-Guide},
}
Changelog
  • 2026-04-15: Reframed the README as a research-first but broader agent-native map, adding personal assistants, CLI-Anything / harness layers, and companion coding apps while keeping Vibe Research as the center
  • 2026-W17: Expanded Claw coverage from a short project list into a fuller family map, including ClawHub, Plugin Bundles, nix-openclaw, ResearchClaw Desktop App, and a clearer stack-layer taxonomy
  • 2026-W16: Added a dedicated learning / RL / self-evolving layer to the guide, including Agent Lightning, Agent0, AgentEvolver, EvoAgentX, Acontext, and Awesome-Self-Evolving-Agents
  • 2026-W14: Added 2026 Q1 signals for OpenClaw platformization, FutureHouse / Robin / BixBench, and Edison Scientific / Kosmos; refreshed ecosystem framing across the guide
  • 2026-W14: Added recent Claude Code ecosystem signals, including anthropics/skills, wshobson/agents, SuperClaude, claude-code-router, Claude Squad, claude-task-master, and Repomix
  • 2026-W13: Added a new plugin / bridge / registry layer to the guide, including cc-connect, OpenAlex Research MCP, Academia MCP, PapersWithCode MCP, and more Claw ecosystem positioning
  • 2026-W13: Added core tools & platforms (InnoClaw, ResearchClaw, FARS, Orchestra, OpenClaw, EvoScientist); added Deep Research tools, OpenAI Prism, MCP Servers; switched all content to English; expanded to 35+ papers across 9 topic files
  • 2026-W12: Redesigned README into a stronger landing page with cleaner hierarchy, card-style path selection, and a more visual ecosystem map
  • 2026-W12: Hub-and-spoke architecture reorganization
  • 2026-W12: Initial public release

Full history: CHANGELOG.md

MIT License · Star History Chart

About

A curated guide for LLM-agent-driven scientific research automation — from getting started to the frontier.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages