Skip to content

docs(skills): consolidate training skill disclosure#5530

Queued
njzjz-bot wants to merge 1 commit into
deepmodeling:masterfrom
njzjz-bothub:skill-progressive-train
Queued

docs(skills): consolidate training skill disclosure#5530
njzjz-bot wants to merge 1 commit into
deepmodeling:masterfrom
njzjz-bothub:skill-progressive-train

Conversation

@njzjz-bot

@njzjz-bot njzjz-bot commented Jun 13, 2026

Copy link
Copy Markdown
Contributor

Problem

  • The training skills are currently split by model, which forces agents to select a model-specific skill before the shared training workflow is established.
  • Adding more trainable models would keep growing top-level skills instead of progressively loading only the selected model configuration.

Change

  • Add a unified deepmd-train skill that first selects the model family, then reads only the chosen model reference under skills/deepmd-train/models/.
  • Move the existing DPA3 and se_e2_a recipes into concise model reference files and keep shared train/monitor/freeze/test workflow text in the top-level skill.
  • Update the existing doc/agent-skills.md page to list the consolidated training skill and adjust the verification example.

Notes

  • Ran pre-commit run --files doc/agent-skills.md skills/deepmd-train/SKILL.md skills/deepmd-train/models/dpa3.md skills/deepmd-train/models/se-e2-a.md; all selected hooks passed.
  • Also checked no new docs page/toctree entry remains, no developer-maintenance section remains in the docs/skill text, and se_e2_a spelling is lowercase.
  • Sphinx build was skipped because sphinx is not installed in this checkout environment.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)

Summary by CodeRabbit

  • Documentation
    • Consolidated DeePMD-kit training into a single progressive-disclosure workflow with in-place selection between DPA3 and SE_E2_A
    • Added comprehensive DPA3 and SE_E2_A reference guides: configuration examples, hyperparameter tables, checklists, benchmarks, selection guidance, and dynamic neighbor-selection notes
    • Removed older per-model training pages and updated the agent skills index to reference the new consolidated training skill; finetuning skill retained

@dosubot dosubot Bot added the Docs label Jun 13, 2026
@coderabbitai

coderabbitai Bot commented Jun 13, 2026

Copy link
Copy Markdown
Contributor

Review Change Stack

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 6ffaa866-2069-4628-a871-f1d3aa6a5023

📥 Commits

Reviewing files that changed from the base of the PR and between 4ab0212 and a9dfc01.

📒 Files selected for processing (6)
  • doc/agent-skills.md
  • skills/deepmd-train-dpa3/SKILL.md
  • skills/deepmd-train-se-e2-a/SKILL.md
  • skills/deepmd-train/SKILL.md
  • skills/deepmd-train/models/dpa3.md
  • skills/deepmd-train/models/se-e2-a.md
💤 Files with no reviewable changes (2)
  • skills/deepmd-train-dpa3/SKILL.md
  • skills/deepmd-train-se-e2-a/SKILL.md
✅ Files skipped from review due to trivial changes (3)
  • skills/deepmd-train/models/se-e2-a.md
  • doc/agent-skills.md
  • skills/deepmd-train/models/dpa3.md

📝 Walkthrough

Walkthrough

This PR consolidates DeePMD-kit training documentation by replacing two separate model-specific skills (deepmd-train-dpa3 and deepmd-train-se-e2-a) with a unified deepmd-train skill using progressive disclosure, model-specific references, and updated framework documentation.

Changes

DeePMD Training Skills Consolidation

Layer / File(s) Summary
Agent Skills Framework Update
doc/agent-skills.md
Rewords the Agent Skills introduction, replaces the skill lineup with the new unified deepmd-train plus fine-tune/inference/deployment skills, expands skill organization guidance for model-specific subdocuments, and updates the "Minimal verification" example to use the consolidated deepmd-train skill with DPA3/SE_E2_A selection.
DeePMD Training Skill (Progressive Disclosure)
skills/deepmd-train/SKILL.md
Introduces the unified deepmd-train skill with YAML metadata, compatibility requirements, and a progressive-disclosure workflow: confirm environment/data, select between SE_E2_A and DPA3, read the chosen model reference from models/, execute training, monitor lcurve.out, freeze, and test. Includes agent checklist and references to training/input/model documentation.
DPA3 Model-Specific Training Reference
skills/deepmd-train/models/dpa3.md
Replaces the top-of-file metadata with reference guidance for post-selection reading, then provides DPA3 selection rationale, required inputs, merged JSON configuration with repflow dynamic neighbor selection, model size variants with benchmarks, comparison table versus SE_E2_A, and configuration checklist.
SE_E2_A Model-Specific Training Reference
skills/deepmd-train/models/se-e2-a.md
Removes prior YAML front-matter and provides SE_E2_A selection criteria, required inputs, merged JSON configuration with descriptor and fitting defaults, key hyperparameters, SE_E2_A checklist, and reference links.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Possibly related PRs

  • deepmodeling/deepmd-kit#5437: Consolidation directly restructures the DeePMD-kit training agent documentation introduced there by deleting separate model skills and replacing them with the new unified deepmd-train skill and model-specific references.
  • deepmodeling/deepmd-kit#5497: Updates to doc/agent-skills.md wording and examples on the same guide document structure and deepmd-train/DPA3 selection patterns.

Suggested reviewers

  • njzjz
  • wanghan-iapcm
  • iProzd
🚥 Pre-merge checks | ✅ 5
✅ Passed checks (5 passed)
Check name Status Explanation
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.
Title check ✅ Passed The title 'consolidate training skill disclosure' accurately summarizes the main change—reorganizing split model-specific training skills into a unified progressive-disclosure skill structure.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check ✅ Passed Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check ✅ Passed Check skipped because no linked issues were found for this pull request.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@coderabbitai coderabbitai Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🧹 Nitpick comments (1)
skills/deepmd-train/SKILL.md (1)

22-28: 💤 Low value

Consider rewording successive "If" clauses to improve readability.

Lines 26–28 each begin with "If", which may reduce clarity. Rewording some conditionals to vary sentence structure would improve the flow:

- Do not start by reading every model document. First classify the request:
-
- - If the user already named a model, read only that model reference.
- - If the user asks for a recommendation, collect the decision inputs below, choose a model, then read only the selected reference.
- - If model-specific parameters are not needed yet, stay in this top-level workflow.
+ Do not start by reading every model document. First classify the request:
+
+ - When the user names a model, read only that model reference.
+ - For recommendation requests, collect the decision inputs below, choose a model, then read only the selected reference.
+ - When model-specific parameters are not needed yet, stay in this top-level workflow.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@skills/deepmd-train/SKILL.md` around lines 22 - 28, The three consecutive
conditional list items under "Progressive disclosure protocol" start each with
"If", which reduces readability; edit the three bullets so they vary sentence
openings (e.g., keep the first as "If the user already named a model,...",
change the second to "When the user asks for a recommendation, collect the
decision inputs..., then read only the selected reference", and change the third
to "If model-specific parameters are not needed yet, remain in the top-level
workflow" or use lead-ins like "Otherwise," or "In that case,"), preserving
meaning and the existing decision flow in the SKILL.md section.

Source: Linters/SAST tools

🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Nitpick comments:
In `@skills/deepmd-train/SKILL.md`:
- Around line 22-28: The three consecutive conditional list items under
"Progressive disclosure protocol" start each with "If", which reduces
readability; edit the three bullets so they vary sentence openings (e.g., keep
the first as "If the user already named a model,...", change the second to "When
the user asks for a recommendation, collect the decision inputs..., then read
only the selected reference", and change the third to "If model-specific
parameters are not needed yet, remain in the top-level workflow" or use lead-ins
like "Otherwise," or "In that case,"), preserving meaning and the existing
decision flow in the SKILL.md section.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: ec4dfae2-8b0d-4442-8ec2-1a0455dcfad6

📥 Commits

Reviewing files that changed from the base of the PR and between 5d94bd6 and 51e345e.

📒 Files selected for processing (5)
  • doc/development/agent-skills.md
  • doc/index.rst
  • skills/deepmd-train/SKILL.md
  • skills/deepmd-train/models/dpa3.md
  • skills/deepmd-train/models/se-e2-a.md

@njzjz njzjz force-pushed the skill-progressive-train branch from 51e345e to 8e9b3af Compare June 13, 2026 18:09

@coderabbitai coderabbitai Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@doc/agent-skills.md`:
- Around line 17-21: Update the description for the deepmd-train skill so it no
longer implies only training from scratch: change the opening sentence to say
the skill handles training and fine‑tuning workflows (including from‑scratch and
pretrained/transfer-style DPA3 paths), and keep the rest of the paragraph about
progressive disclosure and model-specific configs under
skills/deepmd-train/models/ intact.

In `@skills/deepmd-train/models/se-e2-a.md`:
- Around line 12-16: The example for computing neighbor selection `sel`
hard-codes element types with `-t O H`; update the `dp --pt neighbor-stat`
example so it uses the dataset's actual type_map instead of fixed "O H" (e.g.
show `-t <type_map>` or omit `-t` and explain how to extract the dataset's
type_map) and mention `type_map` by name; change the line in se-e2-a.md that
currently shows `dp --pt neighbor-stat -s /path/to/data -r 6.0 -t O H` to a
model-agnostic form like `dp --pt neighbor-stat -s /path/to/data -r 6.0 -t
<type_map>` (or `dp --pt neighbor-stat -s /path/to/data -r 6.0` with a note how
to get the dataset's type_map), and ensure the surrounding text references `sel`
and `type_map` so users know to substitute their dataset's types.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 673d9a2c-056d-49bb-8dd2-e0db695947ab

📥 Commits

Reviewing files that changed from the base of the PR and between 51e345e and 8e9b3af.

📒 Files selected for processing (6)
  • doc/agent-skills.md
  • skills/deepmd-train-dpa3/SKILL.md
  • skills/deepmd-train-se-e2-a/SKILL.md
  • skills/deepmd-train/SKILL.md
  • skills/deepmd-train/models/dpa3.md
  • skills/deepmd-train/models/se-e2-a.md
💤 Files with no reviewable changes (2)
  • skills/deepmd-train-se-e2-a/SKILL.md
  • skills/deepmd-train-dpa3/SKILL.md
✅ Files skipped from review due to trivial changes (1)
  • skills/deepmd-train/models/dpa3.md

Comment thread doc/agent-skills.md Outdated
Comment thread skills/deepmd-train/models/se-e2-a.md
@njzjz njzjz force-pushed the skill-progressive-train branch from 8e9b3af to 4ab0212 Compare June 13, 2026 18:18

@coderabbitai coderabbitai Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🧹 Nitpick comments (2)
skills/deepmd-train/SKILL.md (2)

119-119: 💤 Low value

Clarify what "clearly marked" means for placeholder data paths.

Line 119 expects data paths to be "clearly marked as placeholders," but the checklist doesn't specify the format or convention. Consider adding an example (e.g., /path/to/train_system or <PLACEHOLDER: training data>) to guide agents on acceptable patterns.

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@skills/deepmd-train/SKILL.md` at line 119, Update the checklist item
"Training/validation data paths exist or are clearly marked as placeholders" to
define the placeholder convention and give examples; specifically, modify the
SKILL.md checklist entry to state what "clearly marked" means (for example: use
a distinct pattern like /path/to/train_system or <PLACEHOLDER: training data>),
allow either absolute paths or the placeholder syntax, and include one or two
concrete examples so agents know acceptable formats (e.g.,
"/path/to/train_system" and "<PLACEHOLDER: training data>").

26-28: 💤 Low value

Reduce repetition of "If" at sentence starts.

Lines 26–28 each begin with "If," which LanguageTool flagged as repetitive. Consider reordering or merging to improve readability while preserving the logical structure.

Example rephrase:

  • Classify based on the user's input: if they already named a model, read only that reference; if they ask for a recommendation, collect decision inputs and choose before reading; if model-specific parameters aren't needed yet, stay in the top-level workflow.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@skills/deepmd-train/SKILL.md` around lines 26 - 28, Combine the three short
sentences that each start with "If" into a single, clearer sentence to reduce
repetition and preserve logic; for example, in SKILL.md's workflow description
replace the three lines beginning "If the user..." / "If the user asks..." / "If
model-specific..." with a single sentence such as: "Classify based on the user's
input: if they already named a model, read only that reference; if they ask for
a recommendation, collect the decision inputs below, choose a model, then read
only the selected reference; if model-specific parameters aren't needed yet,
stay in this top-level workflow." Ensure the revised sentence retains the
original ordering and meaning of the three cases.

Source: Linters/SAST tools

🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Nitpick comments:
In `@skills/deepmd-train/SKILL.md`:
- Line 119: Update the checklist item "Training/validation data paths exist or
are clearly marked as placeholders" to define the placeholder convention and
give examples; specifically, modify the SKILL.md checklist entry to state what
"clearly marked" means (for example: use a distinct pattern like
/path/to/train_system or <PLACEHOLDER: training data>), allow either absolute
paths or the placeholder syntax, and include one or two concrete examples so
agents know acceptable formats (e.g., "/path/to/train_system" and "<PLACEHOLDER:
training data>").
- Around line 26-28: Combine the three short sentences that each start with "If"
into a single, clearer sentence to reduce repetition and preserve logic; for
example, in SKILL.md's workflow description replace the three lines beginning
"If the user..." / "If the user asks..." / "If model-specific..." with a single
sentence such as: "Classify based on the user's input: if they already named a
model, read only that reference; if they ask for a recommendation, collect the
decision inputs below, choose a model, then read only the selected reference; if
model-specific parameters aren't needed yet, stay in this top-level workflow."
Ensure the revised sentence retains the original ordering and meaning of the
three cases.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 7c7ee886-eeac-443a-bf0d-f84a2e453f1a

📥 Commits

Reviewing files that changed from the base of the PR and between 8e9b3af and 4ab0212.

📒 Files selected for processing (6)
  • doc/agent-skills.md
  • skills/deepmd-train-dpa3/SKILL.md
  • skills/deepmd-train-se-e2-a/SKILL.md
  • skills/deepmd-train/SKILL.md
  • skills/deepmd-train/models/dpa3.md
  • skills/deepmd-train/models/se-e2-a.md
💤 Files with no reviewable changes (2)
  • skills/deepmd-train-se-e2-a/SKILL.md
  • skills/deepmd-train-dpa3/SKILL.md
✅ Files skipped from review due to trivial changes (1)
  • doc/agent-skills.md
🚧 Files skipped from review as they are similar to previous changes (2)
  • skills/deepmd-train/models/se-e2-a.md
  • skills/deepmd-train/models/dpa3.md

Move model-specific DeePMD-kit training recipes under a single deepmd-train skill so agents first choose a model and only then read the selected configuration reference. Document the progressive-disclosure pattern for future skill additions.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)
@njzjz njzjz force-pushed the skill-progressive-train branch from 4ab0212 to a9dfc01 Compare June 13, 2026 18:24
@njzjz njzjz requested a review from iProzd June 13, 2026 18:42
@codecov

codecov Bot commented Jun 13, 2026

Copy link
Copy Markdown

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 82.19%. Comparing base (5d94bd6) to head (a9dfc01).

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #5530      +/-   ##
==========================================
- Coverage   82.19%   82.19%   -0.01%     
==========================================
  Files         891      891              
  Lines      101599   101599              
  Branches     4242     4240       -2     
==========================================
- Hits        83507    83506       -1     
  Misses      16789    16789              
- Partials     1303     1304       +1     

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
  • 📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

@njzjz njzjz added this pull request to the merge queue Jun 16, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants