test(skills): add plugin skill eval datasets#116
Conversation
|
/nvskills-ci |
|
Note Reviews pausedIt looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the Use the following commands to manage reviews:
Use the checkboxes below for quick actions:
📝 WalkthroughWalkthroughAdds evaluation JSON suites for three skills, updates SKILL front-matter and references to use renamed nemo-* plugin identifiers, and tweaks a single SDK example console message. ChangesSkill Evaluation Cases and SKILL metadata
Setup docs update
SDK example tweak
Suggested reviewers
🚥 Pre-merge checks | ✅ 5✅ Passed checks (5 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing Touches🧪 Generate unit tests (beta)
Comment |
|
/nvskills-ci |
|
ce349ae to
b51a05b
Compare
There was a problem hiding this comment.
Actionable comments posted: 1
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
Inline comments:
In `@packages/nemo_evaluator_sdk/examples/plugin_examples.py`:
- Line 232: Update the generic print("API key secret created for workspace") to
include the workspace and secret_name variables so the message mirrors other
logs in this file; locate the print in plugin_examples.py (near the API key
secret creation logic) and change it to output a descriptive message containing
workspace and secret_name (e.g., "API key secret '{secret_name}' created for
workspace '{workspace}'") to restore useful context for debugging.
🪄 Autofix (Beta)
Fix all unresolved CodeRabbit comments on this PR:
- Push a commit to this branch (recommended)
- Create a new PR with the fixes
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Enterprise
Run ID: 252b19f4-2a84-4405-a5fc-f0fe97fc5d3e
📒 Files selected for processing (1)
packages/nemo_evaluator_sdk/examples/plugin_examples.py
|
/nvskills-ci |
There was a problem hiding this comment.
🧹 Nitpick comments (1)
skills/nemo-setup/SKILL.md (1)
188-188: 💤 Low valueInconsistent separator for dual skill names.
Line 188 uses slash (
nemo-evaluator/nemo-evaluator-plugin), but line 162 uses comma (nemo-evaluator, nemo-evaluator-plugin). Pick one format consistently.🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@skills/nemo-setup/SKILL.md` at line 188, The dual skill name separator is inconsistent: change the entry "**`nemo-evaluator`** / **`nemo-evaluator-plugin`** — metrics, sync/async evaluations, llm-judge, benchmark jobs. From `plugins/nemo-evaluator`." to use the same separator as the earlier mention (comma), i.e., "**`nemo-evaluator`, **`nemo-evaluator-plugin`** — metrics, sync/async evaluations, llm-judge, benchmark jobs. From `plugins/nemo-evaluator`.", so both occurrences of the pair use the comma separator and maintain identical styling for `nemo-evaluator` and `nemo-evaluator-plugin`.
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
Nitpick comments:
In `@skills/nemo-setup/SKILL.md`:
- Line 188: The dual skill name separator is inconsistent: change the entry
"**`nemo-evaluator`** / **`nemo-evaluator-plugin`** — metrics, sync/async
evaluations, llm-judge, benchmark jobs. From `plugins/nemo-evaluator`." to use
the same separator as the earlier mention (comma), i.e., "**`nemo-evaluator`,
**`nemo-evaluator-plugin`** — metrics, sync/async evaluations, llm-judge,
benchmark jobs. From `plugins/nemo-evaluator`.", so both occurrences of the pair
use the comma separator and maintain identical styling for `nemo-evaluator` and
`nemo-evaluator-plugin`.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Enterprise
Run ID: abbbe96e-4450-4db5-a836-1f34cc8ebf73
📒 Files selected for processing (5)
skills/nemo-data-designer-plugin/SKILL.mdskills/nemo-data-designer-plugin/evals/evals.jsonskills/nemo-evaluator-plugin/SKILL.mdskills/nemo-evaluator-plugin/evals/evals.jsonskills/nemo-setup/SKILL.md
✅ Files skipped from review due to trivial changes (2)
- skills/nemo-evaluator-plugin/evals/evals.json
- skills/nemo-data-designer-plugin/SKILL.md
🚧 Files skipped from review as they are similar to previous changes (1)
- skills/nemo-data-designer-plugin/evals/evals.json
There was a problem hiding this comment.
Actionable comments posted: 1
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
Inline comments:
In `@skills/nemo-data-designer-plugin/SKILL.md`:
- Line 42: The troubleshooting bullets have mismatched markdown delimiters
around the key tokens (e.g., the line referencing `SamplerColumnConfig`),
causing rendering errors; update both bullets (the one mentioning
SamplerColumnConfig and the other at the same issue location) to use consistent
backticks and bold markers so they read like: bold label then inline code for
names — e.g., reference the symbol SamplerColumnConfig and its parameter name
`params` (not `sampler_params`) using matching delimiters to fix the markdown
rendering.
🪄 Autofix (Beta)
Fix all unresolved CodeRabbit comments on this PR:
- Push a commit to this branch (recommended)
- Create a new PR with the fixes
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Enterprise
Run ID: 939ee745-8031-4bf4-89cb-13fbe8eda371
📒 Files selected for processing (8)
skills/nemo-data-designer-plugin/SKILL.mdskills/nemo-data-designer-plugin/evals/evals.jsonskills/nemo-evaluator-plugin/SKILL.mdskills/nemo-evaluator-plugin/references/api-auth.mdskills/nemo-evaluator-plugin/references/llm-judge.mdskills/nemo-evaluator-plugin/references/troubleshooting.mdskills/nemo-setup/SKILL.mdskills/nemo-setup/evals/evals.json
✅ Files skipped from review due to trivial changes (1)
- skills/nemo-setup/SKILL.md
🚧 Files skipped from review as they are similar to previous changes (1)
- skills/nemo-data-designer-plugin/evals/evals.json
Signed-off-by: Nick Goncharenko <ngoncharenko@nvidia.com>
|
/nvskills-ci |
b4c360d to
33b3c5f
Compare
Signed-off-by: Nick Goncharenko <ngoncharenko@nvidia.com>
|
/nvskills-ci |
Signed-off-by: Nick Goncharenko <ngoncharenko@nvidia.com>
a3c803d to
0e02301
Compare
Signed-off-by: Nick Goncharenko <ngoncharenko@nvidia.com>
|
/nvskills-ci |
Signed-off-by: Nick Goncharenko <ngoncharenko@nvidia.com>
0125065 to
47c4778
Compare
|
/nvskills-ci |
1 similar comment
|
/nvskills-ci |
Summary
Validation
jq empty skills/nemo-data-designer-plugin/evals/evals.jsonjq empty skills/nemo-evaluator-plugin/evals/evals.jsongit diff --cached --checkbefore commitSummary by CodeRabbit