Open
Conversation
ea00a75 to
234da05
Compare
234da05 to
4b9c055
Compare
…un comparison table
Failed attempt rows (from *_failed_attempt_* directories) are now flagged in _collect_run_metrics and filtered out by default. A toggle appears above the table when failed attempts exist, allowing the user to show them.
… sections Adds an Output Dir column showing the parent directory of each run, and splits the single metrics table into three labelled subtables: Accuracy (EVA-A composites + [Accuracy] metrics), EVA-X (EVA-X composites + [Experience] metrics), and Diagnostic & Other (remaining metrics). Each subtable is skipped if it has no data for the current run set.
gabegma
reviewed
Apr 14, 2026
| id_rename = { | ||
| "system_name": "System", | ||
| "run_timestamp": "Timestamp", | ||
| "run_output_dir": "Output Dir", |
Collaborator
There was a problem hiding this comment.
Could we avoid putting the output dir in the table when we just have one? It find it adds unnecessary clutter. Multiple outputs is probably not the standard flow.
gabegma
reviewed
Apr 14, 2026
|
|
||
| ```bash | ||
| streamlit run apps/analysis.py | ||
| uv run streamlit run apps/analysis.py |
Collaborator
There was a problem hiding this comment.
We didn't put uv run elsewhere in this README for the other commands given it's not needed if you have set your python interpreter. I would remove these changes, or apply them everywhere.
gabegma
approved these changes
Apr 14, 2026
Collaborator
gabegma
left a comment
There was a problem hiding this comment.
Super!! Thank you, love the changes!
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.