Use cmdstan config and decouple stanfit objects from RunSet by amas0 · Pull Request #851 · stan-dev/cmdstanpy

amas0 · 2026-04-12T01:39:18Z

Draft PR for initial progress on config parsing for #785 and implementing ideas discussed in #848.

@WardBrian I'd like you to take a look at this initial draft which implements pydantic-based parsing of the config json files and uses that to refactor CmdStanPathfinder to be built entirely from output files (and decouples it from the runset).

The changes I'd like you to look at are in stanfit/pathfinder.py and stanfit/metadata.py -- the others files are mostly just temporary patches to have the library in a functional state.

The CmdStanPathfinder class has been rewritten as a dataclass that defines a from_files(...) method as a constructor to build it from the output files. I also prefer the clarity that this gives to what attributes the class has.

Some areas where I'd appreciate thoughts:

I don't include any reference to the runset in the object itself. I think there are some arguments for having it as an optional attribute, but I opted for a clean break here.
I dropped from the repr any reference to the method_args, building directly from files made it seem that was no longer desirable.
I include stdout_file as an optional attribute. With no reference to the runset, it still seems like something that might be nice to have? But the stdout info is more like process diagnostic than an essential component of the stanfit?
save_csvfiles probably needs to be called something like save_output_files, but I haven't changed that yet.
In a world where we no longer need to parse any config out of the stan csv, this makes the InferenceMetadata object more or less only scan the header line for stan variables, I think. So it may be worth a think as to what that should look like?

If the shape of looks to be in the right direction, I'll proceed with doing this for the other stanfit objects. Following that, we should be able to really clean up what we have going on in stanfit/__init__.py and should be able to drop all the code that we have for parsing out the headers of the stan csvs.

Bumps [actions/cache](https://github.com/actions/cache) from 4 to 5. - [Release notes](https://github.com/actions/cache/releases) - [Changelog](https://github.com/actions/cache/blob/main/RELEASES.md) - [Commits](actions/cache@v4...v5) --- updated-dependencies: - dependency-name: actions/cache dependency-version: '5' dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com>

WardBrian

A few comments/questions, but overall I like the look of the code!

Responding to your comments in the PR:

Runset being left out of the result entirely makes sense. We will have to see how this feels when we tackle something with more moving parts like the MCMC class, but for now I agree.
Repr changes make sense.
Stdout file is good. We may want to also do the same thing with diagnostic files and profiling files, which gets us closer to just including the whole Runset again... maybe we have a small dataclass that is like a cut-down runset and stores the relevant files, can be re-used across all of them?
I agree, it makes sense to replace csv with output across the board, both in the save functions and the from_csv function. This would make sense going forward anyway, c.f. stan-dev/design-docs#58
Yep, totally agree that InferenceMetadata might want to take a hike when all is said and done. The header could be stored in each class, or in the cut-down-Runset idea ("FileInfo"?)

WardBrian · 2026-04-13T15:51:42Z

+    model_name: str
+    csv_file: str
+    config: PathfinderConfig
+    config_file: str | None = None


Is this really optional? Doesn't look like it in from_files

It's optional in the sense that if you pass a config in directly, you don't need to reference the file itself. Outside of simplifying some testing, I'm not sure if it will come up much? The intended entrypoint in practice is always going to be the from_files.

WardBrian · 2026-04-13T15:55:08Z

+            metadata=metadata,
+            csv_file=str(csv_file),
+            model_name=stan_config.model_name,
+            config=stan_config.method_config,


It's obviously nice that we don't have any downstream isinstance checks, but it is a bit of a bummer that we end up throwing away the extra information that is present in the StanConfig (e.g. model name, etc). Is there some trick to keep it all around, while maintaining that this is a Pathfinder-specific config?

Good point. We could just store references to both stan_config and config? The top level StanConfig would still contain the inference config within it, but it would just be the same object. Storing both would keep the top level info and still let us have config be validated that the method is correct. Keeps the type checker happy and doesn't drop any info. Thoughts?

That seems fine. I was wondering if pydantic had a more elegant way, but it's not like doing that actually costs any extra memory or anything

We could do something fancy like make StanConfig a generic, which pydantic has good support for:

from typing import Generic, TypeVar AnyMethodConfig = Annotated[ SampleConfig | OptimizeConfig | PathfinderConfig | LaplaceConfig | VariationalConfig | GeneratedQuantitiesConfig, Discriminator("method"), ] MethodT = TypeVar("MethodT", bound=BaseModel, default=AnyMethodConfig) class StanConfig(BaseModel, Generic[MethodT]): model_config = ConfigDict(extra="allow") model_name: str stan_major_version: str stan_minor_version: str stan_patch_version: str method_config: MethodT

Then for example we could have:

@dataclass class CmdStanPathfinder: # other fields ... config: StanConfig[PathfinderConfig]

And parsing like StanConfig[PathfinderConfig].model_validate(jsons) would validate it against the Pathfinder instance of the config. This would result in only one reference to the config but the type checker would know that it has the pathfinder configuration fields within method_config.

That seems like a good representation of what's actually happening under the hood

WardBrian · 2026-04-13T16:01:56Z

+                    f'found {len(csvfiles)}'
+                )
+            csv_file = csvfiles[0]
+            config_file = os.path.splitext(csv_file)[0] + '_config.json'


I assume this is temporary but it's worth putting a comment saying so, since this is kind of a nasty assumption

Yeah definitely temporary. I was planning on doing away with this function entirely once the full changes are in.

amas0 · 2026-04-16T02:22:26Z

@WardBrian Working through some of the other methods.

For CmdStanMLE, it looks like our typical approach to checking for convergence is looking at the return codes via the runset. Is there a more natural way to do this for reading from the output files directly?

I see we could maybe parse that info out of the stdout file, but I don't love that.

WardBrian · 2026-04-16T15:05:53Z

Hm, that is indeed tricky. I think we should add the piece of information that currently ends up in stdout (Optimization terminated normally:/with error:) to the output csv, but that won't help with existing, released versions.

Looking at the code, it looks like the only place we actually check the return codes is in model.py, before we call the CmdStanMLE constructor, so maybe it is fine?

amas0 · 2026-04-16T15:20:30Z

Yeah. In my local implementation, I take is_converged as an argument to the constructor. We have it in model.py so it should be fine in practice, but it means we can't get full reconstitution from just output files for MLE (unless we parse stdout).

It does seem like the kind of thing that should be available in the CSV? Like a converged__ column or something? Not a major issue for now, just doesn't make things as clean as I'd like.

WardBrian · 2026-04-16T15:24:32Z

but it means we can't get full reconstitution from just output files for MLE (unless we parse stdout).

This is true for the existing from_csv function, isn't it?

amas0 · 2026-04-16T15:29:24Z

Yeah good point, this is the snippet from from_csv:

            for i in range(len(runset._retcodes)):
                runset._set_retcode(i, 0)
            return CmdStanMLE(runset)

Looks like in recreating the runset, we just manually set all return codes to 0, which is just assuming convergence I believe.

WardBrian · 2026-04-16T16:14:47Z

Yeah, so we won't be doing any worse, at least!

amas0 · 2026-04-29T01:29:43Z

@WardBrian Is there an equivalent to saving the metric json for the variational method? Looking at CmdStanVB now and we parse the variational eta value from the stancsv comments, but afaik we don't have an alternative way to source it for now.

bob-carpenter · 2026-04-30T14:46:02Z

There's no preconditioning in our two variational inference implementations, ADVI and Pathfinder, so there's no equivalent of a metric/mass matrix/preconditioner.

WardBrian · 2026-04-30T14:52:25Z

@bob-carpenter I believe @amas0 was asking just if we wrote the metadata from the algorithm as a json, similar to the metric in sampling, not if there was actually a preconditioner

Unfortunately to my knowledge the answer to this is no. It's a shame to have to keep all the comment-reading machinery around for something as rarely-needed as the final eta value from ADVI...

bob-carpenter · 2026-04-30T14:57:09Z

Sorry for the confusion.

Keeping things around for eta is also silly in that estimating eta is one of the major weaknesses of our ADVI implementation (another being restriction to diagonal or dense approximations, a third being the high variance of low-sample ELBO evaluations).

A better approach for eta might be to treat it as another column in the tabular output like step size. Like step size, it will change as the algorithm progresses then hopefully stabilize.

amas0 · 2026-04-30T15:08:58Z

Yeah, in particular, I was referring to this section that appears in the output CSV if you run the variational method:

lp__,log_p__,log_g__,theta
# Stepsize adaptation complete.
# eta = 0.1
0,0,0,0.5838444

I just meant that we output the algorithm stepsize info to the CSV in the same way that we write the sampler adaptation output (as comment lines after the column headers). But in the sample case, we can output equivalent info to a metric JSON file, but that doesn't seem to be the case for the variational method here.

So, I guess the question is what do we want to do with this in the near term? I think it makes sense to have it be an additional column as Bob suggested? I suppose the question is whether it is important enough to maintain the comment parsing in the near term or should we drop keeping track of it for now until an alternative source is available?

WardBrian · 2026-05-01T16:18:47Z

I'm not sure it makes sense as a column, since the advi output is only the one row of the varational parameters, in which eta has already been adapted, and then forward sampling iterations

Github code search doesn't seem to find any public code which actually calls .eta, but I'm not sure this is really a good measure.

Maybe we preserve a dumber version of comment parsing just for that, i.e. a function that just does

for line in file:
  if line.startswith("# eta ="):
    return float(line.split('=')[1])

Rather than all the present machinery? We can also try to add a structured/json output like for the metric, but that would take longer to thread through the cmdstan code and we'd still need something in the meantime

amas0 · 2026-05-01T16:33:45Z

Honestly, the code surface for just being able to parse eta out is relatively small under the current structure. It's just one call that partitions the CSV into comment lines and draw lines and then a second call to pull out the eta value from the comment lines. Even keeping those, we'll be able to drop a good chunk Stan CSV parsing logic that goes through the comments.

amas0 · 2026-05-04T22:01:37Z

As I continue to chip away at this, I'd like to get some thoughts on how we should manage the way we current do testing in light of these changes.

We have a large test directory that contains a number of pre-generated cmdstan output files. These are used in fixtures throughout testing to reconstruct stanfit objects. With this new approach of not parsing everything out of the CSVs, we'll need to have config files in some way.

As part of this, should we got through all of those instances and regenerate the output files to make it so our testing fixtures correspond properly to the new structure? That would probably be the minimal change for testing purposes. Alternatively, are there other approaches we may want to consider for testing that wouldn't require this? (Like mocking file outputs or generating output files on the fly as part of testing)

WardBrian · 2026-05-05T13:45:35Z

I think re-generating them is reasonable. My opinion is that storing the files is basically the same as a really-good mock, since it actually interfaces with real outputs. We like having a few end-to-end tests that actually run cmdstan (especially to spot changes in the output formats over time), but it's too slow to do that for every test IMO

amas0 and others added 9 commits January 6, 2026 13:28

Add initial basic ConfigInfo model

d3fcc52

Add flattening and parsing logic for config JSON

81d08ab

Update flatten_value_dict to clarify variable naming

b3b20eb

Add PathfinderConfig

a5f8094

Add LaplaceConfig

d49c62c

Add VariationalConfig

503a970

Add GeneratedQuantitiesConfig

08c6d0e

Add first-pass CmdStanPathfinder rewrite

8f62f2e

WardBrian reviewed Apr 13, 2026

View reviewed changes

amas0 added 4 commits April 15, 2026 20:50

Fixup Path conversion to string

ca13a2e

Convert StanConfig to Generic for richer typing

c5a42db

Minor comment on config_file optionality

610ad2f

Refactor CmdStanLaplace

1e2f272

Add default method_type for parse_config

da04db4

WardBrian mentioned this pull request Apr 16, 2026

Add convergence information to optimization outputs stan-dev/stan#3379

Merged

3 tasks

amas0 added 2 commits April 18, 2026 13:47

Refactor CmdStanMLE

8b1a39b

Allow passing 'mode' directly to CmdStanLaplace.from_files

3466024

Refactor CmdStanVB to build from files

16eff3d

Uh oh!

Conversation

amas0 commented Apr 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

WardBrian left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

amas0 commented Apr 16, 2026

Uh oh!

WardBrian commented Apr 16, 2026

Uh oh!

amas0 commented Apr 16, 2026

Uh oh!

WardBrian commented Apr 16, 2026

Uh oh!

amas0 commented Apr 16, 2026

Uh oh!

WardBrian commented Apr 16, 2026

Uh oh!

amas0 commented Apr 29, 2026

Uh oh!

bob-carpenter commented Apr 30, 2026

Uh oh!

WardBrian commented Apr 30, 2026

Uh oh!

bob-carpenter commented Apr 30, 2026

Uh oh!

amas0 commented Apr 30, 2026

Uh oh!

WardBrian commented May 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

amas0 commented May 1, 2026

Uh oh!

amas0 commented May 4, 2026

Uh oh!

WardBrian commented May 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

amas0 commented Apr 12, 2026 •

edited

Loading

WardBrian commented May 1, 2026 •

edited

Loading