Skip to content

Inquiry Regarding Pairwise Model Comparison in Multi-Modality Arena #25

@zhimin-z

Description

@zhimin-z

Thank you for your remarkable contributions!

I've explored the multi-modality arena and noticed that it actually differs from the Chatbot Arena, where two anonymous models are compared side-by-side.
image
After playing with the demo in the README (as shown above), I observed that only one model is provided for evaluation by the third-party crowd:
image
I cannot find any arena-related keywords in the demo as well:
image

This leads me to inquire: where can we find the second model for conducting a pairwise comparison?
@shepnerd @wqshao126 @zzyfd @orashi

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions