-
Notifications
You must be signed in to change notification settings - Fork 3.2k
Eval fails for non-English languages #3111
Copy link
Copy link
Labels
eval[Component] This issue is related to evaluation[Component] This issue is related to evaluationgood first issue[Community] This issue is good for newcomers to participate[Community] This issue is good for newcomers to participatehelp wanted[Community] Extra attention is needed[Community] Extra attention is neededneeds review[Status] The PR/issue is awaiting review from the maintainer[Status] The PR/issue is awaiting review from the maintainer
Metadata
Metadata
Assignees
Labels
eval[Component] This issue is related to evaluation[Component] This issue is related to evaluationgood first issue[Community] This issue is good for newcomers to participate[Community] This issue is good for newcomers to participatehelp wanted[Community] Extra attention is needed[Community] Extra attention is neededneeds review[Status] The PR/issue is awaiting review from the maintainer[Status] The PR/issue is awaiting review from the maintainer
Describe the bug
When evaluating text in other languages (e.g., Thai, etc.), the eval logic incorrectly returns mismatches (Match score: 0)— even when the evaluated expression should be valid or identical in meaning.
To Reproduce
Expected behavior
Eval should correctly handle input in other languages.
Screenshots
Desktop (please complete the following information):
Model Information:
Additional context
Add any other context about the problem here.