fix: the /check_correctness endpoint accepts arbitra... in... by orbisai0security · Pull Request #6423 · hpcaitech/ColossalAI

orbisai0security · 2026-06-20T18:30:04Z

Summary

Fix critical severity security issue in applications/ColossalChat/start_code_verifier.py.

Vulnerability

Field	Value
ID	V-001
Severity	CRITICAL
Scanner	multi_agent_ai
Rule	`V-001`
File	`applications/ColossalChat/start_code_verifier.py:23`
Assessment	Confirmed exploitable

Description: The /check_correctness endpoint accepts arbitrary code in the 'generation' field and executes it via the check_correctness() function to verify correctness against test cases. The endpoint has no authentication, accepts requests from any network client, and the timeout parameter is user-controllable. There is no evidence of sandboxing at the service layer, meaning submitted code runs with the full privileges of the server process.

Evidence

Scanner confirmation: multi_agent_ai rule V-001 flagged this pattern.

Production code: This file is in the production codebase, not test-only code.

Threat Model Context

This is a Python library - vulnerabilities affect applications that import this code.

Changes

applications/ColossalChat/start_code_verifier.py

Verification

Build passes
Scanner re-scan confirms fix
LLM code review passed

Security Invariant

Property: Protected endpoints reject unauthenticated requests

Regression test

import pytest
import importlib.util
import sys
from pathlib import Path
from fastapi.testclient import TestClient

# Load the actual production app from the file under test
spec = importlib.util.spec_from_file_location(
    "start_code_verifier",
    Path(__file__).parent / "applications/ColossalChat/start_code_verifier.py"
)
module = importlib.util.load_from_spec(spec)
spec.loader.exec_module(module)
app = module.app

client = TestClient(app, raise_server_exceptions=False)

VALID_PAYLOAD = {"generation": "def f(x): return x", "in_outs": "{}", "timeout": 5, "debug": False, "eval_mode": "exec"}

@pytest.mark.parametrize("headers,payload", [
    # Exact exploit: no auth, arbitrary code execution attempt
    ({}, {"generation": "__import__('os').system('id')", "in_outs": "{}", "timeout": 10, "debug": False, "eval_mode": "exec"}),
    # Missing token (boundary: empty Authorization header)
    ({"Authorization": ""}, VALID_PAYLOAD),
    # Malformed token
    ({"Authorization": "Bearer not.a.valid.jwt"}, VALID_PAYLOAD),
    # Expired/fake token
    ({"Authorization": "Bearer eyJhbGciOiJIUzI1NiJ9.eyJleHAiOjF9.invalid"}, VALID_PAYLOAD),
])
def test_check_correctness_requires_authentication(headers, payload):
    """Invariant: /check_correctness must reject unauthenticated or invalidly-authenticated requests with 401 or 403."""
    response = client.post("/check_correctness", json=payload, headers=headers)
    assert response.status_code in (401, 403), (
        f"Expected 401 or 403 for unauthenticated request, got {response.status_code}. "
        "The endpoint must not be publicly accessible without valid credentials."
    )

This test guards against regressions — it's useful independent of the code change above.

Automated security fix by OrbisAI Security

Automated security fix generated by OrbisAI Security

The /check_correctness endpoint accepts arbitrary code in the 'generation' field and executes it via the check_correctness() function to verify correctness against test cases

for more information, see https://pre-commit.ci

orbisai0security added 2 commits June 20, 2026 18:29

fix: V-001 security vulnerability

bd6e9c3

Automated security fix generated by OrbisAI Security

fix: the /check_correctness endpoint accepts arbitra... in start_code...

5b7b0ca

The /check_correctness endpoint accepts arbitrary code in the 'generation' field and executes it via the check_correctness() function to verify correctness against test cases

orbisai0security requested a review from a team as a code owner June 20, 2026 18:30

[pre-commit.ci] auto fixes from pre-commit.com hooks

3339ef5

for more information, see https://pre-commit.ci

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: the /check_correctness endpoint accepts arbitra... in...#6423

fix: the /check_correctness endpoint accepts arbitra... in...#6423
orbisai0security wants to merge 3 commits into
hpcaitech:mainfrom
orbisai0security:fix-v-001-code-verifier-auth

orbisai0security commented Jun 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

orbisai0security commented Jun 20, 2026

Summary

Vulnerability

Evidence

Threat Model Context

Changes

Verification

Security Invariant

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant