Demo Script

Use this when showing the project to a recruiter, project lead, professor, or Handshake AI reviewer.

30-second explanation

This is EdgeCaseForge AI. It helps design coding benchmark tasks for evaluating AI coding models. Instead of only writing solutions, the app focuses on the harder part: creating problems, hidden tests, edge cases, and model-failure analysis.

What to show first

  1. Open the Streamlit app.
  2. Pick "Room Collision Validator" because it connects to game development.
  3. Show the "Why this can make AI models fail" tab.
  4. Explain that many models check only whether the player's center is inside a wall, whereas a correct solution must test for circle-rectangle overlap.
  5. Open the "Test Runner" tab.
  6. Paste a solution and run the sample tests.
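
If a reviewer asks what the failure in step 4 looks like in code, a minimal sketch can help. It is not the app's actual checker. It assumes the player is a circle (center plus radius), each wall is an axis-aligned rectangle, and touching counts as a collision. The names `Rect`, `center_inside`, and `circle_overlaps_rect` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Axis-aligned wall; (x, y) is the corner with the smallest coordinates."""
    x: float
    y: float
    width: float
    height: float


def center_inside(cx: float, cy: float, wall: Rect) -> bool:
    """The shortcut many solutions take: test only the player's center point."""
    return (wall.x <= cx <= wall.x + wall.width
            and wall.y <= cy <= wall.y + wall.height)


def circle_overlaps_rect(cx: float, cy: float, radius: float, wall: Rect) -> bool:
    """Circle-rectangle overlap: clamp the center to the rectangle to find
    the nearest point on the wall, then compare squared distances."""
    nearest_x = min(max(cx, wall.x), wall.x + wall.width)
    nearest_y = min(max(cy, wall.y), wall.y + wall.height)
    dx = cx - nearest_x
    dy = cy - nearest_y
    # Assumption: a circle that exactly touches the wall counts as colliding.
    return dx * dx + dy * dy <= radius * radius


wall = Rect(0, 0, 10, 10)

# Center is 1 unit outside the right edge, but a radius-2 player still hits the wall.
side_center_only = center_inside(11, 5, wall)
side_overlap = circle_overlaps_rect(11, 5, 2, wall)

# Near a corner the distance is diagonal, so a player at (12, 12) does not collide.
corner_overlap = circle_overlaps_rect(12, 12, 2, wall)
```

The first case is the kind of edge a hidden test can target: the center-only check reports no collision while the overlap check reports one. The corner case guards against the opposite mistake of simply growing the wall by the radius on every side, which would wrongly flag the player at (12, 12).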

What to say

I built this project to show that I can think like an AI evaluator, not just a programmer. The app designs coding problems with hidden edge cases, explains where AI models may fail, and validates sample solutions.