SA-752 SOC prompt engineering: Top candidate by ivyONS · Pull Request #34 · ONSdigital/soc-classification-utils

ivyONS · 2026-06-26T10:34:42Z

✨ Summary

Adds a new top-1 SOC selection flow for shortlist-based classification, with a strict prompt, typed response model, robust parsing fallback, and targeted tests. This is better aligned with clerical coding practices on SOC specifically (that is assigning one code always instead of higher groups used for SIC).

📜 Changes Introduced

Added a new method: top_one_soc_code in the LLM classification flow.
Added a new response model: TopOneResponse with fields for selected SOC code, title, likelihood score, and reasoning.
Added a dedicated top-one prompt that enforces:
- exactly one shortlist code selection
- shortlist-only output
- calibrated likelihood scoring using fixed values (0.2, 0.4, 0.6, 0.8, 0.9)
- explicit reasoning and separation from next-best alternatives
Added fallback behaviour in top_one_soc_code when chain/parsing fails: returns first shortlist item with low confidence and error-context reasoning
Updated SOC candidate formatting labels to clearer wording:
- Example job titles
- Description
- Example job tasks
Expanded tests to cover:
- prompt constraints for top-one selection
- typed return behavior of top_one_soc_code
- updated candidate-text expectations
Updated poetry.lock (regenerated with Poetry 2.4.1; transitive dependency refresh).

✅ Checklist

Code is formatted using ruff format
Imports are sorted using ruff
Code passes linting with ruff and mypy
Security checks pass using Bandit
API and Unit tests are written and pass using pytest
Terraform files (if applicable) follow best practices and have been validated (terraform fmt & terraform validate)
DocStrings follow Google-style and are added as per ruff recommendations
Documentation has been updated if needed

🔍 How to Test

Ran unit tests.
Invoke the llm interaction from survey-assist-evel (paired PR).

ivyONS added 6 commits June 19, 2026 12:36

chore: update dependencies

c154fdd

feat: add top_one_soc_code prompt

8fa4283

chore: tweak extra info in candidates

b7f201c

chore: top_one_promt variation

f875ebe

chore: top_one_promt variation - less meta

2a89015

chore: merge main into branch, fix conflicts

484c7ab

ivyONS requested a review from peter-spencer-ons June 26, 2026 10:36

ivyONS added 2 commits June 26, 2026 16:45

Merge branch 'main' into sa-752-top-candidate

1a4aa5c

bump version

ea89a9c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

SA-752 SOC prompt engineering: Top candidate#34

SA-752 SOC prompt engineering: Top candidate#34
ivyONS wants to merge 8 commits into
mainfrom
sa-752-top-candidate

ivyONS commented Jun 26, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

ivyONS commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✨ Summary

📜 Changes Introduced

✅ Checklist

🔍 How to Test

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ivyONS commented Jun 26, 2026 •

edited

Loading