[OPEN] Nexus stability debugging session
Session ID: nexus-stability Date: 2026-06-10 Project Root: /Users/stephenwang/Documents/trae_projects/DevOps
Symptom:
- Full chain build reaches GitLab and SonarQube successfully.
- Nexus upload succeeds in isolation.
- After GitLab and SonarQube are both running, Nexus on port 8082 becomes unreachable during the formal Jenkins build.
Expected:
- Nexus stays reachable long enough for
arithmetic-demo-buildto upload the artifact.
Hypotheses:
- The temporary Nexus container is OOM-killed after GitLab and SonarQube reclaim memory.
- Nexus remains running, but host port 8082 is no longer reachable because of container/network state.
- Service startup order and overlap cause Nexus to fail before it reaches a usable state.
- Jenkins probing is too strict or checks the wrong readiness signal for Nexus availability.
Evidence Plan:
- Inspect Docker container state, restart counts, and OOM flags for Nexus, GitLab, SonarQube, and Jenkins.
- Check live reachability from host and, if possible, from the Jenkins context.
- Review recent Nexus logs around the latest failure window.
- Reproduce one more failure with minimal overlap changes only after baseline evidence is collected.
Status:
- Session opened.
- No business logic changed yet.
Evidence Collected:
- devops-nexus-fresh-test previously exited with exitCode=137 and oomKilled=true while GitLab + SonarQube were running.
- After tuning GitLab/SonarQube and adjusting readiness checks, Nexus stayed reachable long enough for a full build to complete.
Current Verification:
- Jenkins job arithmetic-demo-build build #5: SUCCESS
- Nexus artifact download (anonymous): arithmetic-demo-image-5.tar.gz is reachable from http://127.0.0.1:8082/repository/demo-images/arithmetic-demo/