fix: reject Java optimizations with unused additions and unchanged target method #1947
mashraf-222 wants to merge 6 commits into main
…rget method
Adds a wiring check in replace_function() that detects when the AI generates "optimizations" adding fields/helpers that the target method never references. Previously these passed through because benchmark noise produced fake speedups.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Remove 8 qualified_name="..." kwargs passed to FunctionToOptimize: qualified_name is a @property, not a constructor field. Pydantic silently accepts it at runtime, but mypy strict mode (prek hook) rejects it.
- Add -> None return annotations and the missing Path / JavaSupport parameter annotations to every test method and fixture in test_replacement.py so the prek mypy hook passes when the file is in the CI diff.
Rich renders the banner panel with box-drawing characters (╭, ╮, │, etc.) that cp1252 cannot decode. On Windows, subprocess.run(..., text=True) uses cp1252 by default, so decoding the child stdout raises UnicodeDecodeError and subprocess sets result.stdout to None — breaking the assertion with a misleading "argument of type 'NoneType' is not iterable". Pass encoding="utf-8" explicitly so the test passes on every platform.
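A minimal sketch of the decoding fix described above. The child command here is a hypothetical stand-in that writes a few box-drawing characters as UTF-8 bytes; the real test invokes the CLI banner, which is not shown in this PR text.

```python
import subprocess
import sys

# Hypothetical stand-in for the CLI under test: it writes the UTF-8 bytes of
# three box-drawing characters. Unicode escapes keep the command line ASCII.
CHILD_CODE = 'import sys; sys.stdout.buffer.write("\\u256d\\u2500\\u256e".encode("utf-8"))'


def run_child() -> str:
    """Run the child process and decode its stdout as UTF-8 on every platform."""
    result = subprocess.run(
        [sys.executable, "-c", CHILD_CODE],
        capture_output=True,
        text=True,
        # Without an explicit encoding, text mode uses the locale's preferred
        # encoding, which is cp1252 on many Windows setups.
        encoding="utf-8",
        check=True,
    )
    return result.stdout
```

With the encoding pinned, `result.stdout` is a `str` regardless of the host locale, so an `in` assertion on it no longer depends on the platform default.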
Review
Bug premise verified: real. On current … Fix is architecturally correct. CI blockers addressed in the last two commits:
All other failing checks (e2e-java/python 500/504, snyk quota) are infra flakes unrelated to this PR.
Relationship to PR #1950 (CF-1084): both close the same class of bug (an AI-generated candidate leaves the target untouched). #1947 catches the "additions-with-no-body-reference" case; #1950 catches the "class member mod with untouched target" case, including constructors, which this PR doesn't cover. They are complementary, not duplicates; recommend merging both.
Ready for re-review.
Problem
The AI optimizer sometimes generates "optimizations" that add new fields or helper methods to a Java class without changing the target method at all. Because benchmark noise produces small timing variations, these fake optimizations pass the speedup critic and create PRs with no real improvement.
Example: 4 commons-lang PRs each added
private static final Supplier<String> NULL_SUPPLIER = Suppliers.nul();
but the target methods (getJavaAwtHeadless, getJavaIoTmpdir, etc.) were never modified to use it, yet reported 7-151% speedups.
Root Cause
replace_function() in replacement.py accepts any optimization that changes the file, even if the target method body is identical to the original. The dedup check compares the entire candidate (function + helpers/fields), so adding a new field makes it "different" from the original, bypassing the identity check.
Fix
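To illustrate the root cause described above, here is a small sketch of why a whole-candidate identity check is bypassed by an added field. The `Candidate` class, both helper functions, and the Java snippets are hypothetical and are not the code in replacement.py.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical Java snippets, for illustration only.
ORIGINAL_METHOD = 'String getTmpdir() { return System.getProperty("java.io.tmpdir"); }'
ADDED_FIELD = "private static final Supplier<String> NULL_SUPPLIER = Suppliers.nul();"


@dataclass
class Candidate:
    """Hypothetical container: the target method plus any added fields/helpers."""
    method: str
    additions: List[str] = field(default_factory=list)

    def full_text(self) -> str:
        return "\n".join([*self.additions, self.method])


def whole_candidate_is_duplicate(original: Candidate, candidate: Candidate) -> bool:
    """Identity check over the entire candidate (function + helpers/fields)."""
    return original.full_text().strip() == candidate.full_text().strip()


def target_method_is_unchanged(original: Candidate, candidate: Candidate) -> bool:
    """Identity check restricted to the target method only."""
    return original.method.strip() == candidate.method.strip()


original = Candidate(method=ORIGINAL_METHOD)
ai_candidate = Candidate(method=ORIGINAL_METHOD, additions=[ADDED_FIELD])
```

In this sketch the whole-candidate check reports the AI candidate as different from the original, while the method-only check shows that the target was never touched.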
Added _has_unused_additions() in replacement.py that:
This causes replace_function_definitions_for_language() to return False (no update), which skips the candidate.
Validation
/home/ubuntu/e2e-sessions/2026-04-01_15-45_cf1081-unused-additions/
Test Coverage
New TestUnusedAdditionsRejection class with 5 tests:
- test_unchanged_method_with_unused_field_rejected: unchanged method + unused field → rejected
- test_unchanged_method_with_unused_helper_rejected: unchanged method + unused helper → rejected
- test_changed_method_with_used_field_accepted: changed method + used field → accepted
- test_changed_method_without_additions_accepted: normal optimization → accepted
- test_unchanged_method_with_used_helper_accepted: method uses new helper → accepted
Closes CF-1081
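The wiring check and the five scenarios listed under Test Coverage can be sketched roughly as follows. This is not the actual _has_unused_additions() from replacement.py: the function name, its signature, the whitespace normalization, and the regex-based reference test are all assumptions made for illustration, and the Java snippets are hypothetical.

```python
import re
from typing import Iterable


def _normalize(source: str) -> str:
    """Collapse whitespace so formatting-only differences do not count as changes."""
    return " ".join(source.split())


def has_unused_additions(
    original_method: str, candidate_method: str, added_names: Iterable[str]
) -> bool:
    """Return True when the candidate should be rejected (assumed rule):
    something was added, the target method is unchanged, and the method
    references none of the added names."""
    names = list(added_names)
    if not names:
        return False  # nothing was added, so nothing can be unused
    if _normalize(original_method) != _normalize(candidate_method):
        return False  # the target method itself changed
    # Whole-word match so NULL_SUPPLIER does not match NULL_SUPPLIER_2.
    return not any(
        re.search(rf"\b{re.escape(name)}\b", candidate_method) for name in names
    )


# Hypothetical Java method bodies for the five scenarios.
ORIG = 'String get() { return System.getProperty("k"); }'
CHANGED = "String get() { return CACHE; }"
USES_HELPER = 'String get() { return lookup("k"); }'

unused_field = has_unused_additions(ORIG, ORIG, ["NULL_SUPPLIER"])
unused_helper = has_unused_additions(ORIG, ORIG, ["helper"])
changed_used_field = has_unused_additions(ORIG, CHANGED, ["CACHE"])
changed_no_additions = has_unused_additions(ORIG, CHANGED, [])
unchanged_used_helper = has_unused_additions(USES_HELPER, USES_HELPER, ["lookup"])
```

A text-level name match like this would also hit names inside comments or string literals, so a real check would more plausibly work on parsed method bodies; the sketch only shows the decision rule.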