codeflash-omni-java #1199

misrasaurabh1 · 2026-01-30T08:38:44Z

No description provided.

- Add JaCoCo Maven plugin management to build_tools.py: - is_jacoco_configured() to check if plugin exists - add_jacoco_plugin_to_pom() to inject plugin configuration - get_jacoco_xml_path() for coverage report location - Add JacocoCoverageUtils class to coverage_utils.py: - Parses JaCoCo XML reports into CoverageData objects - Handles method boundary detection and line/branch coverage - Update test_runner.py to support coverage collection: - run_behavioral_tests() now handles enable_coverage=True - Automatically adds JaCoCo plugin and runs jacoco:report goal - Update critic.py to enforce 60% coverage threshold for Java (previously Java was bypassed) - Add comprehensive test suite with 19 tests for coverage functionality Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

- Fix config parser to find codeflash.toml for Java projects (was only looking for pyproject.toml) - Fix JaCoCo plugin addition to pom.xml: - Use string manipulation instead of ElementTree to avoid namespace prefix corruption (ns0:project issue) - ElementTree was changing <project> to <ns0:project> which broke Maven - Add Java coverage parsing in parse_test_output.py: - Route Java coverage to JacocoCoverageUtils instead of Python's CoverageUtils Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

- Fix is_jacoco_configured() to search all build/plugins sections recursively, including those in profiles - Fix add_jacoco_plugin_to_pom() to correctly find the main build section when profiles exist (not insert into profile builds) - Add _find_closing_tag() helper to handle nested XML tags - Remove explicit jacoco:report goal from Maven command since the plugin execution binds report to test phase automatically Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

- Add _find_multi_module_root() to detect when tests are in a separate module - Add _get_test_module_target_dir() to find the correct surefire reports dir - Update run_behavioral_tests() and run_benchmarking_tests() to: - Run Maven from the parent project root for multi-module projects - Use -pl <module> -am to build only the test module and dependencies - Use -DfailIfNoTests=false to allow modules without tests to pass - Use -DskipTests=false to override pom.xml skipTests settings - Look for surefire reports in the test module's target directory Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

- Update TestConfig._detect_java_test_framework() to check parent pom.xml for multi-module projects where test deps are in a different module - Add framework aliases in registry to map junit4/testng to Java support - Correctly detect JUnit 4 projects and send correct framework to AI service Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

- Use ^(?:public\s+)?class pattern to match class declaration at start of line - Prevents matching words like "command" or text in comments that contain "class" - Fixes issue where test files were named incorrectly (e.g., "and__perfinstrumented.java") Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

…dule projects - Fix duplicate test file issue: when multiple tests have the same class name, append unique index suffix (e.g., CryptoTest_2) to avoid file overwrites - Fix multi-module JaCoCo support: add JaCoCo plugin to test module's pom.xml instead of source module, ensuring coverage data is collected where tests run - Fix timeout: use minimum 60s (120s with coverage) for Java builds since Maven takes longer than the default 15s INDIVIDUAL_TESTCASE_TIMEOUT - Fix Maven phase: use 'verify' instead of 'test' when coverage is enabled, with maven.test.failure.ignore=true to generate report even if tests fail - Update JaCoCo report phase from 'test' to 'verify' to run after tests complete Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

feat: add JaCoCo test coverage support for Java optimization

- Update coverage_critic to skip coverage check when CoverageStatus.NOT_FOUND is returned (e.g., when JaCoCo report doesn't exist in multi-module projects where the test module has no source classes) - Add JaCoCo configuration to include all class files for multi-module support This fixes "threshold for test confidence was not met" errors that occurred even when all tests passed, because JaCoCo couldn't generate coverage reports for test modules without source classes. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

fix: handle NOT_FOUND coverage status in Java multi-module projects

codeflash-ai · 2026-02-01T21:20:00Z

codeflash/cli_cmds/init_java.py

+    project_root = Path.cwd()
+
+    # Check for existing codeflash config in pom.xml or a separate config file
+    codeflash_config_path = project_root / "codeflash.toml"
+    if codeflash_config_path.exists():


⚡️Codeflash found 70% (0.70x) speedup for should_modify_java_config in codeflash/cli_cmds/init_java.py

⏱️ Runtime : 714 microseconds → 421 microseconds (best of 60 runs)

📝 Explanation and details

The optimized code achieves a 69% speedup (714μs → 421μs) by replacing pathlib.Path operations with equivalent os module functions, which have significantly lower overhead.

Key optimizations:

os.getcwd() instead of Path.cwd(): The line profiler shows Path.cwd() took 689,637ns (34.1% of total time) vs os.getcwd() taking only 68,036ns (7.4%). This is a ~10x improvement because Path.cwd() instantiates a Path object and performs additional normalization, while os.getcwd() returns a raw string from a system call.

os.path.join() instead of Path division operator: Constructing the config path via project_root / "codeflash.toml" took 386,582ns (19.1%) vs os.path.join() taking 190,345ns (20.6%). Though the percentage appears similar, the absolute time is ~50% faster because the / operator creates a new Path object with its associated overhead.

os.path.exists() instead of Path.exists(): The existence check dropped from 476,490ns (23.6%) to 223,477ns (24.2%) - roughly 2x faster. The os.path.exists() function directly calls the stat syscall, while Path.exists() goes through Path's object model.

Why this works:
Path objects provide a cleaner API but add object instantiation, method dispatch, and normalization overhead. For simple filesystem checks in initialization code that runs frequently, using lower-level os functions eliminates this overhead while maintaining identical functionality.

Test results:
All test cases show 68-111% speedup across scenarios including:

Empty directories (fastest: 82-87% improvement)

Large directories with 500 files (68-111% improvement)

Edge cases like symlinks and directory-as-file (75-82% improvement)

The optimization is particularly beneficial for CLI initialization code that may run on every command invocation, where sub-millisecond improvements in frequently-called functions compound into noticeable user experience gains.

✅ Correctness verification report:

Test Status

⚙️ Existing Unit Tests 🔘 None Found

🌀 Generated Regression Tests ✅ 23 Passed

⏪ Replay Tests 🔘 None Found

🔎 Concolic Coverage Tests 🔘 None Found

📊 Tests Coverage 100.0%

🌀 Click to see Generated Regression Tests

from __future__ import annotations # imports import os from pathlib import Path from typing import Any import pytest # used for our unit tests from codeflash.cli_cmds.init_java import should_modify_java_config def test_no_config_file_does_not_prompt_and_returns_true(monkeypatch, tmp_path): # Arrange: ensure working directory has no codeflash.toml monkeypatch.chdir(tmp_path) # set cwd to a clean temporary directory # Replace Confirm.ask with a function that fails the test if called. def fail_if_called(*args, **kwargs): raise AssertionError("Confirm.ask should not be called when no config file exists") # Patch the exact attribute that the function imports at runtime. monkeypatch.setattr("rich.prompt.Confirm.ask", fail_if_called, raising=True) # Act: call function under test codeflash_output = should_modify_java_config(); result = codeflash_output # 28.9μs -> 15.9μs (82.0% faster) def test_config_file_exists_prompts_and_respects_true_choice(monkeypatch, tmp_path): # Arrange: create a codeflash.toml file so the function will detect it monkeypatch.chdir(tmp_path) config_file = tmp_path / "codeflash.toml" config_file.write_text("existing = true") # create the file # Capture the arguments passed to Confirm.ask and return True to simulate user acceptance called = {} def fake_ask(prompt, default, show_default): # Record inputs for later assertions called["prompt"] = prompt called["default"] = default called["show_default"] = show_default return True # Patch Confirm.ask used inside the function monkeypatch.setattr("rich.prompt.Confirm.ask", fake_ask, raising=True) # Act codeflash_output = should_modify_java_config(); result = codeflash_output # 25.6μs -> 13.7μs (86.9% faster) def test_config_file_exists_prompts_and_respects_false_choice(monkeypatch, tmp_path): # Arrange: create the config file monkeypatch.chdir(tmp_path) (tmp_path / "codeflash.toml").write_text("existing = true") # Simulate user declining re-configuration def fake_ask_decline(prompt, default, show_default): return False monkeypatch.setattr("rich.prompt.Confirm.ask", fake_ask_decline, raising=True) # Act codeflash_output = should_modify_java_config(); result = codeflash_output # 24.7μs -> 13.3μs (86.3% faster) def test_presence_of_pom_xml_does_not_trigger_prompt(monkeypatch, tmp_path): # Arrange: create a pom.xml but NOT codeflash.toml monkeypatch.chdir(tmp_path) (tmp_path / "pom.xml").write_text("<project></project>") # If Confirm.ask is called, fail the test because only codeflash.toml should trigger it in current implementation def fail_if_called(*args, **kwargs): raise AssertionError("Confirm.ask should not be called when only pom.xml exists (implementation checks codeflash.toml)") monkeypatch.setattr("rich.prompt.Confirm.ask", fail_if_called, raising=True) # Act codeflash_output = should_modify_java_config(); result = codeflash_output # 28.3μs -> 16.6μs (69.9% faster) def test_codeflash_config_is_directory_triggers_prompt(monkeypatch, tmp_path): # Arrange: create a directory named codeflash.toml (Path.exists will be True) monkeypatch.chdir(tmp_path) (tmp_path / "codeflash.toml").mkdir() # Simulate user selecting True monkeypatch.setattr("rich.prompt.Confirm.ask", lambda *a, **k: True, raising=True) # Act codeflash_output = should_modify_java_config(); result = codeflash_output # 23.6μs -> 12.9μs (82.2% faster) def test_codeflash_config_symlink_triggers_prompt_if_supported(monkeypatch, tmp_path): # Arrange: attempt to create a symlink to a real file; skip if symlink not supported if not hasattr(os, "symlink"): pytest.skip("Platform does not support os.symlink; skipping symlink test") real = tmp_path / "real_config" real.write_text("x = 1") link = tmp_path / "codeflash.toml" try: os.symlink(real, link) # may fail on Windows without privileges except (OSError, NotImplementedError) as e: pytest.skip(f"Could not create symlink on this platform/environment: {e}") monkeypatch.chdir(tmp_path) # Simulate user declining re-configuration monkeypatch.setattr("rich.prompt.Confirm.ask", lambda *a, **k: False, raising=True) # Act codeflash_output = should_modify_java_config(); result = codeflash_output # 24.9μs -> 14.2μs (75.7% faster) def test_large_directory_without_config_is_fast_and_does_not_prompt(monkeypatch, tmp_path): # Large scale scenario: create many files (but under 1000) to simulate busy project directory. monkeypatch.chdir(tmp_path) num_files = 500 # under the 1000 element guideline for i in range(num_files): # Create many innocuous files; should not affect the function's behavior (tmp_path / f"file_{i}.txt").write_text(str(i)) # Ensure Confirm.ask is not called def fail_if_called(*args, **kwargs): raise AssertionError("Confirm.ask should not be called when codeflash.toml is absent even in large directories") monkeypatch.setattr("rich.prompt.Confirm.ask", fail_if_called, raising=True) # Act codeflash_output = should_modify_java_config(); result = codeflash_output # 36.3μs -> 21.6μs (68.0% faster) def test_large_directory_with_config_prompts_once(monkeypatch, tmp_path): # Large scale scenario with config present: many files plus codeflash.toml monkeypatch.chdir(tmp_path) num_files = 500 for i in range(num_files): (tmp_path / f"file_{i}.txt").write_text(str(i)) # Create the config file that should trigger prompting (tmp_path / "codeflash.toml").write_text("reconfigure = maybe") # Track how many times Confirm.ask is invoked to ensure single prompt counter = {"calls": 0} def fake_ask(prompt, default, show_default): counter["calls"] += 1 return True monkeypatch.setattr("rich.prompt.Confirm.ask", fake_ask, raising=True) # Act codeflash_output = should_modify_java_config(); result = codeflash_output # 30.8μs -> 14.6μs (111% faster) # codeflash_output is used to check that the output of the original code is the same as that of the optimized code.

import os import tempfile from pathlib import Path from unittest.mock import MagicMock, patch # imports import pytest from codeflash.cli_cmds.init_java import should_modify_java_config class TestShouldModifyJavaConfigBasic: """Basic test cases for should_modify_java_config function.""" def test_no_config_file_exists_returns_true(self): """ Scenario: Project has no existing codeflash.toml file Expected: Function returns (True, None) without prompting user """ # Create a temporary directory without codeflash.toml with tempfile.TemporaryDirectory() as tmpdir: original_cwd = os.getcwd() try: os.chdir(tmpdir) codeflash_output = should_modify_java_config(); result = codeflash_output finally: os.chdir(original_cwd) def test_config_file_exists_user_confirms(self): """ Scenario: Project has existing codeflash.toml and user confirms re-configuration Expected: Function prompts user and returns (True, None) if user confirms """ with tempfile.TemporaryDirectory() as tmpdir: original_cwd = os.getcwd() try: os.chdir(tmpdir) # Create a codeflash.toml file config_file = Path(tmpdir) / "codeflash.toml" config_file.touch() # Mock the Confirm.ask to return True (user confirms) with patch('rich.prompt.Confirm.ask', return_value=True): codeflash_output = should_modify_java_config(); result = codeflash_output finally: os.chdir(original_cwd) def test_config_file_exists_user_declines(self): """ Scenario: Project has existing codeflash.toml and user declines re-configuration Expected: Function prompts user and returns (False, None) if user declines """ with tempfile.TemporaryDirectory() as tmpdir: original_cwd = os.getcwd() try: os.chdir(tmpdir) # Create a codeflash.toml file config_file = Path(tmpdir) / "codeflash.toml" config_file.touch() # Mock the Confirm.ask to return False (user declines) with patch('rich.prompt.Confirm.ask', return_value=False): codeflash_output = should_modify_java_config(); result = codeflash_output finally: os.chdir(original_cwd) def test_return_tuple_structure(self): """ Scenario: Verify the function always returns a tuple with specific structure Expected: Return value is a tuple of (bool, None) """ with tempfile.TemporaryDirectory() as tmpdir: original_cwd = os.getcwd() try: os.chdir(tmpdir) codeflash_output = should_modify_java_config(); result = codeflash_output finally: os.chdir(original_cwd) class TestShouldModifyJavaConfigEdgeCases: """Edge case test cases for should_modify_java_config function.""" def test_config_file_exists_but_empty(self): """ Scenario: codeflash.toml file exists but is empty Expected: File is still considered as existing, prompts user """ with tempfile.TemporaryDirectory() as tmpdir: original_cwd = os.getcwd() try: os.chdir(tmpdir) # Create an empty codeflash.toml file config_file = Path(tmpdir) / "codeflash.toml" config_file.write_text("") with patch('rich.prompt.Confirm.ask', return_value=True): codeflash_output = should_modify_java_config(); result = codeflash_output finally: os.chdir(original_cwd) def test_config_file_with_content(self): """ Scenario: codeflash.toml file exists with actual TOML content Expected: Prompts user regardless of file content """ with tempfile.TemporaryDirectory() as tmpdir: original_cwd = os.getcwd() try: os.chdir(tmpdir) # Create a codeflash.toml file with content config_file = Path(tmpdir) / "codeflash.toml" config_file.write_text("[codeflash]\nversion = 1\n") with patch('rich.prompt.Confirm.ask', return_value=False): codeflash_output = should_modify_java_config(); result = codeflash_output finally: os.chdir(original_cwd) def test_config_file_case_sensitive(self): """ Scenario: Directory has 'Codeflash.toml' or 'CODEFLASH.TOML' instead of lowercase Expected: Function only recognizes 'codeflash.toml' (case-sensitive on Unix) """ with tempfile.TemporaryDirectory() as tmpdir: original_cwd = os.getcwd() try: os.chdir(tmpdir) # Create a file with different casing config_file = Path(tmpdir) / "Codeflash.toml" config_file.touch() codeflash_output = should_modify_java_config(); result = codeflash_output finally: os.chdir(original_cwd) def test_config_file_is_directory_not_file(self): """ Scenario: codeflash.toml exists as a directory instead of a file Expected: Path.exists() still returns True, prompts user """ with tempfile.TemporaryDirectory() as tmpdir: original_cwd = os.getcwd() try: os.chdir(tmpdir) # Create codeflash.toml as a directory config_dir = Path(tmpdir) / "codeflash.toml" config_dir.mkdir() with patch('rich.prompt.Confirm.ask', return_value=True): codeflash_output = should_modify_java_config(); result = codeflash_output finally: os.chdir(original_cwd)

To test or edit this optimization locally git merge codeflash/optimize-pr1199-2026-02-01T21.20.00

Suggested change

project_root = Path.cwd()

# Check for existing codeflash config in pom.xml or a separate config file

codeflash_config_path = project_root / "codeflash.toml"

if codeflash_config_path.exists():

project_root = os.getcwd()

# Check for existing codeflash config in pom.xml or a separate config file

codeflash_config_path = os.path.join(project_root, "codeflash.toml")

if os.path.exists(codeflash_config_path):

This optimization achieves a **26x speedup (2598% improvement)** by eliminating expensive logging operations that dominated the original runtime. ## Key Performance Improvements ### 1. **Conditional Logging Guard (95% of original time eliminated)** The original code unconditionally formatted expensive log messages even when logging was disabled: ```python logger.warning( f"Optimized code not found for {relative_path} In the context\n-------\n{optimized_code}\n-------\n" ... ) ``` This single operation consumed **111ms out of 117ms total runtime** (95%). The optimization adds a guard check: ```python if logger.isEnabledFor(logger.level): logger.warning(...) ``` This prevents string formatting and object serialization when the log message won't be emitted, dramatically reducing overhead in production scenarios where warning-level logging may be disabled. ### 2. **Eliminated Redundant Path Object Creation** The original created `Path` objects repeatedly during filename matching: ```python if file_path_str and Path(file_path_str).name == target_filename: ``` The optimized version uses string operations: ```python if file_path_str.endswith(target_filename) and (len(file_path_str) == len(target_filename) or file_path_str[-len(target_filename)-1] in ('/', '\\')): ``` This removes overhead from Path instantiation (1.16ms → 44µs in the profiler). ### 3. **Minor Cache Lookup Optimization** Changed from `self._cache.get("file_to_path") is not None` to `"file_to_path" in self._cache` and hoisted the dict assignment to avoid inline mutation, providing small gains in the caching path. ### 4. **String Conversion Hoisting** Pre-computed `relative_path_str = str(relative_path)` to avoid repeated conversions. ## Test Case Performance Patterns - **Exact path matches** (most common case): 10-20% faster due to optimized caching - **No-match scenarios** (fallback paths): **78-189x faster** due to eliminated logger.warning overhead - `test_empty_code_strings`: 1.03ms → 12.9µs (7872% faster) - `test_no_match_multiple_blocks`: 1.28ms → 16.3µs (7753% faster) - `test_many_code_blocks_no_match`: 20.5ms → 107µs (18985% faster) The optimization particularly benefits scenarios where file path mismatches occur, as these trigger the expensive warning path in the original code. For the common case of exact matches, the improvements are modest but consistent.

…2026-02-06T15.25.29 ⚡️ Speed up function `_get_test_module_target_dir` by 43% in PR #1401 (`fix/behavioral-equivalence-improvements`)

The Java test_runner was returning (result_xml_path, result, sqlite_db_path, coverage_xml_path) but the caller expected coverage_database_file to be the JaCoCo XML path, not the SQLite path. This caused the XML parser to fail with "syntax error: line 1, column 0" when trying to parse a SQLite database as XML. Also added improved logging and error handling for JaCoCo coverage parsing. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

…rder fix: correct return value order in Java test_runner for coverage

Remove the pass_fail_only fallback mode that allowed accepting optimizations with different return values as long as tests passed. This mode compromised correctness by silently ignoring behavioral differences when SQLite files were missing. Changes: - Remove pass_fail_only parameter from compare_test_results() - Replace fallback with fail-fast error when SQLite files missing - Add TODO comments for fixing test instrumentation - Remove test_pass_fail_warning.py (testing removed feature) - Remove TestPassFailFallbackBehavior from test_comparison_decision.py - Update canary tests to reflect new strict behavior Rationale: Codeflash must guarantee behavioral equivalence. Accepting optimizations without proper return value comparison degrades correctness guarantees. If SQLite files are missing, we now fail with a clear error message directing users to fix instrumentation. TODO: Ensure SQLite files are always generated by: 1. Java: Fix JavaTestInstrumentation to always capture return values 2. JavaScript: Ensure JS instrumentation runs before optimization 3. Other languages: Implement proper test result capture Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

…ments Resolved conflicts in 3 files: - build_tools.py: Keep our project_root parameter support - test_discovery.py: Merge performance optimizations from origin with our overload detection - test_runner.py: Accept origin's logging improvements and return signature changes Key changes from origin/omni-java: - Kryo serialization support for Java runtime - Performance optimizations in test discovery (_compute_file_context, _match_test_method_with_context) - Enhanced logging for Java coverage and test execution - Bug fixes in coverage return value ordering Preserved from our branch: - Overload detection and logging in discover_tests() - disambiguate_overloads() function for handling ambiguous matches - project_root parameter in find_maven_executable and find_gradle_executable All tests pass after merge. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

…rs_from_pom functions These functions were removed during the merge with origin/omni-java but are needed by our test files (test_build_tools.py and test_java_test_paths.py). Added back: - _extract_modules_from_pom_content: Extracts Maven module names from POM XML - _extract_source_dirs_from_pom: Extracts custom source/test directories from POM Note: 10 tests still failing related to test discovery import-based logic. The new test discovery implementation from origin/omni-java uses different matching strategies that may need test adjustments. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

Merged omni-java base into PR #1279 to resolve conflicts. Resolution approach: 1. test_discovery.py: Used refactored method call resolution from base - New approach uses sophisticated type tracking (jedi-like "goto") - Already includes duplicate checking (line 141) - Removed old Strategy 3 (class-based fallback) as it's not needed and caused single-function optimization issues 2. test_instrumentation.py: Combined both changes - Added API key setup from PR #1279 - Kept FunctionToOptimize imports from base The refactored code is more accurate and fixes the single-function optimization issue that existed in the original PR. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

…ong-associations fix: prevent duplicate and wrong test-to-function associations in Java

…-support Add concurrency pattern detection for Java optimization

Fixed two critical bugs in JavaLineProfiler that prevented compilation: 1. OUTPUT_FILE declaration: Changed from repr() (single quotes) to regular string interpolation (double quotes). Java requires double quotes for string literals. 2. JSON generation: Changed all literal newlines (\n) to escaped newlines (\\n) in StringBuilder append calls. Java does not support multi-line string literals without escape sequences. Both bugs caused Java compilation errors. After fixes: - Code compiles successfully with Maven - E2E tests pass (instrumentation → compilation → execution → profiling) - All unit tests pass (8/9, 1 skipped) - Profile data correctly captured and parsed Tested with real Java code (Fibonacci.sumFibonacci) - verified hotspot identification works correctly. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

Resolved conflicts in test_runner.py by keeping the run_line_profile_tests function from the feature branch and maintaining the get_test_run_command signature from omni-java. The line profiling feature is now up-to-date with the latest omni-java changes. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

Add line profiling for Java optimization

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

Resolved conflicts in test_runner.py by keeping both _extract_source_dirs_from_pom from the PR branch and run_line_profile_tests from the base branch. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

The optimized code achieves a **17% runtime improvement** through two strategic optimizations that work together to reduce unnecessary computation: ## Primary Optimization: Early Exit via String Check The key improvement is adding `if "modules" not in content: return []` before XML parsing. This simple string check provides massive speedups in specific scenarios: - **When no modules exist**: Avoids expensive XML parsing entirely (up to **4099% faster** for POMs without modules) - **For invalid/empty input**: Prevents unnecessary parse attempts (up to **121,945% faster** for malformed XML) Looking at the line profiler results, 66.9% of the original runtime was spent on the logger.debug call during parse errors. By catching cases without "modules" upfront, we skip both the parsing attempt and the expensive logging operation. ## Secondary Optimization: Precomputed Namespace Constants Moving the Maven namespace string to module-level constants (`_MAVEN_NS` and `_M_MODULES_TAG`) eliminates redundant string formatting and dictionary creation on every function call. While this saves only 2-3% in typical cases, it adds up when the function is called repeatedly (as seen in the 1000-module tests). ## Performance Characteristics The optimization shines in different scenarios based on the annotated tests: - **Empty/Invalid POMs** (no modules): 3000-4000% faster - early exit avoids all parsing - **Standard POMs** (with modules): 14-21% faster - benefits from precomputed constants and reduced overhead - **Large POMs** (1000+ modules): 1-3% faster - parsing dominates, but constant optimization still helps - **Malformed XML edge cases**: Up to 121,945% faster by avoiding parse + log overhead ## Impact on Workloads Based on `function_references`, this function is called from test infrastructure that parses Maven POMs to discover multi-module projects. The optimization is particularly valuable because: 1. **Parsing happens frequently**: Tests run against many projects, some without modules 2. **Error cases are common**: Real-world projects may have POMs without module declarations, or the function may be called on non-POM files 3. **No hot loop context visible**: Function appears to be called once per POM, so even moderate speedups compound across large test suites The string check is a classic "fail-fast" pattern that pays off handsomely when the failure case (no modules) is reasonably common in the workload.

fix: wire Java formatter into detection pipeline

…n-failures fix: detect and log Java compilation failures explicitly

Merge latest changes from base branch including: - Java compilation error detection (PR #1394) - Java formatter detection via google-java-format (PR #1400) - Enhanced test coverage for comparator logic Conflict resolution: - tests/test_languages/test_java/test_comparison_decision.py: Used PR version that enforces strict correctness (no pass_fail_only fallback tests) to align with PR 1401's goal of removing pass_fail_only mode entirely. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

…2026-02-10T17.32.41 ⚡️ Speed up function `_extract_modules_from_pom_content` by 18% in PR #1401 (`fix/behavioral-equivalence-improvements`)

…-improvements feat: Maven infrastructure and behavioral equivalence improvements

## Problem Fixed Generated Java test files were written to incorrect paths outside Maven's standard `src/test/java/` directory structure, causing Maven compilation to fail and blocking the entire Java E2E optimization pipeline. ## Root Cause The `_get_java_sources_root()` function didn't handle the case where `tests_root` was already set to the Maven test directory (`src/test/java`). When tests_root was the project root, it would find "java" in the path components and return the wrong directory. When tests_root was already the test directory, it would try to append `src/test/java` again, creating an invalid duplicate path. ## Solution Added two checks at the beginning of `_get_java_sources_root()`: 1. Check if `tests_root` already ends with `src/test/java` (Maven-standard) - If yes, return it as-is 2. Check if `tests_root/src/test/java` exists as a subdirectory - If yes, return that path This handles both scenarios: - When tests_root is project root: adds `src/test/java` - When tests_root is already test dir: returns it unchanged ## Verification Before fix: - Test files 0 & 1: `java/com/example/` (WRONG - outside src/) - Test file 2: `java/src/test/java/com/example/` (CORRECT) - Maven compilation: FAILED After fix: - All test files: `java/src/test/java/com/example/` (CORRECT) - Files created on disk in correct location - Maven can now find test files ## Impact - Resolves critical P0 bug blocking Java E2E optimization - All generated test files now correctly placed in Maven structure - Enables Maven compilation to proceed - Clean, focused fix with no side effects Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

fix pom.xml

fix: correct Java test file path generation for Maven structure

…rmer [FIX] java assertion transformer

[FIX] JAVA line profiler attrs

misrasaurabh1 and others added 18 commits January 30, 2026 00:37

wip java support

29f266e

add Java code to optimize with tests

cbb532f

add Class and Proxy type handlers to Serializer

a4ee9eb

Fix Map key collision

1e0236b

e2e working java

06353ea

Merge remote-tracking branch 'origin/omni-java' into omni-java

b050b65

make tests do full string equality check

045b4dd

fix code context extraction bugs

c35ce69

syntax error for code extraction is not allowed

f201e66

Merge branch 'main' into omni-java

e8c5227

thorough tests for code replacement

090e775

Merge remote-tracking branch 'origin/omni-java' into omni-java

c53c20a

fix instrumentation

d886de3

more tests

60fefbc

progress on instrumentation of java code

77cddec

misrasaurabh1 force-pushed the omni-java branch from d2050b1 to 77cddec Compare January 31, 2026 09:09

misrasaurabh1 and others added 7 commits January 31, 2026 09:31

Merge pull request #1224 from codeflash-ai/java-coverage-support

ee06331

feat: add JaCoCo test coverage support for Java optimization

Merge pull request #1229 from codeflash-ai/fix-java-coverage-critic

c299d99

fix: handle NOT_FOUND coverage status in Java multi-module projects

codeflash-ai bot reviewed Feb 1, 2026

View reviewed changes

This was referenced Feb 1, 2026

⚡️ Speed up function _prompt_custom_directory by 365% in PR #1199 (omni-java) #1237

Closed

⚡️ Speed up function get_java_runtime_setup_steps by 20% in PR #1199 (omni-java) #1239

Closed

misrasaurabh1 and others added 30 commits February 6, 2026 12:33

Merge pull request #1403 from codeflash-ai/codeflash/optimize-pr1401-…

098cb78

…2026-02-06T15.25.29 ⚡️ Speed up function `_get_test_module_target_dir` by 43% in PR #1401 (`fix/behavioral-equivalence-improvements`)

Merge pull request #1417 from codeflash-ai/fix/java-coverage-return-o…

9558bb6

…rder fix: correct return value order in Java test_runner for coverage

fix: use Kryo serialization and deep comparison for Java test results

98a5a43

Merge pull request #1279 from codeflash-ai/fix/java-test-discovery-wr…

05f5e6e

…ong-associations fix: prevent duplicate and wrong test-to-function associations in Java

Merge pull request #1283 from codeflash-ai/feat/java-async-concurrent…

2d64b35

…-support Add concurrency pattern detection for Java optimization

Merge pull request #1281 from codeflash-ai/feat/java-line-profiling

5a2f147

Add line profiling for Java optimization

chore: merge omni-java base into fix/wire-java-formatter

c6703d6

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

chore: merge omni-java base into fix/detect-java-compilation-failures

410cbcd

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

chore: merge omni-java base into fix/behavioral-equivalence-improvements

58d4b94

Resolved conflicts in test_runner.py by keeping both _extract_source_dirs_from_pom from the PR branch and run_line_profile_tests from the base branch. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

Merge pull request #1400 from codeflash-ai/fix/wire-java-formatter

a732a69

fix: wire Java formatter into detection pipeline

Merge pull request #1394 from codeflash-ai/fix/detect-java-compilatio…

fd6be1e

…n-failures fix: detect and log Java compilation failures explicitly

Merge pull request #1434 from codeflash-ai/codeflash/optimize-pr1401-…

a9de8bb

…2026-02-10T17.32.41 ⚡️ Speed up function `_extract_modules_from_pom_content` by 18% in PR #1401 (`fix/behavioral-equivalence-improvements`)

Merge pull request #1401 from codeflash-ai/fix/behavioral-equivalence…

56c2b4a

…-improvements feat: Maven infrastructure and behavioral equivalence improvements

fix pom.xml

b749958

Merge pull request #1435 from codeflash-ai/fix/pom.xml

82f2371

fix pom.xml

Merge pull request #1436 from codeflash-ai/fix/java-test-path-generation

df5b6a2

fix: correct Java test file path generation for Maven structure

Fix line profiler

0d10dbb

fix asserts

4740725

Merge pull request #1450 from codeflash-ai/fix/java-assertion-transfo…

f215bd3

…rmer [FIX] java assertion transformer

Merge pull request #1449 from codeflash-ai/fix/java-line-profiler-attrs

6ba6a28

[FIX] JAVA line profiler attrs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

codeflash-omni-java #1199

codeflash-omni-java #1199

Uh oh!

misrasaurabh1 commented Jan 30, 2026

Uh oh!

codeflash-ai bot Feb 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Test	Status
⚙️ Existing Unit Tests	🔘 None Found
🌀 Generated Regression Tests	✅ 23 Passed
⏪ Replay Tests	🔘 None Found
🔎 Concolic Coverage Tests	🔘 None Found
📊 Tests Coverage	100.0%

codeflash-omni-java #1199

Are you sure you want to change the base?

codeflash-omni-java #1199

Uh oh!

Conversation

misrasaurabh1 commented Jan 30, 2026

Uh oh!

codeflash-ai bot Feb 1, 2026

Choose a reason for hiding this comment

⚡️Codeflash found 70% (0.70x) speedup for should_modify_java_config in codeflash/cli_cmds/init_java.py

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

⚡️Codeflash found 70% (0.70x) speedup for `should_modify_java_config` in `codeflash/cli_cmds/init_java.py`