- Notifications
You must be signed in to change notification settings - Fork 15.3k
[LifetimeSafety] Enhance benchmark script for new sub analyses #149577
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
| ✅ With the latest revision this PR passed the Python code formatter. |
cb9a521 to 9e6a675 Compare 9e6a675 to d4f5457 Compare 6cb3654 to c55a8f9 Compare 8aa7431 to 4443dff Compare c55a8f9 to 0b3061e Compare 4443dff to 22e01ed Compare 0b3061e to cf062ae Compare 22e01ed to ac130f9 Compare cf062ae to d21e733 Compare d21e733 to e3c21f3 Compare e3c21f3 to 7a8dd3e Compare 7a8dd3e to 08cd7f5 Compare | @llvm/pr-subscribers-clang @llvm/pr-subscribers-clang-static-analyzer-1 Author: Utkarsh Saxena (usx95) ChangesEnhanced the lifetime safety analysis benchmark script with more detailed performance metrics and a new nested loop test case. This is a worst case for loan expiry analysis. What changed?
Report: Lifetime Analysis Performance Report — Generated on: 2025-08-18 13:29:57
Test Case: Pointer Cycle in Loop
Timing Results:
Complexity Analysis:
Test Case: CFG Merges
Timing Results:
Complexity Analysis:
Test Case: Deeply Nested Loops
Timing Results:
Complexity Analysis:
Full diff: https://github.com/llvm/llvm-project/pull/149577.diff — 1 file affected:
diff --git a/clang/test/Analysis/LifetimeSafety/benchmark.py b/clang/test/Analysis/LifetimeSafety/benchmark.py index 9d5f36c51b9ee..4421fe9a81e21 100644 --- a/clang/test/Analysis/LifetimeSafety/benchmark.py +++ b/clang/test/Analysis/LifetimeSafety/benchmark.py @@ -99,28 +99,84 @@ def generate_cpp_merge_test(n: int) -> str: return cpp_code -def analyze_trace_file(trace_path: str) -> tuple[float, float]: +def generate_cpp_nested_loop_test(n: int) -> str: """ - Parses the -ftime-trace JSON output to find durations. + Generates C++ code with N levels of nested loops. + This pattern tests how analysis performance scales with loop nesting depth, + which is a key factor in the complexity of dataflow analyses on structured + control flow. - Returns: - A tuple of (lifetime_analysis_duration_us, total_clang_duration_us). + Example (n=3): + struct MyObj { int id; ~MyObj() {} }; + void nested_loops_3() { + MyObj* p = nullptr; + for(int i0=0; i0<2; ++i0) { + MyObj s0; + p = &s0; + for(int i1=0; i1<2; ++i1) { + MyObj s1; + p = &s1; + for(int i2=0; i2<2; ++i2) { + MyObj s2; + p = &s2; + } + } + } + } + """ + if n <= 0: + return "// Nesting depth must be positive." + + cpp_code = "struct MyObj { int id; ~MyObj() {} };\n\n" + cpp_code += f"void nested_loops_{n}() {{\n" + cpp_code += " MyObj* p = nullptr;\n" + + for i in range(n): + indent = " " * (i + 1) + cpp_code += f"{indent}for(int i{i}=0; i{i}<2; ++i{i}) {{\n" + cpp_code += f"{indent} MyObj s{i}; p = &s{i};\n" + + for i in range(n - 1, -1, -1): + indent = " " * (i + 1) + cpp_code += f"{indent}}}\n" + + cpp_code += "}\n" + cpp_code += f"\nint main() {{ nested_loops_{n}(); return 0; }}\n" + return cpp_code + + +def analyze_trace_file(trace_path: str) -> dict: """ - lifetime_duration = 0.0 - total_duration = 0.0 + Parses the -ftime-trace JSON output to find durations for the lifetime + analysis and its sub-phases. + Returns a dictionary of durations in microseconds. 
+ """ + durations = { + "lifetime_us": 0.0, + "total_us": 0.0, + "fact_gen_us": 0.0, + "loan_prop_us": 0.0, + "expired_loans_us": 0.0, + } + event_name_map = { + "LifetimeSafetyAnalysis": "lifetime_us", + "ExecuteCompiler": "total_us", + "FactGenerator": "fact_gen_us", + "LoanPropagation": "loan_prop_us", + "ExpiredLoans": "expired_loans_us", + } try: with open(trace_path, "r") as f: trace_data = json.load(f) for event in trace_data.get("traceEvents", []): - if event.get("name") == "LifetimeSafetyAnalysis": - lifetime_duration += float(event.get("dur", 0)) - if event.get("name") == "ExecuteCompiler": - total_duration += float(event.get("dur", 0)) - + event_name = event.get("name") + if event_name in event_name_map: + key = event_name_map[event_name] + durations[key] += float(event.get("dur", 0)) except (IOError, json.JSONDecodeError) as e: print(f"Error reading or parsing trace file {trace_path}: {e}", file=sys.stderr) - return 0.0, 0.0 - return lifetime_duration, total_duration + return {key: 0.0 for key in durations} + return durations def power_law(n, c, k): @@ -135,8 +191,29 @@ def human_readable_time(ms: float) -> str: return f"{ms:.2f} ms" +def calculate_complexity(n_data, y_data) -> tuple[float | None, float | None]: + """ + Calculates the exponent 'k' for the power law fit y = c * n^k. + Returns a tuple of (k, k_standard_error). 
+ """ + try: + if len(n_data) < 3 or np.all(y_data < 1e-6) or np.var(y_data) < 1e-6: + return None, None + + non_zero_indices = y_data > 0 + if np.sum(non_zero_indices) < 3: + return None, None + + n_fit, y_fit = n_data[non_zero_indices], y_data[non_zero_indices] + popt, pcov = curve_fit(power_law, n_fit, y_fit, p0=[0, 1], maxfev=5000) + k_stderr = np.sqrt(np.diag(pcov))[1] + return popt[1], k_stderr + except (RuntimeError, ValueError): + return None, None + + def generate_markdown_report(results: dict) -> str: - """Generates a Markdown-formatted report from the benchmark results.""" + """Generates a concise, Markdown-formatted report from the benchmark results.""" report = [] timestamp = datetime.now().strftime("%Y-%m-%d %H:%M:%S %Z") report.append(f"# Lifetime Analysis Performance Report") @@ -146,54 +223,52 @@ def generate_markdown_report(results: dict) -> str: for test_name, data in results.items(): title = data["title"] report.append(f"## Test Case: {title}") - report.append("") + report.append("\n**Timing Results:**\n") # Table header - report.append("| N | Analysis Time | Total Clang Time |") - report.append("|:----|--------------:|-----------------:|") + report.append( + "| N (Input Size) | Total Time | Analysis Time (%) | Fact Generator (%) | Loan Propagation (%) | Expired Loans (%) |" + ) + report.append( + "|:---------------|-----------:|------------------:|-------------------:|---------------------:|------------------:|" + ) # Table rows n_data = np.array(data["n"]) - analysis_data = np.array(data["lifetime_ms"]) - total_data = np.array(data["total_ms"]) + total_ms_data = np.array(data["total_ms"]) for i in range(len(n_data)): - analysis_str = human_readable_time(analysis_data[i]) - total_str = human_readable_time(total_data[i]) - report.append(f"| {n_data[i]:<3} | {analysis_str:>13} | {total_str:>16} |") - - report.append("") - - # Complexity analysis - report.append(f"**Complexity Analysis:**") - try: - # Curve fitting requires at least 3 points - if 
len(n_data) < 3: - raise ValueError("Not enough data points to perform curve fitting.") - - popt, pcov = curve_fit( - power_law, n_data, analysis_data, p0=[0, 2], maxfev=5000 - ) - _, k = popt - - # Confidence Interval for k - alpha = 0.05 # 95% confidence - dof = max(0, len(n_data) - len(popt)) # degrees of freedom - t_val = t.ppf(1.0 - alpha / 2.0, dof) - # Standard error of the parameters - perr = np.sqrt(np.diag(pcov)) - k_stderr = perr[1] - k_ci_lower = k - t_val * k_stderr - k_ci_upper = k + t_val * k_stderr - - report.append( - f"- The performance for this case scales approx. as **O(n<sup>{k:.2f}</sup>)**." - ) - report.append( - f"- **95% Confidence interval for exponent:** `[{k_ci_lower:.2f}, {k_ci_upper:.2f}]`." - ) + total_t = total_ms_data[i] + if total_t < 1e-6: + total_t = 1.0 # Avoid division by zero + + row = [ + f"| {n_data[i]:<14} |", + f"{human_readable_time(total_t):>10} |", + f"{data['lifetime_ms'][i] / total_t * 100:>17.2f}% |", + f"{data['fact_gen_ms'][i] / total_t * 100:>18.2f}% |", + f"{data['loan_prop_ms'][i] / total_t * 100:>20.2f}% |", + f"{data['expired_loans_ms'][i] / total_t * 100:>17.2f}% |", + ] + report.append(" ".join(row)) + + report.append("\n**Complexity Analysis:**\n") + report.append("| Analysis Phase | Complexity O(n<sup>k</sup>) |") + report.append("|:------------------|:--------------------------|") + + analysis_phases = { + "Total Analysis": data["lifetime_ms"], + "FactGenerator": data["fact_gen_ms"], + "LoanPropagation": data["loan_prop_ms"], + "ExpiredLoans": data["expired_loans_ms"], + } - except (RuntimeError, ValueError) as e: - report.append(f"- Could not determine a best-fit curve for the data: {e}") + for phase_name, y_data in analysis_phases.items(): + k, delta = calculate_complexity(n_data, np.array(y_data)) + if k is not None and delta is not None: + complexity_str = f"O(n<sup>{k:.2f}</sup> ± {delta:.2f})" + else: + complexity_str = "(Negligible)" + report.append(f"| {phase_name:<17} | {complexity_str:<25} |") 
report.append("\n---\n") @@ -202,7 +277,7 @@ def generate_markdown_report(results: dict) -> str: def run_single_test( clang_binary: str, output_dir: str, test_name: str, generator_func, n: int -) -> tuple[float, float]: +) -> dict: """Generates, compiles, and benchmarks a single test case.""" print(f"--- Running Test: {test_name.capitalize()} with N={n} ---") @@ -221,7 +296,8 @@ def run_single_test( "-o", "/dev/null", "-ftime-trace=" + trace_file, - "-Wexperimental-lifetime-safety", + "-Xclang", + "-fexperimental-lifetime-safety", "-std=c++17", source_file, ] @@ -231,11 +307,12 @@ def run_single_test( if result.returncode != 0: print(f"Compilation failed for N={n}!", file=sys.stderr) print(result.stderr, file=sys.stderr) - return 0.0, 0.0 + return {} - lifetime_us, total_us = analyze_trace_file(trace_file) - - return lifetime_us / 1000.0, total_us / 1000.0 + durations_us = analyze_trace_file(trace_file) + return { + key.replace("_us", "_ms"): value / 1000.0 for key, value in durations_us.items() + } if __name__ == "__main__": @@ -270,6 +347,12 @@ def run_single_test( "generator_func": generate_cpp_merge_test, "n_values": [10, 50, 100, 200, 400, 800], }, + { + "name": "nested_loops", + "title": "Deeply Nested Loops", + "generator_func": generate_cpp_nested_loop_test, + "n_values": [10, 50, 100, 200, 400, 800], + }, ] results = {} @@ -282,21 +365,28 @@ def run_single_test( "n": [], "lifetime_ms": [], "total_ms": [], + "fact_gen_ms": [], + "loan_prop_ms": [], + "expired_loans_ms": [], } for n in config["n_values"]: - lifetime_ms, total_ms = run_single_test( + durations_ms = run_single_test( args.clang_binary, args.output_dir, test_name, config["generator_func"], n, ) - if total_ms > 0: + if durations_ms: results[test_name]["n"].append(n) - results[test_name]["lifetime_ms"].append(lifetime_ms) - results[test_name]["total_ms"].append(total_ms) + for key, value in durations_ms.items(): + results[test_name][key].append(value) + print( - f" Total: 
{human_readable_time(total_ms)} | Analysis: {human_readable_time(lifetime_ms)}" + f" Total Analysis: {human_readable_time(durations_ms['lifetime_ms'])} | " + f"FactGen: {human_readable_time(durations_ms['fact_gen_ms'])} | " + f"LoanProp: {human_readable_time(durations_ms['loan_prop_ms'])} | " + f"ExpiredLoans: {human_readable_time(durations_ms['expired_loans_ms'])}" ) print("\n\n" + "=" * 80) @@ -305,3 +395,8 @@ def run_single_test( markdown_report = generate_markdown_report(results) print(markdown_report) + + report_filename = os.path.join(args.output_dir, "performance_report.md") + with open(report_filename, "w") as f: + f.write(markdown_report) + print(f"Report saved to: {report_filename}") |

Enhanced the lifetime safety analysis benchmark script with more detailed performance metrics and a new nested loop test case. This is a worst case for loan expiry analysis.
What changed?
A new test case generator, `nested_loops`, that generates code with N levels of nested loops to test how analysis performance scales with loop nesting depth.
Report
Lifetime Analysis Performance Report
Test Case: Pointer Cycle in Loop
Timing Results:
Complexity Analysis:
Test Case: CFG Merges
Timing Results:
Complexity Analysis:
Test Case: Deeply Nested Loops
Timing Results:
Complexity Analysis: