feat: Introduce FastExcel Benchmark Performance Testing Module #575

GOODBOY008 · 2025-09-14T12:27:29Z

Overview

This PR introduces a comprehensive benchmark performance testing module for FastExcel, implementing the proposal outlined in #572.

Benchmark Results Available

CI Benchmark Run Completed Successfully: https://github.com/GOODBOY008/fastexcel/actions/runs/17709908635

Benchmark Artifacts: The workflow generated comprehensive benchmark reports available in the artifacts:

• HTML Reports: Interactive benchmark comparison reports
• Raw Results: JMH benchmark data in JSON format
• Analysis: Performance analysis and comparison metrics

To view the benchmark reports:

Download the benchmark-results artifact from: https://github.com/GOODBOY008/fastexcel/actions/runs/17709908635
Unzip the downloaded file
Open benchmark-reports/benchmark-comparison.html in your browser

What's Changed

New Module: fastexcel-benchmark

• JMH Integration: Complete Maven configuration for JMH, the industry-standard Java microbenchmarking framework
• Comprehensive Test Suites:
◦ Comparison benchmarks (FastExcel vs Apache POI)
◦ Specialized memory-efficiency tests
◦ Streaming operation performance tests
◦ Microbenchmarks for core components
• Automated Execution: Multi-profile support with configurable dataset sizes and memory settings
• Advanced Features:
◦ Interactive CLI with scenario management
◦ Real-time memory profiling with GC tracking
◦ HTML visualization reports and JSON data export
◦ Performance trend analysis and regression detection
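
To make the "memory profiling with GC tracking" item more concrete, here is a minimal sketch of the general technique using only the JDK's `java.lang.management` beans. It is not the PR's `MemoryProfiler`; the class name `MemorySnapshotSketch` and its methods are hypothetical and exist only for illustration.

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;
import java.lang.management.MemoryMXBean;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper (not from the PR): snapshots heap usage and GC counters around a task. */
final class MemorySnapshotSketch {

    /** Heap usage and cumulative GC counters at one instant. */
    static final class Snapshot {
        final long usedHeapBytes;
        final long gcCount;
        final long gcTimeMillis;

        Snapshot(long usedHeapBytes, long gcCount, long gcTimeMillis) {
            this.usedHeapBytes = usedHeapBytes;
            this.gcCount = gcCount;
            this.gcTimeMillis = gcTimeMillis;
        }
    }

    /** Reads current heap usage and sums the counters of all registered collectors. */
    static Snapshot capture() {
        MemoryMXBean memory = ManagementFactory.getMemoryMXBean();
        long used = memory.getHeapMemoryUsage().getUsed();
        long count = 0;
        long time = 0;
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            // Both getters return -1 when the collector does not report the value.
            count += Math.max(0, gc.getCollectionCount());
            time += Math.max(0, gc.getCollectionTime());
        }
        return new Snapshot(used, count, time);
    }

    /** Runs the task and returns {heapDeltaBytes, gcCountDelta, gcTimeDeltaMillis}. */
    static long[] measure(Runnable task) {
        Snapshot before = capture();
        task.run();
        Snapshot after = capture();
        return new long[] {
            after.usedHeapBytes - before.usedHeapBytes,
            after.gcCount - before.gcCount,
            after.gcTimeMillis - before.gcTimeMillis
        };
    }

    public static void main(String[] args) {
        long[] delta = measure(() -> {
            // Allocate roughly 16 MiB so the heap delta is visible.
            List<byte[]> chunks = new ArrayList<>();
            for (int i = 0; i < 16; i++) {
                chunks.add(new byte[1 << 20]);
            }
        });
        System.out.printf("heap delta=%d bytes, GC runs=%d, GC time=%d ms%n",
                delta[0], delta[1], delta[2]);
    }
}
```

The heap delta can be negative if a collection runs during the task, so a sketch like this is better read as a rough indicator than as an exact allocation count.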

Key Components

• Core Framework (cn.idev.excel.benchmark.core)
◦ Abstract benchmark base classes
◦ Configuration management
◦ Memory profiler integration
• Test Scenarios (cn.idev.excel.benchmark.*)
◦ Read/Write operation benchmarks
◦ Fill operation performance tests
◦ Streaming benchmarks for large datasets
◦ Memory efficiency analysis
• Comparison Benchmarks (cn.idev.excel.benchmark.comparison)
◦ Direct FastExcel vs Apache POI performance comparison
◦ Multi-dimensional analysis (throughput, latency, memory)
• Utilities (cn.idev.excel.benchmark.utils)
◦ Test data generation
◦ File management utilities
◦ Reporting and visualization
• Automated Scripts (scripts/benchmark-runner.sh)
◦ Profile-based execution (quick/standard/comprehensive)
◦ Configurable parameters and output formats
◦ Regression analysis automation
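
As an illustration of what "regression analysis automation" could mean in practice, the sketch below flags a throughput regression when the current score falls more than a chosen tolerance below a baseline. This is an assumption about the general approach, not the logic shipped in this PR; `RegressionCheckSketch`, its method names, and the 10% tolerance are hypothetical.

```java
/** Hypothetical regression check (not from the PR): compares a current score against a baseline. */
final class RegressionCheckSketch {

    /** Relative change of current versus baseline, e.g. -0.15 means 15% lower. */
    static double relativeChange(double baseline, double current) {
        if (baseline <= 0) {
            throw new IllegalArgumentException("baseline must be positive");
        }
        return (current - baseline) / baseline;
    }

    /**
     * For throughput-style scores (higher is better): reports a regression when the
     * current score dropped by more than the given tolerance (0.10 = 10%).
     */
    static boolean isThroughputRegression(double baselineOpsPerSec,
                                          double currentOpsPerSec,
                                          double tolerance) {
        return relativeChange(baselineOpsPerSec, currentOpsPerSec) < -tolerance;
    }

    public static void main(String[] args) {
        double baseline = 100.0; // illustrative numbers, not measured results
        double current = 85.0;
        System.out.println("relative change: " + relativeChange(baseline, current));
        System.out.println("regression: " + isThroughputRegression(baseline, current, 0.10));
    }
}
```

For latency-style scores (lower is better) the comparison would be inverted. A real check would probably also account for the error margins that JMH reports alongside each score.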

GitHub Actions Integration

• Workflow (.github/workflows/benchmark.yml)
◦ Manual trigger with workflow_dispatch for on-demand benchmarking
◦ Java 11 setup with proper classpath resolution
◦ Automated artifact upload for benchmark results
◦ Fixed JMH forking issues for reliable results

Test Scenarios Coverage

• Data Scales: SMALL(1K) → MEDIUM(10K) → LARGE(100K) → EXTRA_LARGE(1M+)
• File Formats: XLSX
• Operation Types: Read, Write, Fill, Streaming
• Memory Analysis: Real-time monitoring, GC pressure analysis, allocation patterns
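
The data scales above map naturally onto an enum. The sketch below is a hypothetical stand-in, not the PR's `BenchmarkConfiguration`: the constant names and row counts follow the list above, while `DatasetSizeSketch`, `rowCount()`, and `forRows()` are invented for illustration, and treating EXTRA_LARGE as exactly 1,000,000 rows is an assumption (the description says "1M+").

```java
/** Hypothetical enum mirroring the data scales in the PR description. */
enum DatasetSizeSketch {
    SMALL(1_000),
    MEDIUM(10_000),
    LARGE(100_000),
    EXTRA_LARGE(1_000_000); // assumption: lower bound of the "1M+" scale

    private final int rowCount;

    DatasetSizeSketch(int rowCount) {
        this.rowCount = rowCount;
    }

    int rowCount() {
        return rowCount;
    }

    /** Smallest scale that holds at least the requested rows; larger requests map to EXTRA_LARGE. */
    static DatasetSizeSketch forRows(int rows) {
        for (DatasetSizeSketch size : values()) {
            if (rows <= size.rowCount) {
                return size;
            }
        }
        return EXTRA_LARGE;
    }

    public static void main(String[] args) {
        for (DatasetSizeSketch size : values()) {
            System.out.println(size + " -> " + size.rowCount() + " rows");
        }
    }
}
```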

Benefits

Validates Performance Claims: Provides empirical evidence for FastExcel's performance advantages
Quality Assurance: Enables systematic performance analysis and regression detection
User Confidence: Transparent performance reports for informed decision-making
Development Guidance: Data-driven optimization insights

Closes #572

GOODBOY008 · 2025-09-14T12:35:42Z

@delei @alaahong

CI Benchmark Run Completed Successfully: https://github.com/GOODBOY008/fastexcel/actions/runs/17709908635

To view the benchmark reports:

Download the benchmark-results artifact from: https://github.com/GOODBOY008/fastexcel/actions/runs/17709908635/artifacts/4006114037
Unzip the downloaded file
Open benchmark-reports/benchmark-comparison.html in your browser

There are a few issues to address:

In the Performance Comparisons section of the HTML report, the content is incomplete. A dataset and a format column need to be added.
For the 1M dataset scenario, the POI run failed, so no benchmark results were generated.

psxjoy · 2025-09-14T13:19:23Z

I'm really excited about this PR. However, it's quite large, so the code review will take some time.

Also, no offense intended, but I'd like to ask: Did you use AI-generated code in this PR?

GOODBOY008 · 2025-09-14T14:57:54Z

@psxjoy Yes, some parts (such as the comparison report, the memory profiler logic, and the quickstart scripts) were AI-assisted. AI is quite effective in these scenarios, and I’ve verified that those parts work correctly.

I noticed the artifact wasn’t accessible, so I’ve uploaded the results for your review.
benchmark-results.zip

delei · 2025-09-18T13:44:33Z

Hi, @GOODBOY008
Thank you for submitting the PR.

Regarding this PR, I still have some questions:

1. It seems that the file ./fastexcel-benchmark/scripts/benchmark-runner.sh does not exist?
2. Introducing JMH benchmark testing is highly necessary, but currently we don't need to run it through CI.
3. If possible, I suggest deleting the code for generating reports and analyzing results, and only keeping the JMH classes.

Please refer to the above suggestions and make appropriate modifications to the PR content. After that, we will vote on this PR together with other reviewers ASAP.

GOODBOY008 · 2025-09-19T03:11:30Z

Hi @delei
Thanks for your feedback.

For the first point, I understand the concern about the PR size — my intention was to split the work into stages, so this submission might look a bit large.

Regarding the second and third points:
• Running benchmarks in CI helps produce relatively stable and reproducible results. Running them locally is often influenced by background tasks and can take a long time.
• As for report generation and analysis, they make it easier to compare multiple runs, especially when evaluating different scenarios. Doing this entirely by hand would be quite time-consuming.

I’m fine with keeping only the JMH core classes for now, but I’d like to highlight the above considerations.

GOODBOY008 · 2025-09-23T07:50:45Z

@delei PTAL

netlify · 2025-11-24T07:23:38Z

✅ Deploy Preview for fesod ready!

Name	Link
🔨 Latest commit	`3a2c467`
🔍 Latest deploy log	https://app.netlify.com/projects/fesod/deploys/69240bb276a415000840ef16
😎 Deploy Preview	https://deploy-preview-575--fesod.netlify.app

Copilot

Pull request overview

This PR introduces a comprehensive JMH-based benchmark module for the FastExcel library, enabling performance testing and comparisons with Apache POI across various operations (read, write, fill) and dataset sizes. The module includes memory profiling capabilities, test data generation utilities, and comparison benchmarks to validate FastExcel's performance claims.

Changes:

New fesod-benchmark module with complete JMH integration and Maven configuration
Benchmark suites for read, write, and fill operations across multiple dataset sizes and file formats
Memory profiling utilities with GC tracking and detailed statistics
Comparison benchmarks between FastExcel and Apache POI
Comprehensive test data generation with configurable characteristics
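
For benchmark inputs to be comparable across runs, test data generation is usually made deterministic. The sketch below shows one common way to do that with a fixed seed. It is not the PR's `DataGenerator`; `SeededRowGeneratorSketch` and the four-column row layout are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

/** Hypothetical generator (not from the PR): the same seed always yields the same rows. */
final class SeededRowGeneratorSketch {

    /** Builds rows of {id, name, amount, flag} from a seeded pseudo-random source. */
    static List<Object[]> generate(int rows, long seed) {
        Random random = new Random(seed);
        List<Object[]> data = new ArrayList<>(rows);
        for (int i = 0; i < rows; i++) {
            data.add(new Object[] {
                (long) i,                          // sequential id
                "name-" + random.nextInt(10_000),  // short text value
                random.nextDouble() * 1_000,       // numeric value
                random.nextBoolean()               // boolean flag
            });
        }
        return data;
    }

    public static void main(String[] args) {
        List<Object[]> rows = generate(3, 42L);
        for (Object[] row : rows) {
            System.out.println(java.util.Arrays.toString(row));
        }
    }
}
```

Because every benchmarked library then reads or writes identical content, differences in the scores are less likely to come from the data itself.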

Reviewed changes

Copilot reviewed 14 out of 14 changed files in this pull request and generated 20 comments.

File	Description
pom.xml	Added fesod-benchmark module to parent POM
fesod-benchmark/pom.xml	New Maven configuration with JMH dependencies and shade plugin
fesod-benchmark/benchmark.md	Documentation for running and interpreting benchmarks
MemoryProfiler.java	Utility for real-time memory profiling with GC tracking
DataGenerator.java	Test data generation with multiple data types and characteristics
BenchmarkFileUtil.java	File management utilities for benchmark operations
BenchmarkData.java	Data model with 20 fields covering various Excel data types
BenchmarkConfiguration.java	Configuration enums for dataset sizes and file formats
AbstractBenchmark.java	Base class providing common benchmark functionality
WriteBenchmark.java	Write operation benchmarks for different sizes and scenarios
ReadBenchmark.java	Read operation benchmarks with multiple listener patterns
FillBenchmark.java	Template fill operation benchmarks
FastExcelVsPoiBenchmark.java	Comparison benchmarks between FastExcel and Apache POI
ComparisonBenchmarkRunner.java	Runner for executing comparison benchmarks

Copilot · 2026-01-16T05:50:57Z