🚀 DataProfiler Performance Comparison
==================================================
Environment: CI
Testing file sizes: [1, 2, 5]MB

==================== 1MB TEST ====================
Generating 1MB test CSV...
Generated file: 1.1MB

--- Benchmarking DataProfiler on 1MB file ---
Benchmarking DataProfiler...
Building DataProfiler binary...
✅ DataProfiler: 0.02s, 0.2MB memory

--- Benchmarking pandas on 1MB file ---
Benchmarking pandas...
✅ pandas: 0.04s, 4.7MB memory

--- Benchmarking polars on 1MB file ---
Benchmarking Polars...
✅ polars: 0.01s, 34.3MB memory

--- Benchmarking great_expectations on 1MB file ---
Great Expectations not available, skipping...
❌ great_expectations: Not installed

==================== 2MB TEST ====================
Generating 2MB test CSV...
Generated file: 2.3MB

--- Benchmarking DataProfiler on 2MB file ---
Benchmarking DataProfiler...
✅ DataProfiler: 0.04s, 0.2MB memory

--- Benchmarking pandas on 2MB file ---
Benchmarking pandas...
✅ pandas: 0.06s, 2.4MB memory

--- Benchmarking polars on 2MB file ---
Benchmarking Polars...
✅ polars: 0.00s, 0.0MB memory

--- Benchmarking great_expectations on 2MB file ---
Great Expectations not available, skipping...
❌ great_expectations: Not installed

==================== 5MB TEST ====================
Generating 5MB test CSV...
Generated file: 5.8MB

--- Benchmarking DataProfiler on 5MB file ---
Benchmarking DataProfiler...
✅ DataProfiler: 0.08s, 0.2MB memory

--- Benchmarking pandas on 5MB file ---
Benchmarking pandas...
✅ pandas: 0.15s, 2.0MB memory

--- Benchmarking polars on 5MB file ---
Benchmarking Polars...
✅ polars: 0.01s, 9.5MB memory

--- Benchmarking great_expectations on 5MB file ---
Great Expectations not available, skipping...
❌ great_expectations: Not installed

============================================================
PERFORMANCE CLAIMS VALIDATION
============================================================

📊 1MB File Results:
----------------------------------------
DataProfiler: 0.02s | 0.2MB | 8,738 rows
pandas      : 0.04s | 4.7MB | 8,738 rows
polars      : 0.01s | 34.3MB | 8,738 rows

🚀 PERFORMANCE COMPARISON:
  ⚡ Speed: 1.6x faster than pandas
  💾 Memory: 23.5x more efficient than pandas
  🥈 COMPETITIVE: Good speed with excellent memory efficiency!

📊 2MB File Results:
----------------------------------------
DataProfiler: 0.04s | 0.2MB | 17,476 rows
pandas      : 0.06s | 2.4MB | 17,476 rows
polars      : 0.00s | 0.0MB | 17,476 rows

🚀 PERFORMANCE COMPARISON:
  ⚡ Speed: 1.7x faster than pandas
  💾 Memory: 12.1x more efficient than pandas
  🥈 COMPETITIVE: Good speed with excellent memory efficiency!

📊 5MB File Results:
----------------------------------------
DataProfiler: 0.08s | 0.2MB | 43,690 rows
pandas      : 0.15s | 2.0MB | 43,690 rows
polars      : 0.01s | 9.5MB | 43,690 rows

🚀 PERFORMANCE COMPARISON:
  ⚡ Speed: 1.9x faster than pandas
  💾 Memory: 10.2x more efficient than pandas
  🥈 COMPETITIVE: Good speed with excellent memory efficiency!

📁 Results saved to: benchmark-results/benchmark_comparison_results.json

============================================================
BENCHMARK SUMMARY
============================================================

📊 1MB File:
  Speed: 1.6x faster than pandas
  Memory: 23.5x more efficient than pandas
  Assessment: ✅ COMPETITIVE

📊 2MB File:
  Speed: 1.7x faster than pandas
  Memory: 12.1x more efficient than pandas
  Assessment: ✅ COMPETITIVE

📊 5MB File:
  Speed: 1.9x faster than pandas
  Memory: 10.2x more efficient than pandas
  Assessment: ✅ COMPETITIVE

✅ Benchmark completed successfully!
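
The speed and memory ratios in the summary can be recomputed from the per-tool figures. The following is a minimal sketch, assuming each ratio is simply the pandas value divided by the DataProfiler value. The `results` dictionary and the `ratios` helper are hypothetical names, and the numbers are the rounded figures printed in the log, not values read from the saved JSON file (whose schema is not shown here). Because the log rounds times to 0.01s and memory to 0.1MB, ratios recomputed this way can differ from the reported ones (for example 2.0x rather than 1.6x for the 1MB file), which suggests the benchmark script divides unrounded measurements.

```python
# Rounded (seconds, megabytes) figures copied from the log output.
# Assumption: the reported ratios are pandas / DataProfiler.
results = {
    "1MB": {"DataProfiler": (0.02, 0.2), "pandas": (0.04, 4.7)},
    "2MB": {"DataProfiler": (0.04, 0.2), "pandas": (0.06, 2.4)},
    "5MB": {"DataProfiler": (0.08, 0.2), "pandas": (0.15, 2.0)},
}


def ratios(baseline, candidate):
    """Return (speed_ratio, memory_ratio) of candidate relative to baseline.

    Both arguments are (seconds, megabytes) tuples. A ratio above 1 means
    the candidate used less time or memory than the baseline.
    """
    base_time, base_mem = baseline
    cand_time, cand_mem = candidate
    return base_time / cand_time, base_mem / cand_mem


for size, tools in results.items():
    speed, memory = ratios(tools["pandas"], tools["DataProfiler"])
    print(f"{size}: {speed:.1f}x faster, {memory:.1f}x less memory than pandas")
```

With timings this short (hundredths of a second), a single rounding step changes the speed ratio noticeably, so the ratios are best computed from the raw measurements rather than from the printed table.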