EasyMiner easy association rule mining

Benchmarks

This page needs to be updated. Currently it contains several benchmarks of older versions of the software, and accuracy benchmark between common classification algorithms and CBA.

Association rule learning

Adapted from: Stanislav Vojíř, Václav Zeman, Jaroslav Kuchař, Tomáš Kliegr: EasyMiner/R Preview: Towards a Web Interface for Association Rule Learning and Classification in R. RuleML 2015

Time requirements of rule mining (confidence=0.5)
supportrule countbackend onlyEasyMiner/R
w/o miss.w miss.w miss. prunedLISp-Minerarulesminingwith prun.
0.010 79 163 54 3 s 6.2 s 5.4 s 33.8 s
0.009 95 186 68 6 s 6.4 s 5.4 s 27.8 s
0.008 112 213 73 16 s 6.2 s 5.4 s 31.7 s
0.007 144 295 90 27 s 6.3 s 5.5 s 31.7 s
0.006 187 397 107 1 m 10 s 6.3 s 5.5 s 35.6 s
0.005 256 552 141 4 m 38 s 6.3 s 5.7 s 35.5 s
0.004 396 765 184 28 m 04 s 6.5 s 6.0 s 37.8 s
0.003 602 1147 253 > 5 h 6.5 s 8.6 s 43.3 s
0.002 1391 2699 430 > 6 h 6.5 s 14.0 s 1 m 04.1 s
0.001 3394 6034 697 > 6 h 6.7 s 15.1 s 1 m 59.0 s

Classification

Adapted from: Tomáš Kliegr, Jaroslav Kuchař: Benchmark of rule-based classifiers in the news recommendation task. CLEF 2015 Proceedings, p. 130–141.

Impact of pruning steps in CBA on CLEF#26875 dataset. Minimum support set to 0.1% and minimum confidence set to 2%.
algorithmaccuracyrules
no pruning, direct use of association rules 6.4 1735
data coverage pruning 6.9 497
data coverage, default rule pruning 7 175

 

Effect of support threshold - CBA (ten-fold shuffled cross-validation). CBA implementation evaluated is LUCS KDD.
metric0.10%0.09%0.08%0.07%0.06%0.05%0.04%0.03%0.02%0.01%
accuracy 6.68 6.88 7.07 7.64 8.1 8.65 9.48 10.4 13.47 17.55
rule count 148 178 193 228 270 317 452 576 1100 2303

 

Model benchmark on CLEF#26875 dataset (single 90/10 split). Model size refers to the number of rules for rule models and number of leaves for decision trees.  DecisionTree and CHAID - RapidMiner 5 Community edition, FOIL, CPAR, CBA, CMAR - LUCS KDD
algorithmaccuracymodel size
DecisionTree 23.0 13496
ID3 22.8 13579
CHAID 25.4 13224
FOIL 24.7 18047
CPAR 4.6 18907
CBA 21.2 3681
CMAR 16.9 22516