Rankings by correlation

AR

Rank Team name ρ IoU
1 ccnu 0.658322 0.599486
2 UCSC 0.654253 0.605911
3 MSA 0.648751 0.669976
4 Deloitte 0.637112 0.587047
5 Cantharellus 0.588639 0.580441
6 smurfcat 0.586897 0.554546
7 AILSNTUA 0.586518 0.49669
8 DeepPavlov 0.575385 0.562797
9 NCL-UoR 0.571027 0.538962
10 LCTeam 0.553711 0.533466
11 TrustAI 0.538544 0.284279
12 HalluSearch 0.525796 0.536196
13 TUM-MiKaNi 0.511441 0.477832
14 BlueToad 0.505765 0.546958
15 UZH 0.502335 0.50292
16 tsotsalab 0.476535 0.467336
17 uir-cis 0.447737 0.272152
18 CUET_SSTM 0.447229 0.0977755
19 nsu-ai 0.423632 0.475634
20 UMUTEAM 0.421136 0.343636
21 TU Munich 0.397252 0.148036
22 Go Bison 0.384405 0.213832
23 Swushroomsia 0.287353 0.309703
24 keepitsimple 0.249909 0.363055
25 REFIND 0.181827 0.373731
  Baseline (neural) 0.119044 0.0418214
26 CIMAT_NLP 0.0968738 0.144688
27 HalluciSeekers 0.0572297 0.118015
28 Hallucination Detectives 0.0357784 0.0754672
  Baseline (mark all) 0.00666667 0.361354
29 FENJI 0.00666667 0.0466667
  Baseline (mark none) 0.00666667 0.0466667

CA

Rank Team name ρ IoU
1 UCSC 0.784443 0.671104
2 ccnu 0.74788 0.669404
3 NCL-UoR 0.720283 0.660233
4 smurfcat 0.71269 0.66813
5 MSA 0.712569 0.654489
6 AILSNTUA 0.698585 0.666354
7 DeepPavlov 0.674159 0.417948
8 UZH 0.641962 0.585719
9 Deloitte 0.621869 0.503203
10 Cantharellus 0.572738 0.523118
11 HalluSearch 0.570404 0.521545
12 TUM-MiKaNi 0.555095 0.597131
13 uir-cis 0.543165 0.464404
14 nsu-ai 0.53457 0.468213
15 tsotsalab 0.518747 0.460743
16 LCTeam 0.493652 0.44413
17 UMUTEAM 0.429528 0.430108
18 Go Bison 0.374884 0.273122
19 keepitsimple 0.337705 0.316082
20 CIMAT_NLP 0.0690446 0.140966
  Baseline (neural) 0.064522 0.0523675
  Baseline (mark all) 0.06 0.242314
21 FENJI 0.06 0.179567
  Baseline (mark none) 0.06 0.08

CS

Rank Team name ρ IoU
1 UCSC 0.599259 0.507206
2 AILSNTUA 0.55604 0.542931
3 ccnu 0.554119 0.485157
4 MSA 0.551628 0.507274
5 smurfcat 0.533409 0.450961
6 NCL-UoR 0.528544 0.440913
7 Deloitte 0.503438 0.374018
8 HalluSearch 0.49418 0.491134
9 TUM-MiKaNi 0.457965 0.385329
10 Cantharellus 0.437327 0.382296
11 LCTeam 0.435664 0.405129
12 UZH 0.409798 0.393113
13 tsotsalab 0.366823 0.361273
14 BlueToad 0.362768 0.351392
15 UMUTEAM 0.360041 0.337978
16 DeepPavlov 0.321452 0.340538
17 Go Bison 0.306569 0.297839
18 nsu-ai 0.294812 0.305105
19 uir-cis 0.26954 0.305957
20 keepitsimple 0.242256 0.289479
21 REFIND 0.186094 0.235255
22 CIMAT_NLP 0.156329 0.182078
  Baseline (mark all) 0.1 0.263164
  Baseline (mark none) 0.1 0.13
23 FENJI 0.1 0.107279
  Baseline (neural) 0.0533101 0.0957349

DE

Rank Team name ρ IoU
1 UCSC 0.658758 0.622068
2 AILSNTUA 0.636736 0.582003
3 Swushroomsia 0.616026 0.291122
4 DeepPavlov 0.612572 0.504009
5 MSA 0.610729 0.613297
6 ccnu 0.608866 0.591716
7 smurfcat 0.604236 0.505018
8 NCL-UoR 0.586023 0.547259
9 Deloitte 0.549333 0.565526
10 Cantharellus 0.536146 0.563885
11 BlueToad 0.524323 0.543932
12 TrustAI 0.512082 0.332266
13 TUM-MiKaNi 0.508814 0.55689
14 HalluSearch 0.505648 0.518707
15 LCTeam 0.503102 0.563418
16 UZH 0.502754 0.512345
17 ATLANTIS 0.460747 0.520388
18 nsu-ai 0.458364 0.484104
19 UMUTEAM 0.440333 0.409277
20 uir-cis 0.406593 0.340033
21 tsotsalab 0.361389 0.396933
22 REFIND 0.353026 0.386198
23 TU Munich 0.319541 0.270422
24 Go Bison 0.276369 0.252222
25 keepitsimple 0.2199 0.365058
  Baseline (neural) 0.107339 0.0318306
26 HalluciSeekers 0.0439964 0.0573379
  Baseline (mark all) 0.0133333 0.345082
27 FENJI 0.0133333 0.162442
  Baseline (mark none) 0.0133333 0.0266667
28 S1mT5v-FMI 0.010897 0.0266667

EN

Rank Team name ρ IoU
1 Swushroomsia 0.648553 0.476925
2 UCSC 0.64787 0.568586
3 AILSNTUA 0.63806 0.530816
4 iai_MSU 0.629443 0.650899
5 DeepPavlov 0.611648 0.439097
6 smurfcat 0.611573 0.504968
7 Deloitte 0.583317 0.511402
8 ccnu 0.571317 0.517683
9 TrustAI 0.564236 0.298035
10 LCTeam 0.560416 0.459026
11 TUM-MiKaNi 0.550571 0.338538
12 NCL-UoR 0.547665 0.519506
13 HalluSearch 0.544359 0.531502
14 MSA 0.538022 0.506599
15 ATLANTIS 0.528718 0.515867
16 UZH 0.519255 0.469884
17 GIL-IIMAS UNAM 0.501453 0.460714
18 UMUTEAM 0.496598 0.366717
19 uir-cis 0.478107 0.402539
20 Cantharellus 0.466793 0.428869
21 nsu-ai 0.457807 0.443577
22 BlueToad 0.450891 0.46881
23 CIMAT_NLP 0.425484 0.426997
24 HausaNLP 0.422565 0.0324675
25 tsotsalab 0.41087 0.379302
26 YNU-HPCC 0.407526 0.480748
27 TU Munich 0.37604 0.208941
28 VerbaNexAI 0.365657 0.363401
29 advacheck 0.34982 0.443957
30 MALTO 0.311736 0.299252
31 RaggedyFive 0.303801 0.315072
32 Go Bison 0.275188 0.132535
33 COGUMELO 0.227748 0.310653
34 keepitsimple 0.210418 0.365959
35 REFIND 0.205808 0.281194
36 Hallucination Detectives 0.168161 0.214243
37 HalluciSeekers 0.153026 0.0541527
  Baseline (neural) 0.118962 0.0310306
38 HalluRAG-RUG 0.083251 0.309315
39 FunghiFunghi 0.0115548 0.294294
  Baseline (mark all) 0 0.348926
40 FENJI 0 0.185616
  Baseline (mark none) 0 0.0324675
41 DUTJBD -0.188262 0.0570927

ES

Rank Team name ρ IoU
1 UCSC 0.619284 0.43391
2 AILSNTUA 0.606817 0.439603
3 Deloitte 0.585294 0.406493
4 smurfcat 0.566164 0.430813
5 ccnu 0.55746 0.511061
6 MSA 0.547745 0.402249
7 NCL-UoR 0.546429 0.51464
8 CIMAT_NLP 0.545803 0.472748
9 UZH 0.508468 0.405104
10 TUM-MiKaNi 0.502674 0.373943
11 TrustAI 0.498297 0.268268
12 Cantharellus 0.448898 0.366712
13 LCTeam 0.447057 0.418761
14 HalluSearch 0.44556 0.388304
15 BlueToad 0.426741 0.278685
16 DeepPavlov 0.420667 0.209793
17 UMUTEAM 0.415235 0.298022
18 nsu-ai 0.396598 0.28542
19 ATLANTIS 0.379309 0.36063
20 Go Bison 0.364261 0.134148
21 GIL-IIMAS UNAM 0.324301 0.280706
22 TU Munich 0.32288 0.257843
23 uir-cis 0.310446 0.34471
24 Swushroomsia 0.247989 0.246567
25 keepitsimple 0.233523 0.21312
26 REFIND 0.169892 0.215179
27 COGUMELO 0.101349 0.132137
  Baseline (neural) 0.0358945 0.0723782
28 HalluciSeekers 0.0265825 0.0519325
  Baseline (mark all) 0.0131579 0.185334
29 tsotsalab 0.0131579 0.185334
30 FENJI 0.0131579 0.132475
  Baseline (mark none) 0.0131579 0.0855263
31 S1mT5v-FMI 0.0131579 0.0855263
32 FunghiFunghi -0.098563 0.161561

EU

Rank Team name ρ IoU
1 UCSC 0.626477 0.583036
2 MSA 0.620171 0.612919
3 ccnu 0.612061 0.57845
4 NCL-UoR 0.597423 0.510467
5 AILSNTUA 0.580486 0.555038
6 LCTeam 0.555992 0.458939
7 smurfcat 0.523384 0.510622
8 Deloitte 0.515714 0.521802
9 UZH 0.51084 0.507146
10 Cantharellus 0.503753 0.533928
11 TUM-MiKaNi 0.499565 0.428949
12 HalluSearch 0.478856 0.525141
13 BlueToad 0.457055 0.506068
14 nsu-ai 0.420974 0.436818
15 uir-cis 0.398855 0.291573
16 UMUTEAM 0.392456 0.327194
17 REFIND 0.35516 0.386945
18 keepitsimple 0.35247 0.419268
19 DeepPavlov 0.321352 0.38725
20 Go Bison 0.170669 0.246096
  Baseline (neural) 0.100423 0.0208128
21 CIMAT_NLP 0.0712057 0.137183
  Baseline (mark all) 0 0.36709
22 tsotsalab 0 0.352359
23 FENJI 0 0.132604
  Baseline (mark none) 0 0.010101

FA

Rank Team name ρ IoU
1 MSA 0.700909 0.639229
2 AILSNTUA 0.69889 0.710956
3 UCSC 0.695521 0.694861
4 ccnu 0.688646 0.656931
5 Cantharellus 0.686447 0.65514
6 NCL-UoR 0.673225 0.65857
7 smurfcat 0.65845 0.606245
8 BlueToad 0.578781 0.571086
9 Deloitte 0.537897 0.513905
10 UZH 0.498991 0.510767
11 TUM-MiKaNi 0.476213 0.531511
12 HalluSearch 0.473408 0.444267
13 LCTeam 0.455896 0.601757
14 CIMAT_NLP 0.42971 0.0248033
15 uir-cis 0.39462 0.166122
16 UMUTEAM 0.393932 0.467708
17 nsu-ai 0.387544 0.372886
18 keepitsimple 0.357009 0.313204
19 DeepPavlov 0.18591 0.240507
  Baseline (neural) 0.107759 0.000133333
20 HalluciSeekers 0.0743606 0.112631
21 Go Bison 0.0661229 0.118995
  Baseline (mark all) 0.01 0.202808
22 tsotsalab 0.01 0.202808
23 FENJI 0.01 0.00280303
  Baseline (mark none) 0.01 0

FI

Rank Team name ρ IoU
1 UCSC 0.649756 0.648264
2 Deloitte 0.642378 0.628366
3 AILSNTUA 0.620406 0.623496
4 TUM-MiKaNi 0.575089 0.626698
5 smurfcat 0.565011 0.553573
6 Cantharellus 0.564578 0.571418
7 ccnu 0.563066 0.511671
8 LCTeam 0.561119 0.393282
9 NCL-UoR 0.552387 0.49828
10 MSA 0.546673 0.642205
11 HalluSearch 0.529663 0.568077
12 TrustAI 0.528106 0.107232
13 UMUTEAM 0.512588 0.456276
14 UZH 0.493367 0.538329
15 nsu-ai 0.492204 0.587385
16 BlueToad 0.490576 0.569368
17 DeepPavlov 0.482142 0.584469
18 Swushroomsia 0.429834 0.495474
19 TU Munich 0.412063 0.404171
20 Go Bison 0.343294 0.399602
21 uir-cis 0.336595 0.245949
22 keepitsimple 0.332279 0.455409
23 REFIND 0.198615 0.502531
  Baseline (neural) 0.0924184 0.00418271
24 CIMAT_NLP 0.0418234 0.367269
25 S1mT5v-FMI 0.00143156 0
  Baseline (mark all) 0 0.4857
26 tsotsalab 0 0.4857
27 FENJI 0 0.0940742
  Baseline (mark none) 0 0

FR

Rank Team name ρ IoU
1 Deloitte 0.618707 0.646867
2 AILSNTUA 0.61026 0.581152
3 UCSC 0.604096 0.581228
4 Swushroomsia 0.590778 0.442186
5 ccnu 0.572353 0.482309
6 smurfcat 0.566108 0.526852
7 MSA 0.55531 0.61946
8 DeepPavlov 0.543978 0.583096
9 Cantharellus 0.531731 0.514708
10 TUM-MiKaNi 0.515746 0.631443
11 TrustAI 0.499211 0.379873
12 tsotsalab 0.490968 0.483577
13 LCTeam 0.488321 0.56336
14 NCL-UoR 0.482266 0.357145
15 UZH 0.466929 0.485988
16 nsu-ai 0.433876 0.518122
17 UMUTEAM 0.411692 0.319957
18 ATLANTIS 0.411668 0.518961
19 Go Bison 0.399039 0.416434
20 BlueToad 0.37965 0.438527
21 TU Munich 0.348386 0.415209
22 HalluSearch 0.336523 0.436638
23 uir-cis 0.287342 0.22862
24 keepitsimple 0.275619 0.465062
25 REFIND 0.152962 0.211995
26 CIMAT_NLP 0.089848 0.33095
27 HalluciSeekers 0.0447022 0.0500358
  Baseline (neural) 0.0207713 0.00220218
  Baseline (mark all) 0 0.454341
28 FENJI 0 0.0843731
  Baseline (mark none) 0 0
29 S1mT5v-FMI 0 0
30 FunghiFunghi -0.152109 0.309466

HI

Rank Team name ρ IoU
1 ccnu 0.78466 0.746571
2 UCSC 0.774615 0.673209
3 AILSNTUA 0.760167 0.725948
4 smurfcat 0.750213 0.706393
5 DeepPavlov 0.731983 0.511659
6 MSA 0.725217 0.684241
7 Cantharellus 0.694484 0.626955
8 BlueToad 0.684394 0.644655
9 NCL-UoR 0.683031 0.628631
10 UZH 0.668682 0.637742
11 Deloitte 0.639149 0.632193
12 uir-cis 0.558552 0.0612541
13 TUM-MiKaNi 0.540935 0.573665
14 HalluSearch 0.519489 0.526485
15 LCTeam 0.51215 0.660057
16 TrustAI 0.504983 0.314378
17 Swushroomsia 0.478873 0.453368
18 nsu-ai 0.449689 0.431481
19 UMUTEAM 0.438582 0.451005
20 keepitsimple 0.350802 0.359799
21 TU Munich 0.329729 0.280676
22 Go Bison 0.321685 0.258627
  Baseline (neural) 0.142871 0.00290509
  Baseline (mark all) 0 0.271096
23 tsotsalab 0 0.271096
24 FENJI 0 0
  Baseline (mark none) 0 0

IT

Rank Team name ρ IoU
1 AILSNTUA 0.819506 0.765999
2 UCSC 0.794364 0.750938
3 NCL-UoR 0.763733 0.654718
4 smurfcat 0.762752 0.72547
5 MSA 0.7587 0.728948
6 ccnu 0.745766 0.694386
7 Swushroomsia 0.739385 0.7149
8 Cantharellus 0.711849 0.690693
9 UZH 0.701611 0.683283
10 BlueToad 0.667483 0.638831
11 Deloitte 0.654666 0.625251
12 TUM-MiKaNi 0.623289 0.678142
13 HalluSearch 0.560364 0.54836
14 DeepPavlov 0.552877 0.527982
15 LCTeam 0.548709 0.701254
16 uir-cis 0.499055 0.396664
17 TrustAI 0.475986 0.20768
18 UMUTEAM 0.460141 0.441297
19 nsu-ai 0.440155 0.439614
20 TU Munich 0.420967 0.331916
21 Go Bison 0.402096 0.267502
22 keepitsimple 0.386033 0.40092
23 REFIND 0.242343 0.325471
24 CIMAT_NLP 0.0893679 0.169601
  Baseline (neural) 0.0799714 0.0104172
25 HalluciSeekers 0.0241904 0.0349619
  Baseline (mark all) 0 0.282615
26 tsotsalab 0 0.282615
27 FENJI 0 0.276521
  Baseline (mark none) 0 0
28 FunghiFunghi -0.211589 0.211097

SV

Rank Team name ρ IoU
1 AILSNTUA 0.5622 0.600937
2 MSA 0.548584 0.607064
3 Deloitte 0.537399 0.622029
4 NCL-UoR 0.522451 0.523353
5 UCSC 0.520398 0.642303
6 ccnu 0.512862 0.496107
7 smurfcat 0.500708 0.617439
8 LCTeam 0.463091 0.301569
9 UZH 0.434623 0.5263
10 HalluSearch 0.428993 0.562213
11 BlueToad 0.426741 0.585413
12 TrustAI 0.421914 0.15824
13 DeepPavlov 0.414687 0.538018
14 TUM-MiKaNi 0.402848 0.561387
15 UMUTEAM 0.393618 0.439337
16 uir-cis 0.365493 0.307967
17 nsu-ai 0.344154 0.547839
18 TU Munich 0.240321 0.275515
19 Swushroomsia 0.226542 0.354929
20 keepitsimple 0.217007 0.396724
  Baseline (neural) 0.0968112 0.0307729
21 HalluciSeekers 0.0855806 0.0574943
22 CIMAT_NLP 0.0822829 0.177169
23 Go Bison 0.0668742 0.110986
  Baseline (mark all) 0.0136054 0.537275
24 tsotsalab 0.0136054 0.534936
25 FENJI 0.0136054 0.115419
  Baseline (mark none) 0.0136054 0.0204082
26 S1mT5v-FMI 0.0136054 0.0204082
27 FunghiFunghi -0.117725 0.415579

ZH

Rank Team name ρ IoU
1 LCTeam 0.517056 0.523231
2 UMUTEAM 0.491649 0.387513
3 AILSNTUA 0.479097 0.308338
4 TrustAI 0.473483 0.342284
5 TUM-MiKaNi 0.467572 0.448973
6 MSA 0.436328 0.463093
7 ccnu 0.433474 0.37184
8 HalluSearch 0.423154 0.453449
9 UCSC 0.418653 0.463348
10 Cantharellus 0.406336 0.401066
11 NCL-UoR 0.383009 0.349296
12 nsu-ai 0.381261 0.493722
13 UZH 0.352048 0.39926
14 YNU-HPCC 0.351808 0.553971
15 smurfcat 0.345714 0.401736
16 uir-cis 0.327846 0.178566
17 DeepPavlov 0.325073 0.484934
18 Deloitte 0.320306 0.447917
19 TU Munich 0.27711 0.174971
20 BlueToad 0.226195 0.278315
21 keepitsimple 0.160073 0.47033
22 Go Bison 0.111916 0.215245
23 Swushroomsia 0.09661 0.205396
  Baseline (neural) 0.0883699 0.0235879
  Baseline (mark all) 0 0.477155
24 tsotsalab 0 0.477155
25 FENJI 0 0.0371131
  Baseline (mark none) 0 0.02
26 S1mT5v-FMI -0.020898 0.061894