From: Automated labeling of PDF mathematical exercises with word N-grams VSM classification
 | \(n=1\) | \(n=2\) | \(n=3\) | \(n=4\) | \(n=5\) | \(n=6\) | w2vec |
---|---|---|---|---|---|---|---|
Similarity (\({f}_{mean}\)) | 0.4046 | 0.4461 | 0.4288 | 0.4319 | 0.4190 | 0.4094 | 0.2459 |
Similarity (\({f}_{max}\)) | 0.5103 | 0.5573 | 0.5411 | 0.5153 | 0.4958 | 0.4723 | 0.4817 |
Similarity (\({f}_{to{p}_{2}}\)) | 0.5167 | 0.5616 | 0.5380 | 0.5216 | 0.4979 | 0.4725 | 0.4388 |
Similarity (\({f}_{to{p}_{3}}\)) | 0.4903 | 0.5380 | 0.5182 | 0.4974 | 0.4726 | 0.4589 | 0.3904 |
Similarity (\({f}_{to{p}_{4}}\)) | 0.4419 | 0.4933 | 0.4832 | 0.4666 | 0.4511 | 0.4395 | 0.3307 |
Similarity (\({f}_{to{p}_{5}}\)) | 0.2694 | 0.4082 | 0.4209 | 0.4323 | 0.4297 | 0.4263 | 0.1895 |
Similarity (\({f}_{to{p}_{6}}\)) | 0.2431 | 0.3376 | 0.3731 | 0.3968 | 0.4038 | 0.4150 | 0.1797 |
Similarity (\({f}_{to{p}_{7}}\)) | 0.2339 | 0.2905 | 0.3413 | 0.3675 | 0.3799 | 0.3994 | 0.1592 |
Similarity (\({f}_{to{p}_{8}}\)) | 0.2215 | 0.2649 | 0.3110 | 0.3485 | 0.3687 | 0.3927 | 0.1388 |
Similarity (\({f}_{to{p}_{9}}\)) | 0.1604 | 0.2310 | 0.2883 | 0.3283 | 0.3548 | 0.3847 | 0.0740 |
Similarity (\({f}_{to{p}_{10}}\)) | 0.1247 | 0.2061 | 0.2619 | 0.3061 | 0.3434 | 0.3789 | 0.0704 |
Similarity (\({f}_{ran{k}_{2}}\)) | 0.5275 | 0.5769 | 0.5462 | 0.5274 | 0.5018 | 0.4826 | 0.4661 |
Similarity (\({f}_{ran{k}_{3}}\)) | 0.5202 | 0.5703 | 0.5422 | 0.5273 | 0.5006 | 0.4772 | 0.4291 |
Similarity (\({f}_{ran{k}_{4}}\)) | 0.4914 | 0.5510 | 0.5253 | 0.5121 | 0.4828 | 0.4725 | 0.3919 |
Similarity (\({f}_{ran{k}_{5}}\)) | 0.4177 | 0.5179 | 0.5063 | 0.4917 | 0.4770 | 0.4620 | 0.2148 |
Similarity (\({f}_{ran{k}_{6}}\)) | 0.3283 | 0.4582 | 0.4770 | 0.4748 | 0.4634 | 0.4565 | 0.2050 |
Similarity (\({f}_{ran{k}_{7}}\)) | 0.2814 | 0.4250 | 0.4551 | 0.4640 | 0.4522 | 0.4476 | 0.1958 |
Similarity (\({f}_{ran{k}_{8}}\)) | 0.2552 | 0.3924 | 0.4267 | 0.4488 | 0.4414 | 0.4391 | 0.1829 |
Similarity (\({f}_{ran{k}_{9}}\)) | 0.2367 | 0.3609 | 0.4107 | 0.4259 | 0.4349 | 0.4317 | 0.1047 |
Similarity (\({f}_{ran{k}_{10}}\)) | 0.2113 | 0.3284 | 0.3878 | 0.4150 | 0.4247 | 0.4280 | 0.0862 |
xgb | 0.0333 | 0.0150 | 0.0087 | 0.0036 | 0.0026 | 0.0014 | 0.0480 |
rf | 0.6128 | 0.5967 | 0.5478 | 0.5151 | 0.4623 | 0.4128 | 0.4285 |
mlp | 0.3919 | 0.5725 | 0.5999 | 0.6084 | 0.5950 | 0.5707 | 0.4377 |
lr | 0.5858 | 0.5828 | 0.5752 | 0.5745 | 0.5499 | 0.5385 | 0.5922 |