From: Automated labeling of PDF mathematical exercises with word N-grams VSM classification
 | \(n=1\) | \(n=2\) | \(n=3\) | \(n=4\) | \(n=5\) | \(n=6\) | w2vec |
---|---|---|---|---|---|---|---|
Similarity (\({f}_{mean}\)) | 0.4317 | 0.4699 | 0.4447 | 0.4418 | 0.4249 | 0.4151 | 0.2584 |
Similarity (\({f}_{max}\)) | 0.5589 | 0.6076 | 0.5946 | 0.5780 | 0.5564 | 0.5305 | 0.5121 |
Similarity (\({f}_{to{p}_{2}}\)) | 0.5845 | 0.6252 | 0.6050 | 0.5914 | 0.5640 | 0.5337 | 0.4941 |
Similarity (\({f}_{to{p}_{3}}\)) | 0.5795 | 0.6169 | 0.6000 | 0.5823 | 0.5514 | 0.5283 | 0.4688 |
Similarity (\({f}_{to{p}_{4}}\)) | 0.5553 | 0.5968 | 0.5845 | 0.5607 | 0.5359 | 0.5164 | 0.4328 |
Similarity (\({f}_{to{p}_{5}}\)) | 0.4840 | 0.5586 | 0.5510 | 0.5431 | 0.5232 | 0.5074 | 0.3640 |
Similarity (\({f}_{to{p}_{6}}\)) | 0.4663 | 0.5276 | 0.5276 | 0.5222 | 0.5085 | 0.4984 | 0.3495 |
Similarity (\({f}_{to{p}_{7}}\)) | 0.4544 | 0.5016 | 0.5099 | 0.5045 | 0.4905 | 0.4890 | 0.3240 |
Similarity (\({f}_{to{p}_{8}}\)) | 0.4411 | 0.4829 | 0.4908 | 0.4915 | 0.4861 | 0.4836 | 0.2998 |
Similarity (\({f}_{to{p}_{9}}\)) | 0.3795 | 0.4490 | 0.4688 | 0.4782 | 0.4764 | 0.4782 | 0.2378 |
Similarity (\({f}_{to{p}_{10}}\)) | 0.3467 | 0.4231 | 0.4468 | 0.4631 | 0.4681 | 0.4710 | 0.2288 |
Similarity (\({f}_{ran{k}_{2}}\)) | 0.5895 | 0.6335 | 0.6083 | 0.5924 | 0.5676 | 0.5434 | 0.5114 |
Similarity (\({f}_{ran{k}_{3}}\)) | 0.5917 | 0.6364 | 0.6119 | 0.5993 | 0.5679 | 0.5409 | 0.4915 |
Similarity (\({f}_{ran{k}_{4}}\)) | 0.5813 | 0.6299 | 0.6054 | 0.5921 | 0.5604 | 0.5402 | 0.4742 |
Similarity (\({f}_{ran{k}_{5}}\)) | 0.5514 | 0.6119 | 0.5989 | 0.5809 | 0.5557 | 0.5359 | 0.3968 |
Similarity (\({f}_{ran{k}_{6}}\)) | 0.5124 | 0.5859 | 0.5841 | 0.5705 | 0.5467 | 0.5319 | 0.3838 |
Similarity (\({f}_{ran{k}_{7}}\)) | 0.4923 | 0.5712 | 0.5723 | 0.5640 | 0.5416 | 0.5250 | 0.3726 |
Similarity (\({f}_{ran{k}_{8}}\)) | 0.4782 | 0.5532 | 0.5582 | 0.5553 | 0.5337 | 0.5182 | 0.3582 |
Similarity (\({f}_{ran{k}_{9}}\)) | 0.4584 | 0.5377 | 0.5467 | 0.5409 | 0.5279 | 0.5114 | 0.2847 |
Similarity (\({f}_{ran{k}_{10}}\)) | 0.4321 | 0.5178 | 0.5315 | 0.5330 | 0.5214 | 0.5095 | 0.2656 |
xgb | 0.1441 | 0.0868 | 0.0659 | 0.0541 | 0.0490 | 0.0375 | 0.1643 |
rf | 0.6850 | 0.6829 | 0.6314 | 0.5957 | 0.5452 | 0.4861 | 0.4987 |
mlp | 0.4418 | 0.6281 | 0.6544 | 0.6566 | 0.6414 | 0.6169 | 0.4840 |
lr | 0.6404 | 0.6339 | 0.6270 | 0.6220 | 0.5924 | 0.5636 | 0.6245 |