From: Automated labeling of PDF mathematical exercises with word N-grams VSM classification
Method | \(n=1\) | \(n=2\) | \(n=3\) | \(n=4\) | \(n=5\) | \(n=6\) | w2vec |
---|---|---|---|---|---|---|---|
Similarity (\(f_{mean}\)) | 0.7211 | 0.7479 | 0.7122 | 0.7058 | 0.6827 | 0.6835 | 0.3894 |
Similarity (\(f_{max}\)) | 0.8015 | 0.8124 | 0.8051 | 0.7864 | 0.7684 | 0.7437 | 0.7031 |
Similarity (\(f_{top_{2}}\)) | 0.8309 | 0.8345 | 0.8236 | 0.8004 | 0.7746 | 0.7550 | 0.7133 |
Similarity (\(f_{top_{3}}\)) | 0.8369 | 0.8375 | 0.8282 | 0.7969 | 0.7695 | 0.7520 | 0.7133 |
Similarity (\(f_{top_{4}}\)) | 0.8370 | 0.8441 | 0.8249 | 0.8007 | 0.7607 | 0.7427 | 0.7094 |
Similarity (\(f_{top_{5}}\)) | 0.8392 | 0.8457 | 0.8252 | 0.7928 | 0.7542 | 0.7369 | 0.7014 |
Similarity (\(f_{top_{6}}\)) | 0.8394 | 0.8444 | 0.8243 | 0.7836 | 0.7444 | 0.7257 | 0.6868 |
Similarity (\(f_{top_{7}}\)) | 0.8338 | 0.8464 | 0.8212 | 0.7728 | 0.7438 | 0.7199 | 0.6767 |
Similarity (\(f_{top_{8}}\)) | 0.8328 | 0.8457 | 0.8160 | 0.7699 | 0.7389 | 0.7154 | 0.6619 |
Similarity (\(f_{top_{9}}\)) | 0.8312 | 0.8421 | 0.8091 | 0.7669 | 0.7330 | 0.7004 | 0.6563 |
Similarity (\(f_{top_{10}}\)) | 0.8286 | 0.8387 | 0.8013 | 0.7598 | 0.7226 | 0.6913 | 0.6440 |
Similarity (\(f_{rank_{2}}\)) | 0.8209 | 0.8343 | 0.8196 | 0.7999 | 0.7740 | 0.7546 | 0.7163 |
Similarity (\(f_{rank_{3}}\)) | 0.8330 | 0.8379 | 0.8254 | 0.8018 | 0.7756 | 0.7572 | 0.7181 |
Similarity (\(f_{rank_{4}}\)) | 0.8379 | 0.8433 | 0.8288 | 0.8027 | 0.7752 | 0.7570 | 0.7162 |
Similarity (\(f_{rank_{5}}\)) | 0.8406 | 0.8484 | 0.8304 | 0.8020 | 0.7708 | 0.7552 | 0.7139 |
Similarity (\(f_{rank_{6}}\)) | 0.8433 | 0.8485 | 0.8297 | 0.8046 | 0.7666 | 0.7472 | 0.7119 |
Similarity (\(f_{rank_{7}}\)) | 0.8424 | 0.8492 | 0.8310 | 0.8028 | 0.7649 | 0.7473 | 0.7050 |
Similarity (\(f_{rank_{8}}\)) | 0.8439 | 0.8502 | 0.8298 | 0.7995 | 0.7605 | 0.7421 | 0.7002 |
Similarity (\(f_{rank_{9}}\)) | 0.8418 | 0.8504 | 0.8275 | 0.7967 | 0.7586 | 0.7375 | 0.6965 |
Similarity (\(f_{rank_{10}}\)) | 0.8414 | 0.8490 | 0.8251 | 0.7904 | 0.7551 | 0.7303 | 0.6865 |
XGBoost (xgb) | 0.8572 | 0.8212 | 0.6171 | 0.5195 | 0.3938 | 0.3312 | 0.6303 |
Random forest (rf) | 0.9250 | 0.8996 | 0.8205 | 0.7133 | 0.6176 | 0.5568 | 0.7098 |
Multilayer perceptron (mlp) | 0.7914 | 0.8719 | 0.8623 | 0.8665 | 0.8393 | 0.8178 | 0.8079 |
Logistic regression (lr) | 0.9218 | 0.8903 | 0.8570 | 0.8247 | 0.7880 | 0.7358 | 0.8566 |