From: Automated labeling of PDF mathematical exercises with word N-grams VSM classification
 | \(n=1\) | \(n=2\) | \(n=3\) | \(n=4\) | \(n=5\) | \(n=6\) | w2vec |
---|---|---|---|---|---|---|---|
Similarity (\({f}_{mean}\)) | 0.7121 | 0.7398 | 0.7059 | 0.7049 | 0.6825 | 0.6858 | 0.3834 |
Similarity (\({f}_{max}\)) | 0.8025 | 0.8184 | 0.8130 | 0.7953 | 0.7791 | 0.7539 | 0.6944 |
Similarity (\({f}_{to{p}_{2}}\)) | 0.8328 | 0.8386 | 0.8314 | 0.8105 | 0.7838 | 0.7647 | 0.7063 |
Similarity (\({f}_{to{p}_{3}}\)) | 0.8411 | 0.8429 | 0.8357 | 0.8072 | 0.7791 | 0.7625 | 0.7081 |
Similarity (\({f}_{to{p}_{4}}\)) | 0.8407 | 0.8494 | 0.8328 | 0.8105 | 0.7730 | 0.7539 | 0.7049 |
Similarity (\({f}_{to{p}_{5}}\)) | 0.8432 | 0.8505 | 0.8324 | 0.8022 | 0.7679 | 0.7506 | 0.6955 |
Similarity (\({f}_{to{p}_{6}}\)) | 0.8443 | 0.8494 | 0.8321 | 0.7960 | 0.7607 | 0.7409 | 0.6818 |
Similarity (\({f}_{to{p}_{7}}\)) | 0.8404 | 0.8523 | 0.8303 | 0.7888 | 0.7596 | 0.7373 | 0.6728 |
Similarity (\({f}_{to{p}_{8}}\)) | 0.8389 | 0.8508 | 0.8256 | 0.7859 | 0.7553 | 0.7348 | 0.6595 |
Similarity (\({f}_{to{p}_{9}}\)) | 0.8378 | 0.8483 | 0.8202 | 0.7831 | 0.7524 | 0.7243 | 0.6541 |
Similarity (\({f}_{to{p}_{10}}\)) | 0.8350 | 0.8465 | 0.8162 | 0.7798 | 0.7481 | 0.7196 | 0.6436 |
Similarity (\({f}_{ran{k}_{2}}\)) | 0.8231 | 0.8382 | 0.8281 | 0.8090 | 0.7841 | 0.7647 | 0.7095 |
Similarity (\({f}_{ran{k}_{3}}\)) | 0.8353 | 0.8411 | 0.8332 | 0.8123 | 0.7852 | 0.7665 | 0.7114 |
Similarity (\({f}_{ran{k}_{4}}\)) | 0.8400 | 0.8472 | 0.8364 | 0.8130 | 0.7852 | 0.7672 | 0.7106 |
Similarity (\({f}_{ran{k}_{5}}\)) | 0.8432 | 0.8523 | 0.8382 | 0.8119 | 0.7813 | 0.7654 | 0.7085 |
Similarity (\({f}_{ran{k}_{6}}\)) | 0.8461 | 0.8533 | 0.8371 | 0.8141 | 0.7798 | 0.7589 | 0.7067 |
Similarity (\({f}_{ran{k}_{7}}\)) | 0.8458 | 0.8537 | 0.8386 | 0.8123 | 0.7780 | 0.7593 | 0.6998 |
Similarity (\({f}_{ran{k}_{8}}\)) | 0.8483 | 0.8551 | 0.8375 | 0.8101 | 0.7744 | 0.7557 | 0.6951 |
Similarity (\({f}_{ran{k}_{9}}\)) | 0.8479 | 0.8555 | 0.8353 | 0.8072 | 0.7759 | 0.7517 | 0.6923 |
Similarity (\({f}_{ran{k}_{10}}\)) | 0.8479 | 0.8537 | 0.8335 | 0.8029 | 0.7719 | 0.7463 | 0.6836 |
xgb | 0.8605 | 0.8274 | 0.6681 | 0.5831 | 0.4825 | 0.4263 | 0.6468 |
rf | 0.9250 | 0.9013 | 0.8310 | 0.7427 | 0.6526 | 0.5968 | 0.7139 |
mlp | 0.7910 | 0.8732 | 0.8631 | 0.8674 | 0.8382 | 0.8209 | 0.8065 |
lr | 0.9193 | 0.8930 | 0.8641 | 0.8371 | 0.8025 | 0.7492 | 0.8537 |