From: Automated labeling of PDF mathematical exercises with word N-grams VSM classification
Method | \(n=1\) | \(n=2\) | \(n=3\) | \(n=4\) | \(n=5\) | \(n=6\) | w2vec |
---|---|---|---|---|---|---|---|
Similarity (\(f_{mean}\)) | 0.7127 | 0.7392 | 0.7055 | 0.7063 | 0.6831 | 0.6864 | 0.3962 |
Similarity (\(f_{max}\)) | 0.8024 | 0.8191 | 0.8140 | 0.7960 | 0.7797 | 0.7541 | 0.6939 |
Similarity (\(f_{top_{2}}\)) | 0.8331 | 0.8393 | 0.8322 | 0.8109 | 0.7837 | 0.7638 | 0.7064 |
Similarity (\(f_{top_{3}}\)) | 0.8414 | 0.8440 | 0.8365 | 0.8073 | 0.7785 | 0.7609 | 0.7091 |
Similarity (\(f_{top_{4}}\)) | 0.8408 | 0.8505 | 0.8337 | 0.8105 | 0.7719 | 0.7514 | 0.7059 |
Similarity (\(f_{top_{5}}\)) | 0.8434 | 0.8514 | 0.8332 | 0.8019 | 0.7659 | 0.7476 | 0.6961 |
Similarity (\(f_{top_{6}}\)) | 0.8446 | 0.8502 | 0.8331 | 0.7954 | 0.7584 | 0.7378 | 0.6825 |
Similarity (\(f_{top_{7}}\)) | 0.8407 | 0.8531 | 0.8313 | 0.7886 | 0.7569 | 0.7344 | 0.6735 |
Similarity (\(f_{top_{8}}\)) | 0.8394 | 0.8515 | 0.8265 | 0.7852 | 0.7528 | 0.7321 | 0.6607 |
Similarity (\(f_{top_{9}}\)) | 0.8385 | 0.8492 | 0.8212 | 0.7823 | 0.7504 | 0.7223 | 0.6556 |
Similarity (\(f_{top_{10}}\)) | 0.8358 | 0.8475 | 0.8175 | 0.7795 | 0.7466 | 0.7184 | 0.6457 |
Similarity (\(f_{rank_{2}}\)) | 0.8232 | 0.8389 | 0.8291 | 0.8093 | 0.7844 | 0.7643 | 0.7097 |
Similarity (\(f_{rank_{3}}\)) | 0.8357 | 0.8419 | 0.8340 | 0.8126 | 0.7850 | 0.7656 | 0.7118 |
Similarity (\(f_{rank_{4}}\)) | 0.8403 | 0.8481 | 0.8372 | 0.8134 | 0.7846 | 0.7659 | 0.7116 |
Similarity (\(f_{rank_{5}}\)) | 0.8434 | 0.8532 | 0.8391 | 0.8121 | 0.7807 | 0.7638 | 0.7094 |
Similarity (\(f_{rank_{6}}\)) | 0.8462 | 0.8544 | 0.8377 | 0.8139 | 0.7792 | 0.7568 | 0.7076 |
Similarity (\(f_{rank_{7}}\)) | 0.8460 | 0.8547 | 0.8392 | 0.8121 | 0.7767 | 0.7566 | 0.7006 |
Similarity (\(f_{rank_{8}}\)) | 0.8484 | 0.8562 | 0.8383 | 0.8099 | 0.7726 | 0.7527 | 0.6959 |
Similarity (\(f_{rank_{9}}\)) | 0.8483 | 0.8563 | 0.8364 | 0.8069 | 0.7739 | 0.7486 | 0.6929 |
Similarity (\(f_{rank_{10}}\)) | 0.8484 | 0.8545 | 0.8344 | 0.8025 | 0.7698 | 0.7431 | 0.6846 |
xgb | 0.8601 | 0.8279 | 0.6777 | 0.6037 | 0.5274 | 0.4800 | 0.6530 |
rf | 0.9255 | 0.9020 | 0.8315 | 0.7430 | 0.6514 | 0.6016 | 0.7191 |
mlp | 0.7924 | 0.8745 | 0.8636 | 0.8673 | 0.8367 | 0.8192 | 0.8066 |
lr | 0.9193 | 0.8935 | 0.8655 | 0.8387 | 0.8029 | 0.7453 | 0.8537 |
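
For readers unfamiliar with the setup behind the similarity rows, the following is a minimal sketch of word n-gram vector space classification with cosine similarity. It is an illustration under stated assumptions, not the authors' implementation: the helper names (`word_ngrams`, `cosine`, `classify`), the toy exercises, and the reading of \(f_{mean}\) and \(f_{max}\) as the mean and maximum similarity between a query and the labeled exercises of each class are all assumptions. The \(f_{top_k}\) and \(f_{rank_k}\) aggregations are not covered here.

```python
from collections import Counter
from math import sqrt


def word_ngrams(text, n):
    """Count the word n-grams of a text (lowercased, whitespace-tokenized)."""
    tokens = text.lower().split()
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def cosine(a, b):
    """Cosine similarity between two sparse n-gram count vectors."""
    dot = sum(a[g] * b[g] for g in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


# Assumed aggregation functions: how per-exercise similarities
# are collapsed into one score per class.
def f_mean(sims):
    return sum(sims) / len(sims)


def f_max(sims):
    return max(sims)


def classify(query, labeled, n, aggregate):
    """Return the label whose aggregated similarity to the query is highest."""
    q = word_ngrams(query, n)
    scores = {
        label: aggregate([cosine(q, word_ngrams(t, n)) for t in texts])
        for label, texts in labeled.items()
    }
    return max(scores, key=scores.get)


# Hypothetical toy data, for illustration only.
labeled = {
    "algebra": ["solve the equation for x", "factor the polynomial completely"],
    "geometry": ["find the area of the triangle", "compute the angle of the triangle"],
}
query = "find the area of the circle"

print(classify(query, labeled, n=2, aggregate=f_mean))
print(classify(query, labeled, n=2, aggregate=f_max))
```

Changing `n` corresponds to moving across the columns of the table, and swapping `aggregate` corresponds to moving between the similarity rows.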