From: Automated labeling of PDF mathematical exercises with word N-grams VSM classification
Method | \(n=1\) | \(n=2\) | \(n=3\) | \(n=4\) | \(n=5\) | \(n=6\) | word2vec |
---|---|---|---|---|---|---|---|
Similarity (\(f_{\mathrm{mean}}\)) | 0.4468 | 0.4799 | 0.4495 | 0.4388 | 0.4159 | 0.4051 | 0.2879 |
Similarity (\(f_{\mathrm{max}}\)) | 0.5641 | 0.6140 | 0.6021 | 0.5860 | 0.5617 | 0.5349 | 0.5183 |
Similarity (\(f_{\mathrm{top}_{2}}\)) | 0.6025 | 0.6419 | 0.6229 | 0.6071 | 0.5768 | 0.5439 | 0.5136 |
Similarity (\(f_{\mathrm{top}_{3}}\)) | 0.6084 | 0.6422 | 0.6249 | 0.6036 | 0.5675 | 0.5406 | 0.5016 |
Similarity (\(f_{\mathrm{top}_{4}}\)) | 0.5966 | 0.6330 | 0.6186 | 0.5883 | 0.5563 | 0.5330 | 0.4755 |
Similarity (\(f_{\mathrm{top}_{5}}\)) | 0.5605 | 0.6115 | 0.5949 | 0.5768 | 0.5472 | 0.5259 | 0.4319 |
Similarity (\(f_{\mathrm{top}_{6}}\)) | 0.5478 | 0.5961 | 0.5812 | 0.5610 | 0.5367 | 0.5181 | 0.4178 |
Similarity (\(f_{\mathrm{top}_{7}}\)) | 0.5368 | 0.5788 | 0.5707 | 0.5496 | 0.5218 | 0.5118 | 0.3932 |
Similarity (\(f_{\mathrm{top}_{8}}\)) | 0.5265 | 0.5653 | 0.5565 | 0.5401 | 0.5209 | 0.5081 | 0.3711 |
Similarity (\(f_{\mathrm{top}_{9}}\)) | 0.4780 | 0.5368 | 0.5374 | 0.5296 | 0.5129 | 0.5036 | 0.3232 |
Similarity (\(f_{\mathrm{top}_{10}}\)) | 0.4551 | 0.5149 | 0.5184 | 0.5174 | 0.5065 | 0.4963 | 0.3119 |
Similarity (\(f_{\mathrm{rank}_{2}}\)) | 0.6045 | 0.6470 | 0.6238 | 0.6062 | 0.5800 | 0.5544 | 0.5261 |
Similarity (\(f_{\mathrm{rank}_{3}}\)) | 0.6131 | 0.6561 | 0.6319 | 0.6171 | 0.5821 | 0.5519 | 0.5152 |
Similarity (\(f_{\mathrm{rank}_{4}}\)) | 0.6116 | 0.6556 | 0.6303 | 0.6134 | 0.5772 | 0.5530 | 0.5075 |
Similarity (\(f_{\mathrm{rank}_{5}}\)) | 0.5999 | 0.6454 | 0.6293 | 0.6062 | 0.5730 | 0.5519 | 0.4640 |
Similarity (\(f_{\mathrm{rank}_{6}}\)) | 0.5782 | 0.6311 | 0.6211 | 0.5985 | 0.5674 | 0.5488 | 0.4511 |
Similarity (\(f_{\mathrm{rank}_{7}}\)) | 0.5672 | 0.6232 | 0.6131 | 0.5938 | 0.5645 | 0.5432 | 0.4410 |
Similarity (\(f_{\mathrm{rank}_{8}}\)) | 0.5581 | 0.6102 | 0.6043 | 0.5886 | 0.5584 | 0.5376 | 0.4281 |
Similarity (\(f_{\mathrm{rank}_{9}}\)) | 0.5412 | 0.6024 | 0.5950 | 0.5781 | 0.5528 | 0.5310 | 0.3735 |
Similarity (\(f_{\mathrm{rank}_{10}}\)) | 0.5203 | 0.5886 | 0.5839 | 0.5716 | 0.5485 | 0.5294 | 0.3570 |
XGBoost (xgb) | 0.2162 | 0.1378 | 0.1105 | 0.0971 | 0.0895 | 0.0700 | 0.2308 |
Random forest (rf) | 0.7064 | 0.7099 | 0.6592 | 0.6213 | 0.5660 | 0.4969 | 0.5260 |
Multilayer perceptron (mlp) | 0.4479 | 0.6461 | 0.6692 | 0.6702 | 0.6533 | 0.6280 | 0.4867 |
Logistic regression (lr) | 0.6533 | 0.6397 | 0.6349 | 0.6305 | 0.5939 | 0.5525 | 0.6311 |
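
The similarity rows in the table are named after scoring functions (mean, max, top-k, rank-k) that the table itself does not define. The sketch below shows one plausible reading, stated as an assumption and not as the authors' definition: a new exercise is compared by cosine similarity with the already labeled exercises of each label, and the per-label similarities are aggregated by their mean, their maximum, or the mean of the k largest values. The helper names `cosine_similarities`, `f_mean`, `f_max`, `f_top_k`, and `predict_label` are hypothetical, and the rank-k variant is left out because its definition cannot be recovered from the table.

```python
import numpy as np

def cosine_similarities(query, docs):
    """Cosine similarity between one query vector and each row of docs."""
    query = np.asarray(query, dtype=float)
    docs = np.asarray(docs, dtype=float)
    norms = np.linalg.norm(docs, axis=1) * np.linalg.norm(query)
    # Guard against zero vectors: similarity is taken to be 0 in that case.
    safe = np.where(norms == 0, 1.0, norms)
    return np.where(norms == 0, 0.0, docs @ query / safe)

def f_mean(sims):
    """Assumed reading: average similarity to all exercises of a label."""
    return float(np.mean(sims))

def f_max(sims):
    """Assumed reading: similarity to the closest exercise of a label."""
    return float(np.max(sims))

def f_top_k(sims, k):
    """Assumed reading: mean of the k largest similarities (all if fewer than k)."""
    top = np.sort(np.asarray(sims, dtype=float))[::-1][:k]
    return float(np.mean(top))

def predict_label(query, docs_by_label, score):
    """Return the label whose labeled exercises receive the highest score."""
    scores = {
        label: score(cosine_similarities(query, docs))
        for label, docs in docs_by_label.items()
    }
    return max(scores, key=scores.get)

# Toy N-gram count vectors (illustrative values, not data from the paper).
docs_by_label = {
    "algebra": [[1, 0, 0], [1, 1, 0]],
    "geometry": [[0, 0, 1], [0, 1, 1]],
}
query = [1, 0, 0]
predicted = predict_label(query, docs_by_label, lambda s: f_top_k(s, 2))
```

Under this reading, a small k makes the score depend on only the closest matches, while the mean dilutes a few strong matches across every exercise of the label. That would be consistent with the top-2 and top-3 rows scoring above the mean row in the table.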