From: Automated labeling of PDF mathematical exercises with word N-grams VSM classification
(a) without omitting any words | (b) omitting meaningless words | ||||
---|---|---|---|---|---|
Mono-gram | Mono-gram (English) | Importance | Mono-gram | Mono-gram (English) | Importance |
I | I | 0.027101 | ベクトル | Vector | 0.012692 |
III | III | 0.026868 | 数 | Number | 0.011921 |
II | II | 0.025151 | 関数 | Formula | 0.011126 |
A | A | 0.018989 | 確率 | Probability | 0.009463 |
B | B | 0.016693 | 複素 | Complex | 0.008825 |
数 | Number | 0.012049 | 点 | Point | 0.008570 |
関数 | Function | 0.011044 | 式 | Formula | 0.008534 |
解説 | Solution | 0.010463 | よっ | Therefore | 0.008256 |
ベクトル | Vector | 0.009900 | 極限 | Limit | 0.008139 |
点 | Point | 0.009046 | 定積 | Constant volume | 0.007628 |
確率 | Probability | 0.008046 | 列 | Column (or Sequence) | 0.006980 |
複素 | Complex | 0.007726 | 値 | Value | 0.006849 |
式 | Formula | 0.007594 | 次 | Next | 0.006669 |
列 | Column (or Sequence) | 0.006998 | 求め | Find [the value] | 0.006547 |
積分 | Integral | 0.006740 | する | Do | 0.006483 |
三角 | Triangle | 0.006716 | 積分 | Integral | 0.006374 |
定積 | Constant volume | 0.006563 | 三角 | Triangle | 0.006342 |
極限 | Limit | 0.006439 | が | Be | 0.006325 |