Table 10  Feature analysis in mono-gram and Random Forest

From: Automated labeling of PDF mathematical exercises with word N-grams VSM classification

(a) without omitting any words

| Mono-gram | Mono-gram (English) | Importance |
| --- | --- | --- |
| I | I | 0.027101 |
| III | III | 0.026868 |
| II | II | 0.025151 |
| A | A | 0.018989 |
| B | B | 0.016693 |
|  | Number | 0.012049 |
| 関数 | Function | 0.011044 |
| 解説 | Solution | 0.010463 |
| ベクトル | Vector | 0.009900 |
|  | Point | 0.009046 |
| 確率 | Probability | 0.008046 |
| 複素 | Complex | 0.007726 |
|  | Formula | 0.007594 |
|  | Column (or Sequence) | 0.006998 |
| 積分 | Integral | 0.006740 |
| 三角 | Triangle | 0.006716 |
| 定積 | Constant volume | 0.006563 |
| 極限 | Limit | 0.006439 |

(b) omitting meaningless words

| Mono-gram | Mono-gram (English) | Importance |
| --- | --- | --- |
| ベクトル | Vector | 0.012692 |
|  | Number | 0.011921 |
| 関数 | Formula | 0.011126 |
| 確率 | Probability | 0.009463 |
| 複素 | Complex | 0.008825 |
|  | Point | 0.008570 |
|  | Formula | 0.008534 |
| よっ | Therefore | 0.008256 |
| 極限 | Limit | 0.008139 |
| 定積 | Constant volume | 0.007628 |
|  | Column (or Sequence) | 0.006980 |
|  | Value | 0.006849 |
|  | Next | 0.006669 |
| 求め | Find [the value] | 0.006547 |
| する | Do | 0.006483 |
| 積分 | Integral | 0.006374 |
| 三角 | Triangle | 0.006342 |
|  | Be | 0.006325 |