Skip to main content

Table 6 Classification in 1st level unit between a feature and weighted F-measure \({F}_{wL}\) in n-grams and machine learning methods

From: Automated labeling of PDF mathematical exercises with word N-grams VSM classification

 

\(n=1\)

\(n=2\)

\(n=3\)

\(n=4\)

\(n=5\)

\(n=6\)

w2vec

Similarity (\({f}_{mean}\))

0.7127

0.7392

0.7055

0.7063

0.6831

0.6864

0.3962

Similarity (\({f}_{max}\))

0.8024

0.8191

0.8140

0.7960

0.7797

0.7541

0.6939

Similarity (\({f}_{to{p}_{2}}\))

0.8331

0.8393

0.8322

0.8109

0.7837

0.7638

0.7064

Similarity (\({f}_{to{p}_{3}}\))

0.8414

0.8440

0.8365

0.8073

0.7785

0.7609

0.7091

Similarity (\({f}_{to{p}_{4}}\))

0.8408

0.8505

0.8337

0.8105

0.7719

0.7514

0.7059

Similarity (\({f}_{to{p}_{5}}\))

0.8434

0.8514

0.8332

0.8019

0.7659

0.7476

0.6961

Similarity (\({f}_{to{p}_{6}}\))

0.8446

0.8502

0.8331

0.7954

0.7584

0.7378

0.6825

Similarity (\({f}_{to{p}_{7}}\))

0.8407

0.8531

0.8313

0.7886

0.7569

0.7344

0.6735

Similarity (\({f}_{to{p}_{8}}\))

0.8394

0.8515

0.8265

0.7852

0.7528

0.7321

0.6607

Similarity (\({f}_{to{p}_{9}}\))

0.8385

0.8492

0.8212

0.7823

0.7504

0.7223

0.6556

Similarity (\({f}_{to{p}_{10}}\))

0.8358

0.8475

0.8175

0.7795

0.7466

0.7184

0.6457

Similarity (\({f}_{ran{k}_{2}}\))

0.8232

0.8389

0.8291

0.8093

0.7844

0.7643

0.7097

Similarity (\({f}_{ran{k}_{3}}\))

0.8357

0.8419

0.8340

0.8126

0.7850

0.7656

0.7118

Similarity (\({f}_{ran{k}_{4}}\))

0.8403

0.8481

0.8372

0.8134

0.7846

0.7659

0.7116

Similarity (\({f}_{ran{k}_{5}}\))

0.8434

0.8532

0.8391

0.8121

0.7807

0.7638

0.7094

Similarity (\({f}_{ran{k}_{6}}\))

0.8462

0.8544

0.8377

0.8139

0.7792

0.7568

0.7076

Similarity (\({f}_{ran{k}_{7}}\))

0.8460

0.8547

0.8392

0.8121

0.7767

0.7566

0.7006

Similarity (\({f}_{ran{k}_{8}}\))

0.8484

0.8562

0.8383

0.8099

0.7726

0.7527

0.6959

Similarity (\({f}_{ran{k}_{9}}\))

0.8483

0.8563

0.8364

0.8069

0.7739

0.7486

0.6929

Similarity (\({f}_{ran{k}_{10}}\))

0.8484

0.8545

0.8344

0.8025

0.7698

0.7431

0.6846

xgb

0.8601

0.8279

0.6777

0.6037

0.5274

0.4800

0.6530

rf

0.9255

0.9020

0.8315

0.7430

0.6514

0.6016

0.7191

mlp

0.7924

0.8745

0.8636

0.8673

0.8367

0.8192

0.8066

lr

0.9193

0.8935

0.8655

0.8387

0.8029

0.7453

0.8537