Skip to main content

Table 7 Classification in 2nd level unit between a feature and accuracy \({A}_{L}\) in n-grams and machine learning methods

From: Automated labeling of PDF mathematical exercises with word N-grams VSM classification

 

\(n=1\)

\(n=2\)

\(n=3\)

\(n=4\)

\(n=5\)

\(n=6\)

w2vec

Similarity (\({f}_{mean}\))

0.4317

0.4699

0.4447

0.4418

0.4249

0.4151

0.2584

Similarity (\({f}_{max}\))

0.5589

0.6076

0.5946

0.5780

0.5564

0.5305

0.5121

Similarity (\({f}_{to{p}_{2}}\))

0.5845

0.6252

0.6050

0.5914

0.5640

0.5337

0.4941

Similarity (\({f}_{to{p}_{3}}\))

0.5795

0.6169

0.6000

0.5823

0.5514

0.5283

0.4688

Similarity (\({f}_{to{p}_{4}}\))

0.5553

0.5968

0.5845

0.5607

0.5359

0.5164

0.4328

Similarity (\({f}_{to{p}_{5}}\))

0.4840

0.5586

0.5510

0.5431

0.5232

0.5074

0.3640

Similarity (\({f}_{to{p}_{6}}\))

0.4663

0.5276

0.5276

0.5222

0.5085

0.4984

0.3495

Similarity (\({f}_{to{p}_{7}}\))

0.4544

0.5016

0.5099

0.5045

0.4905

0.4890

0.3240

Similarity (\({f}_{to{p}_{8}}\))

0.4411

0.4829

0.4908

0.4915

0.4861

0.4836

0.2998

Similarity (\({f}_{to{p}_{9}}\))

0.3795

0.4490

0.4688

0.4782

0.4764

0.4782

0.2378

Similarity (\({f}_{to{p}_{10}}\))

0.3467

0.4231

0.4468

0.4631

0.4681

0.4710

0.2288

Similarity (\({f}_{ran{k}_{2}}\))

0.5895

0.6335

0.6083

0.5924

0.5676

0.5434

0.5114

Similarity (\({f}_{ran{k}_{3}}\))

0.5917

0.6364

0.6119

0.5993

0.5679

0.5409

0.4915

Similarity (\({f}_{ran{k}_{4}}\))

0.5813

0.6299

0.6054

0.5921

0.5604

0.5402

0.4742

Similarity (\({f}_{ran{k}_{5}}\))

0.5514

0.6119

0.5989

0.5809

0.5557

0.5359

0.3968

Similarity (\({f}_{ran{k}_{6}}\))

0.5124

0.5859

0.5841

0.5705

0.5467

0.5319

0.3838

Similarity (\({f}_{ran{k}_{7}}\))

0.4923

0.5712

0.5723

0.5640

0.5416

0.5250

0.3726

Similarity (\({f}_{ran{k}_{8}}\))

0.4782

0.5532

0.5582

0.5553

0.5337

0.5182

0.3582

Similarity (\({f}_{ran{k}_{9}}\))

0.4584

0.5377

0.5467

0.5409

0.5279

0.5114

0.2847

Similarity (\({f}_{ran{k}_{10}}\))

0.4321

0.5178

0.5315

0.5330

0.5214

0.5095

0.2656

xgb

0.1441

0.0868

0.0659

0.0541

0.0490

0.0375

0.1643

rf

0.6850

0.6829

0.6314

0.5957

0.5452

0.4861

0.4987

mlp

0.4418

0.6281

0.6544

0.6566

0.6414

0.6169

0.4840

lr

0.6404

0.6339

0.6270

0.6220

0.5924

0.5636

0.6245