Skip to main content

Table 4 Classification in 1st level unit between a feature and accuracy \({A}_{L}\) in n-grams and machine learning methods

From: Automated labeling of PDF mathematical exercises with word N-grams VSM classification

 

\(n=1\)

\(n=2\)

\(n=3\)

\(n=4\)

\(n=5\)

\(n=6\)

w2vec

Similarity (\({f}_{mean}\))

0.7121

0.7398

0.7059

0.7049

0.6825

0.6858

0.3834

Similarity (\({f}_{max}\))

0.8025

0.8184

0.8130

0.7953

0.7791

0.7539

0.6944

Similarity (\({f}_{to{p}_{2}}\))

0.8328

0.8386

0.8314

0.8105

0.7838

0.7647

0.7063

Similarity (\({f}_{to{p}_{3}}\))

0.8411

0.8429

0.8357

0.8072

0.7791

0.7625

0.7081

Similarity (\({f}_{to{p}_{4}}\))

0.8407

0.8494

0.8328

0.8105

0.7730

0.7539

0.7049

Similarity (\({f}_{to{p}_{5}}\))

0.8432

0.8505

0.8324

0.8022

0.7679

0.7506

0.6955

Similarity (\({f}_{to{p}_{6}}\))

0.8443

0.8494

0.8321

0.7960

0.7607

0.7409

0.6818

Similarity (\({f}_{to{p}_{7}}\))

0.8404

0.8523

0.8303

0.7888

0.7596

0.7373

0.6728

Similarity (\({f}_{to{p}_{8}}\))

0.8389

0.8508

0.8256

0.7859

0.7553

0.7348

0.6595

Similarity (\({f}_{to{p}_{9}}\))

0.8378

0.8483

0.8202

0.7831

0.7524

0.7243

0.6541

Similarity (\({f}_{to{p}_{10}}\))

0.8350

0.8465

0.8162

0.7798

0.7481

0.7196

0.6436

Similarity (\({f}_{ran{k}_{2}}\))

0.8231

0.8382

0.8281

0.8090

0.7841

0.7647

0.7095

Similarity (\({f}_{ran{k}_{3}}\))

0.8353

0.8411

0.8332

0.8123

0.7852

0.7665

0.7114

Similarity (\({f}_{ran{k}_{4}}\))

0.8400

0.8472

0.8364

0.8130

0.7852

0.7672

0.7106

Similarity (\({f}_{ran{k}_{5}}\))

0.8432

0.8523

0.8382

0.8119

0.7813

0.7654

0.7085

Similarity (\({f}_{ran{k}_{6}}\))

0.8461

0.8533

0.8371

0.8141

0.7798

0.7589

0.7067

Similarity (\({f}_{ran{k}_{7}}\))

0.8458

0.8537

0.8386

0.8123

0.7780

0.7593

0.6998

Similarity (\({f}_{ran{k}_{8}}\))

0.8483

0.8551

0.8375

0.8101

0.7744

0.7557

0.6951

Similarity (\({f}_{ran{k}_{9}}\))

0.8479

0.8555

0.8353

0.8072

0.7759

0.7517

0.6923

Similarity (\({f}_{ran{k}_{10}}\))

0.8479

0.8537

0.8335

0.8029

0.7719

0.7463

0.6836

xgb

0.8605

0.8274

0.6681

0.5831

0.4825

0.4263

0.6468

rf

0.9250

0.9013

0.8310

0.7427

0.6526

0.5968

0.7139

mlp

0.7910

0.8732

0.8631

0.8674

0.8382

0.8209

0.8065

lr

0.9193

0.8930

0.8641

0.8371

0.8025

0.7492

0.8537