
Table 9 Classification at the 2nd-level unit: weighted F-measure \({F}_{wL}\) by feature for n-gram and machine learning methods

From: Automated labeling of PDF mathematical exercises with word N-grams VSM classification

 

| Feature / method | \(n=1\) | \(n=2\) | \(n=3\) | \(n=4\) | \(n=5\) | \(n=6\) | w2vec |
|---|---|---|---|---|---|---|---|
| Similarity (\(f_{mean}\)) | 0.4468 | 0.4799 | 0.4495 | 0.4388 | 0.4159 | 0.4051 | 0.2879 |
| Similarity (\(f_{max}\)) | 0.5641 | 0.6140 | 0.6021 | 0.5860 | 0.5617 | 0.5349 | 0.5183 |
| Similarity (\(f_{top_{2}}\)) | 0.6025 | 0.6419 | 0.6229 | 0.6071 | 0.5768 | 0.5439 | 0.5136 |
| Similarity (\(f_{top_{3}}\)) | 0.6084 | 0.6422 | 0.6249 | 0.6036 | 0.5675 | 0.5406 | 0.5016 |
| Similarity (\(f_{top_{4}}\)) | 0.5966 | 0.6330 | 0.6186 | 0.5883 | 0.5563 | 0.5330 | 0.4755 |
| Similarity (\(f_{top_{5}}\)) | 0.5605 | 0.6115 | 0.5949 | 0.5768 | 0.5472 | 0.5259 | 0.4319 |
| Similarity (\(f_{top_{6}}\)) | 0.5478 | 0.5961 | 0.5812 | 0.5610 | 0.5367 | 0.5181 | 0.4178 |
| Similarity (\(f_{top_{7}}\)) | 0.5368 | 0.5788 | 0.5707 | 0.5496 | 0.5218 | 0.5118 | 0.3932 |
| Similarity (\(f_{top_{8}}\)) | 0.5265 | 0.5653 | 0.5565 | 0.5401 | 0.5209 | 0.5081 | 0.3711 |
| Similarity (\(f_{top_{9}}\)) | 0.4780 | 0.5368 | 0.5374 | 0.5296 | 0.5129 | 0.5036 | 0.3232 |
| Similarity (\(f_{top_{10}}\)) | 0.4551 | 0.5149 | 0.5184 | 0.5174 | 0.5065 | 0.4963 | 0.3119 |
| Similarity (\(f_{rank_{2}}\)) | 0.6045 | 0.6470 | 0.6238 | 0.6062 | 0.5800 | 0.5544 | 0.5261 |
| Similarity (\(f_{rank_{3}}\)) | 0.6131 | 0.6561 | 0.6319 | 0.6171 | 0.5821 | 0.5519 | 0.5152 |
| Similarity (\(f_{rank_{4}}\)) | 0.6116 | 0.6556 | 0.6303 | 0.6134 | 0.5772 | 0.5530 | 0.5075 |
| Similarity (\(f_{rank_{5}}\)) | 0.5999 | 0.6454 | 0.6293 | 0.6062 | 0.5730 | 0.5519 | 0.4640 |
| Similarity (\(f_{rank_{6}}\)) | 0.5782 | 0.6311 | 0.6211 | 0.5985 | 0.5674 | 0.5488 | 0.4511 |
| Similarity (\(f_{rank_{7}}\)) | 0.5672 | 0.6232 | 0.6131 | 0.5938 | 0.5645 | 0.5432 | 0.4410 |
| Similarity (\(f_{rank_{8}}\)) | 0.5581 | 0.6102 | 0.6043 | 0.5886 | 0.5584 | 0.5376 | 0.4281 |
| Similarity (\(f_{rank_{9}}\)) | 0.5412 | 0.6024 | 0.5950 | 0.5781 | 0.5528 | 0.5310 | 0.3735 |
| Similarity (\(f_{rank_{10}}\)) | 0.5203 | 0.5886 | 0.5839 | 0.5716 | 0.5485 | 0.5294 | 0.3570 |
| xgb | 0.2162 | 0.1378 | 0.1105 | 0.0971 | 0.0895 | 0.0700 | 0.2308 |
| rf | 0.7064 | 0.7099 | 0.6592 | 0.6213 | 0.5660 | 0.4969 | 0.5260 |
| mlp | 0.4479 | 0.6461 | 0.6692 | 0.6702 | 0.6533 | 0.6280 | 0.4867 |
| lr | 0.6533 | 0.6397 | 0.6349 | 0.6305 | 0.5939 | 0.5525 | 0.6311 |
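
The scores above are weighted F-measures. This page does not define \({F}_{wL}\), so the following is only a minimal sketch under the assumption that it is the usual support-weighted average of per-class F1 scores. The function name `weighted_f_measure` and the example labels are hypothetical and are not taken from the article.

```python
from collections import Counter


def weighted_f_measure(y_true, y_pred):
    """Support-weighted mean of per-class F1 (assumed definition of F_wL)."""
    support = Counter(y_true)          # number of true samples per class
    total = len(y_true)
    score = 0.0
    for label, n_true in support.items():
        # True positives: samples of this class that were predicted as this class.
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        n_pred = sum(1 for p in y_pred if p == label)
        precision = tp / n_pred if n_pred else 0.0
        recall = tp / n_true
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        # Each class contributes in proportion to its share of the true labels.
        score += (n_true / total) * f1
    return score


# Hypothetical labels, for illustration only.
y_true = ["unit_a", "unit_a", "unit_b", "unit_c"]
y_pred = ["unit_a", "unit_b", "unit_b", "unit_c"]
print(round(weighted_f_measure(y_true, y_pred), 4))
```

Under this definition, frequent classes weigh more heavily in the score than rare ones, which matters when the units have very different numbers of exercises.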