Skip to main content
Fig. 2 | Smart Learning Environments

Fig. 2

From: Continuous user identification in distance learning: a recent technology perspective

Fig. 2

The SincNet architecture Ravanelli and Bengio (2018). The speech waveform is convoluted with a set of parametrized sinc functions that implement band-pass filters. Then, a standard CNN pipeline (pooling, normalization, activations, dropout) is employed. Multiple standard convolutional, fully-connected or recurrent layers are then stacked to finally perform speaker classification with a softmax classifier

Back to article page