This neural network has an input layer, one hidden layer, and one output layer. The input layer has 784 nodes, the hidden layer has 128 nodes, and the output layer has 10 nodes.
The sigmoid function is used as the activation function.
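
To make the layer sizes concrete, here is a minimal sketch of the parameters and the sigmoid activation. The initialization scheme (small random weights, zero biases), the presence of bias terms, and the names `W1`, `b1`, `W2`, `b2` are assumptions for illustration; the text only fixes the layer sizes and the activation.

```python
import torch

def sigmoid(z: torch.Tensor) -> torch.Tensor:
    """Element-wise logistic sigmoid: 1 / (1 + exp(-z))."""
    return 1.0 / (1.0 + torch.exp(-z))

# Layer sizes taken from the text: 784 inputs, 128 hidden nodes, 10 outputs.
n_in, n_hidden, n_out = 784, 128, 10

# Assumed initialization: small random weights and zero biases.
torch.manual_seed(0)
W1 = torch.randn(n_in, n_hidden) * 0.01   # input -> hidden
b1 = torch.zeros(n_hidden)
W2 = torch.randn(n_hidden, n_out) * 0.01  # hidden -> output
b2 = torch.zeros(n_out)
```

Storing the weights as (inputs, outputs) matrices lets a batch of row vectors be multiplied from the left, so a batch of shape (n, 784) maps to (n, 128) and then to (n, 10).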
- Forward propagation (forward pass)
- Backward propagation (backward pass)
- Updating the weights with the computed gradients
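
The three steps above can be sketched as a single training step with hand-written gradients. This is only an illustration under stated assumptions: the loss (half the squared error, averaged over the batch), the learning rate, the bias terms, and the function name `train_step` are not given in the text, and the data below is a random stand-in rather than a real dataset.

```python
import torch

def sigmoid(z: torch.Tensor) -> torch.Tensor:
    return 1.0 / (1.0 + torch.exp(-z))

def train_step(x, y, W1, b1, W2, b2, lr=0.1):
    """One forward pass, backward pass, and in-place parameter update."""
    n = x.shape[0]

    # Forward propagation
    h = sigmoid(torch.add(torch.mm(x, W1), b1))    # hidden activations, (n, 128)
    out = sigmoid(torch.add(torch.mm(h, W2), b2))  # network output, (n, 10)
    loss = 0.5 * ((out - y) ** 2).sum() / n        # assumed loss function

    # Backward propagation (the sigmoid derivative is s * (1 - s))
    d_out = torch.mul(out - y, torch.mul(out, 1 - out)) / n
    grad_W2 = torch.mm(torch.transpose(h, 0, 1), d_out)
    grad_b2 = d_out.sum(dim=0)
    d_h = torch.mul(torch.mm(d_out, torch.transpose(W2, 0, 1)),
                    torch.mul(h, 1 - h))
    grad_W1 = torch.mm(torch.transpose(x, 0, 1), d_h)
    grad_b1 = d_h.sum(dim=0)

    # Update: plain gradient descent, modifying the tensors in place
    W1 -= lr * grad_W1
    b1 -= lr * grad_b1
    W2 -= lr * grad_W2
    b2 -= lr * grad_b2
    return loss.item()

# Usage with random stand-in data (16 samples, one-hot targets).
torch.manual_seed(0)
x = torch.rand(16, 784)
y = torch.eye(10)[torch.randint(0, 10, (16,))]
W1 = torch.randn(784, 128) * 0.01
b1 = torch.zeros(128)
W2 = torch.randn(128, 10) * 0.01
b2 = torch.zeros(10)
losses = [train_step(x, y, W1, b1, W2, b2) for _ in range(20)]
```

Writing the gradients by hand keeps every step visible; in practice the same gradients could be obtained from PyTorch's autograd instead.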
- torch.add
- torch.mul
- torch.transpose
- torch.mm
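
As a quick reference, the following sketch shows what each of the four functions listed above does on small 2x2 matrices. The example values are made up for illustration.

```python
import torch

a = torch.tensor([[1.0, 2.0], [3.0, 4.0]])
b = torch.tensor([[10.0, 20.0], [30.0, 40.0]])

s = torch.add(a, b)           # element-wise sum:     [[11, 22], [33, 44]]
p = torch.mul(a, b)           # element-wise product: [[10, 40], [90, 160]]
t = torch.transpose(a, 0, 1)  # swap dims 0 and 1:    [[1, 3], [2, 4]]
m = torch.mm(a, b)            # matrix product:       [[70, 100], [150, 220]]
```

Note that `torch.mul` multiplies matching entries, while `torch.mm` performs a true matrix multiplication of two 2-D tensors.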
