Multi-Layer Perceptron (MLP)
Feedforward network with multiple hidden layers
Dense Layer
Fully connected layer where each neuron connects to all neurons in the previous layer
Input Layer
First layer that receives raw data features
Hidden Layer
Intermediate layers that extract and transform features
Output Layer
Final layer that produces predictions or classifications
Network Depth
Number of layers in the neural network
Batch
Subset of training data processed together in one iteration
Learning Rate
Step size for parameter updates during optimization
Validation Set
Data used to tune hyperparameters and monitor training progress
Test Set
Held-out data for final model evaluation
Early Stopping
Technique to prevent overfitting by stopping training when validation performance degrades
Convergence
When the training process reaches a stable state with minimal loss changes
SGD
Stochastic Gradient Descent - basic optimization algorithm
Adam
Adaptive optimization algorithm with momentum
Momentum
Technique to accelerate gradient descent by accumulating past gradients
Regularization
Techniques to prevent overfitting by constraining model complexity
Dropout
Regularization technique that randomly turns off neurons during training
Batch Normalization
Technique to normalize inputs to each layer for stable training
Accuracy
Percentage of correct predictions
Precision
True positives divided by predicted positives
Recall
True positives divided by actual positives
F1 Score
Harmonic mean of precision and recall
Confusion Matrix
Table showing correct vs predicted classifications
Cross Validation
Technique to assess model generalization using multiple train/test splits
Transfer Learning
Using pre-trained models as starting points for new tasks
Ensemble Methods
Combining multiple models to improve performance
Attention Mechanism
Technique for models to focus on relevant parts of input
Autoencoder
Neural network that learns compressed representations
Embedding
Dense vector representations of discrete objects
Fine-tuning
Adapting pre-trained models to specific tasks