Category: Deep Learning

Quantisation-Aware Training

Quantisation-Aware Training is a method used to prepare machine learning models for running efficiently on devices with limited computing power, such as smartphones or embedded systems. It teaches the model to handle the reduced precision of numbers, which happens when large models are made smaller by using fewer bits to represent data. This approach helps…

Attention Rollout

Attention Rollout is a technique used to visualise and interpret how information flows through the layers of an attention-based model, such as a transformer. It helps to track which parts of the input the model focuses on at each stage, giving insight into the decision-making process. This method combines attention maps from different layers to…

Recursive Neural Networks

Recursive Neural Networks are a type of artificial neural network designed to process data with a hierarchical or tree-like structure. They work by applying the same set of weights recursively over structured inputs, such as sentences broken into phrases or sub-phrases. This allows the network to capture relationships and meanings within complex data structures, making…

Activation Functions

Activation functions are mathematical formulas used in neural networks to decide whether a neuron should be activated or not. They help the network learn complex patterns by introducing non-linearity, allowing it to solve more complicated problems than a simple linear system could handle. Without activation functions, neural networks would not be able to model tasks…