
Neural Network

intermediate

Neural networks are the mathematical brains behind modern AI - think of them as simplified versions of how your actual brain processes information.

  • They're made up of interconnected "nodes" that work together to find patterns in data
  • Unlike traditional algorithms that follow rigid rules, neural networks learn from examples
  • They're the foundation behind everything from ChatGPT to image recognition
  • The "neural" part is inspired by how brain cells connect, but they're much simpler than the real thing

Neural networks have become the dominant approach in AI because they excel at handling messy, complex data that breaks traditional programming approaches.

What is a neural network?

A neural network is a type of machine learning algorithm that's inspired by how brain cells (neurons) connect and communicate with each other. But before you get too excited about artificial brains, let's be clear: they're much simpler than the real thing.

Machine learning algorithms span a huge range of complexity: they can be as simple as linear regression (which you may have learned about in Statistics 101) or as complex as a neural network with millions of nodes. The models that have made headlines recently are mind-bogglingly complex, and took the work of hundreds of people, not to mention decades of collective research.

Think of a neural network like a really sophisticated pattern-matching machine. If you showed a traditional computer program a picture and asked "is this a cat?", you'd have to write thousands of lines of code describing what makes a cat a cat (pointy ears, whiskers, four legs, etc.). A neural network, on the other hand, learns what a cat looks like by studying thousands of cat photos until it figures out the patterns on its own.

How do neural networks work?


Neurons are the basic building blocks of AI architectures, modeled after the actual biological neurons that transmit signals throughout the human brain. Remember, AI models are essentially pattern investigators; they find the underlying pattern in the data. You can think of these neurons as the mathematical functions that are doing this hard investigative work, getting into the weeds of the data and figuring out what’s going on.

The math performed by individual neurons is actually pretty simple – it’s usually just basic multiplication and addition that you could do with a calculator. So how are AI models able to capture such complex patterns, like the ones involved in language and vision? The trick is to string together a lot of neurons – like hundreds of millions of them.
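That multiply-and-add can be sketched in a few lines of Python. The weights, bias, and choice of ReLU activation here are invented for illustration:

```python
# A minimal sketch of the arithmetic inside a single artificial neuron:
# multiply each input by a weight, sum the results with a bias, then
# pass the total through a simple nonlinearity.

def neuron(inputs, weights, bias):
    # Weighted sum: just basic multiplication and addition
    total = sum(x * w for x, w in zip(inputs, weights)) + bias
    # ReLU activation: pass positive values through, clip negatives to 0
    return max(0.0, total)

output = neuron(inputs=[1.0, 2.0], weights=[0.5, -0.25], bias=0.1)
# 1.0*0.5 + 2.0*(-0.25) + 0.1 = 0.1
```

Nothing here is beyond a calculator; the power comes from chaining millions of these together.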

This stringing together is where our first “decision” – and thus the early stages of an architecture – starts to come into play. Researchers can combine neurons in two ways.

First, neurons can be lined up in a sequence, so the output of one becomes the input of the next.

Neurons can also be stacked in layers, where they don’t interact directly but take the same input values.

Some special neurons can even accept their own output and use it to update their internal function, in a kind of simulated memory. This is helpful when you’re handling a sequence of data inputs, like a bunch of frames from the same video, and you want your model to use knowledge from earlier frames to contextualize what’s happening in later frames.
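That "simulated memory" idea can be sketched as a neuron that mixes each new input with its own previous output. The 0.5/0.5 mixing weights are arbitrary illustrative choices, not anything a real model would fix in advance:

```python
# A toy "neuron with memory": it folds its previous output back into
# the next computation, a much-simplified version of the recurrent
# units described above.

class RecurrentNeuron:
    def __init__(self):
        self.state = 0.0  # last output, carried across time steps

    def step(self, x):
        # Blend the new input with the remembered state
        self.state = 0.5 * x + 0.5 * self.state
        return self.state

neuron = RecurrentNeuron()
outputs = [neuron.step(x) for x in [1.0, 1.0, 1.0]]
# Each step remembers a little of the past: 0.5, 0.75, 0.875
```

Feeding the same input three times produces three different outputs, because earlier steps leave a trace in the neuron's state.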

Put these configurations together, and you’ve got an Artificial Neural Network – the most basic model architecture. Neural networks are just layers (stacks of neurons) arranged in a sequence.
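Putting both configurations together, a minimal feedforward network can be sketched like this, with made-up weights purely for illustration:

```python
# Layers (stacks of neurons) arranged in a sequence: every neuron in a
# layer reads the same inputs, and the layer's outputs become the next
# layer's inputs.

def neuron(inputs, weights, bias):
    total = sum(x * w for x, w in zip(inputs, weights)) + bias
    return max(0.0, total)  # ReLU activation

def layer(inputs, weight_rows, biases):
    # A layer: several neurons side by side, all seeing the same inputs
    return [neuron(inputs, w, b) for w, b in zip(weight_rows, biases)]

def network(inputs, layers):
    # A network: layers chained so one layer's output feeds the next
    for weight_rows, biases in layers:
        inputs = layer(inputs, weight_rows, biases)
    return inputs

# Two inputs -> hidden layer of 2 neurons -> single output neuron
out = network([1.0, 2.0], [
    ([[0.5, 0.5], [1.0, -1.0]], [0.0, 0.0]),  # hidden layer
    ([[1.0, 1.0]], [0.0]),                     # output layer
])
```

The whole network is itself just a function from inputs to outputs, which is why architectures can be nested inside larger architectures.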

Different networks might follow different rules. In the simplest setup, a neuron accepts input from every single neuron in the layer preceding it; this is what computer scientists call a fully connected network (when those layers are arranged in a simple forward sequence, the configuration has an even more specific name: the Feedforward Neural Network, or FNN). But networks can also be partially connected, meaning that neurons selectively accept input from neurons in the previous layer.

You might notice that – just like an individual neuron – a neural network takes an input and returns an output. The architecture itself can be treated like a big mathematical function, and used as part of an even larger architecture.

What's the difference between neural networks and traditional algorithms?


Though neural networks have been around since the 1940s, they are far from the only algorithm used for AI and machine learning. When I was coming up through my undergrad Data Science degree, the most popular algorithms were things like Linear Regression, Decision Trees, and K-Means Clustering. All of these algorithms have a time and a place…but for GenAI, neural networks have won out as the primary architecture.

One way to think about why is how general-purpose neural networks are. Throw enough data and compute at them, and they'll be able to find patterns in pretty much any type of data, whereas many of the more traditional algorithms are best fit to a specific type of data or problem.
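A classic illustration of that limitation, sketched with NumPy: XOR is a tiny pattern that no linear model can capture, no matter how its weights are chosen. Least squares finds the best possible linear fit, and it still misses every point:

```python
import numpy as np

# XOR: output is 1 when exactly one of the two inputs is 1
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([0, 1, 1, 0], dtype=float)

# Add a bias column and solve for the best linear fit w1*x1 + w2*x2 + b
A = np.hstack([X, np.ones((4, 1))])
coef, *_ = np.linalg.lstsq(A, y, rcond=None)
predictions = A @ coef

# The best a purely linear model can do is predict 0.5 for every
# input -- wrong on all four examples. A small neural network, with
# its nonlinear neurons, handles XOR easily.
```

This is exactly the kind of "bent" pattern that stacking nonlinear neurons unlocks.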

Neural networks have essentially become the foundation of modern AI for one simple reason: they work incredibly well with the kinds of complex, unstructured data that we actually care about. Want to understand human language? Neural networks. Want to recognize objects in photos? Neural networks. Want to predict what song you'll like next? You guessed it.


The breakthrough came when researchers figured out how to make these networks much bigger and train them on vastly more data than ever before. It turns out that the bigger the network and the more data you feed it, the better it gets. And with the internet providing essentially unlimited training data, neural networks have become scarily good at tasks that used to be impossible for computers.

How do neural networks learn?


So far, our architectures have just been arrangements of many indistinguishable computing units (neurons). But not all neurons are created equal. Researchers can create neurons that do, well, anything, each with a specific problem or data type in mind. These neurons or units can be attached to the front or tail end of an existing neural network to form even more complex and powerful models.

The learning process itself happens during a phase called training. When you train a neural network, you feed it data and let it make predictions or guesses. The network compares its output to the correct answer, measures the difference, and adjusts its internal parameters to perform better next time. This adjustment process is repeated millions of times until the network becomes very good at recognizing patterns.

Training relies on a process called backpropagation, which is essentially how the network teaches itself. Backpropagation works by calculating how much each neuron contributed to the final error, then slightly tweaking the neuron’s internal weights to reduce that error on the next pass. Over time, this fine-tuning lets the network minimize mistakes and generalize from examples to new data it has never seen before.
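A toy version of that loop: a single "neuron" with one weight learns the pattern y = 2x by repeatedly measuring its error and stepping the weight downhill. The data and learning rate are invented for illustration, and real backpropagation applies this same gradient step through every layer of a network:

```python
# Guess, measure the error, nudge the weight to shrink it, repeat.

# Training data: the hidden pattern is simply y = 2 * x
data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]

w = 0.0              # start from an uninformed guess
learning_rate = 0.05

for _ in range(200):             # many passes over the data
    for x, target in data:
        prediction = w * x
        error = prediction - target
        # The gradient of the squared error (error**2) with respect
        # to w is 2 * error * x; step downhill to reduce the error
        w -= learning_rate * 2 * error * x

# After training, w has been nudged very close to the true value of 2
```

Scale this adjustment up to millions of weights across many layers, and you have the training process behind modern models.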

Once trained, these networks can perform tasks ranging from recognizing images and translating languages to writing coherent paragraphs. Their power comes from the fact that each layer of neurons learns to represent the world a little differently, with deeper layers capturing more abstract patterns and relationships.

Frequently Asked Questions About Neural Networks

How many nodes does a neural network have?

Neural networks can range from a handful of nodes to architectures with billions or even trillions of parameters (the weighted connections between nodes), depending on the model's size and purpose. With GPT-5, while OpenAI has not publicly disclosed an exact parameter count, most independent estimates place it in the low-trillion range or higher. Some analyst write-ups suggest around 1.7–1.8 trillion parameters for a dense-style version, while others argue that if a "mixture-of-experts" (MoE) architecture is used, the total across all experts could reach tens of trillions.

So instead of the roughly 175 billion parameters of GPT-3, you're looking at potentially thousands of billions of connections, i.e. trillions, for GPT-5. That's far beyond the star count of the Milky Way, but the exact figure remains a guarded secret. Despite the scale, engineers still manage to run it on server farms, and yes, it can still write you a poem about your cat.
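Some rough back-of-the-envelope arithmetic on what a figure like that implies, using the unconfirmed 1.8 trillion estimate quoted above and a common 16-bit storage format:

```python
# What "trillions of parameters" means in hardware terms. The 1.8
# trillion figure is an analyst estimate, not a confirmed number.

params = 1.8e12          # estimated parameter count
bytes_per_param = 2      # 16-bit floating point, a common choice
total_bytes = params * bytes_per_param

terabytes = total_bytes / 1e12
# ~3.6 TB of memory just to hold the weights -- far more than any
# single GPU offers, which is why such models are split across
# server farms.
```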

Are neural networks actually like the human brain?

Not really, despite what the marketing materials might suggest. They're inspired by how brain cells connect, but real neurons are vastly more complex and mysterious. Think of neural networks as a very simplified cartoon version of brain function - useful for getting computers to recognize patterns, but about as similar to a real brain as a paper airplane is to a Boeing 747.


Do neural networks think?

This is where things get philosophical real quick. Neural networks are really good at pattern matching and can produce outputs that seem intelligent, but whether that constitutes "thinking" is the kind of question that keeps philosophers and AI researchers up at night. What we can say is they process information in ways that can be surprisingly human-like, even if the underlying process is just math.

How long does it take to train a neural network?

Anywhere from your lunch break to several months, depending on what you're building. Training ChatGPT took weeks on thousands of powerful computers and cost millions of dollars. A simple network for recognizing handwritten digits might train in an hour on your laptop. The general rule: the more impressive the AI, the more expensive and time-consuming it was to create.

Can neural networks make mistakes?

Oh absolutely, and sometimes in spectacular ways. They might confidently identify a chihuahua as a muffin, or translate "hydraulic ram" as "water sheep" (true story). Neural networks are only as good as their training data, and they can be fooled by adversarial examples or situations they've never encountered before.

What happens when neural networks get bigger?

Generally, they get better at their tasks, but with diminishing returns and exponentially increasing costs. There's also an interesting phenomenon called "emergence" where larger networks suddenly develop capabilities that smaller ones don't have — like the ability to reason through multi-step problems or understand context in surprisingly sophisticated ways.

Read the full post ↗

The beginner’s guide to AI model architectures

Unlike an onion, hopefully these neural network layers won't make you cry.
