Member-only story

A Software Engineer’s Explanation of Deep Learning Models

⚡️Hudson Ⓜ️endes
7 min readJun 24, 2020

--

Non-data scientists often use the word Model to mistakenly refer to the what in Machine Learning we canonically call Model Architecture. And it couldn't be more wrong!

Is it True? All hard-coded? (Image Source: Luke Rixson)

Wanna hear occasional rants about Tensorflow, Keras, DeepLearning4J, Python and Java?

Join me on twitter @ twitter.com/hudsonmendes!

Taking Machine Learning models to production is a battle. And there I share my learnings (and my sorrows) there, so we can learn together!

The Model Architecture is very important to the Model. Without the variety of architectures we have we would never be able to fit data for the vastness of different problems that are currently being successfully solved by Deep Learning.

However, the Weights are a vital part of the model too. Different Weights can describe functions that have absolutely different geometry.

Model = Architecture (a.k.a. algorithm) + Weights (a.k.a. parameters)

Let's have a look at how it works:

--

--

⚡️Hudson Ⓜ️endes
⚡️Hudson Ⓜ️endes

Written by ⚡️Hudson Ⓜ️endes

⚡️Staff AI/ML Engineer & Senior Engineering Manager, #NLP, opinions are my own. https://linkedin.com/in/hudsonmendes

No responses yet