Category: Other

The Evolution of AI: From Text Generation to Multimodal Tool Masters

How Large Language Models are Revolutionizing Problem-Solving Through Tool Integration and Multimodality The landscape of artificial intelligence has shifted dramatically. What started as impressive text generators have evolved into sophisticated problem-solving systems that mirror human intelligence in their approach to complex tasks. Today, we’re witnessing a fundamental transformation in how AI systems operate – they’re no longer confined to generating text in isolation, but actively leverage external tools and multimodal capabilities to tackle real-world challenges.…
Read more

dH #016: Problems with Stochastic Gradient Descent and the Momentum Solution

highlights: Understanding the fundamental challenges in optimization and how momentum-based approaches provide elegant solutions. This post will give deeper knowledge about more advanced methods for optimization of Machine Learning models. Source: This post is inspired by Lecture of Prof. Justin Johnson, from Michigan University:  https://www.youtube.com/watch?v=YnQJTfbwBM8 The goal of the post is to present the most important ideas, along with the graphs, and it can be used for quick recap of the main ideas.  Problems with SGD…
Read more

dH 015# Optimization in Neural Networks: A Journey Through Loss Landscapes

*Understanding loss functions and the mathematical framework that drives neural network training* — ## The Foundation of Optimization The slide introduces the topic of Optimization, setting up our framework for understanding loss functions and weight matrices. But for the purpose of today, we’re mostly going to extract all those away and just think about the loss function as an abstract function. It inputs the weight matrix and outputs this scalar value of the loss. During…
Read more

dH #014: Understanding SVM Loss Functions: From Theory to Practice

*A deep dive into multiclass SVM loss with practical examples and mathematical insights* — ## The Foundation: Multiclass SVM Loss The multiclass SVM loss function ensures the score of the correct class should be higher than all other scores. This fundamental principle drives how we evaluate and optimize classification models, creating a robust framework for distinguishing between multiple categories. The loss function is visualized with a graph showing the relationship between the highest score among…
Read more

dH #013: Neural Networks: Linear Classifiers – The Foundation of Deep Learning – Part 1

*Understanding the fundamental building blocks that power modern AI systems* — ## The Building Blocks of Neural Networks One of the most basic blocks that you’re going to have in your toolbox when you build large complicated neural networks is a linear classifier, illustrated here with stacked building blocks showing the layered structure. Much of the intuition and technical bits that we’ll cover today will carry over completely to the neural network systems that we’ll…
Read more