AI – Page 4 – Knowledge sparks

What is Hierarchical Classification + Python Code

by Kurious Fox
September 23, 2025September 27, 2025

Hierarchical classification is a method of assigning items to a category that is part of a larger, structured hierarchy. Unlike traditional “flat” classification where categories are independent, hierarchical classification considers the relationships between categories, organizing…

Segmentation: definition & types

by Kurious Fox
September 19, 2025September 19, 2025

“Do you know what segmentation is?” Jon asked the magician. The magician replied: “It’s the process of dividing an image into meaningful regions” “like giving every pixel a role. So, you know the pixels that…

YOLO

by Kurious Fox
September 18, 2025September 20, 2025

“The YOLO-scope! It stands for ‘You Only Look Once.” He aimed the YOLO-scope at a passing dragon. A neat little box appeared on the viewing screen around the creature, with a label that read: ‘Dragon…

Fast R-CNN

by Kurious Fox
September 18, 2025September 18, 2025

Blaze was fascinated by the tiny details of the world below. He dreamt of a way to instantly recognize every flower, every rock, and every scurrying critter in the meadow when he flies. One day,…

Inception: The Neural Network That Thinks in Parallel

by Kurious Fox
September 18, 2025September 18, 2025

“Most neural networks pick one filter size at a time. But why not do all of them at once?” Inception said. Give me a photo, I can zoom in on tiny details, look at medium-sized…

Example of LangGraph with Ollama for conditional logic

by Kurious Fox
September 18, 2025September 18, 2025

Here, we will use the LangGraph library to create a simple AI agent that can decide whether to answer a user’s question directly or use a “search” tool to find the answer first. The system…

What’s LangGraph

by Kurious Fox
September 17, 2025September 17, 2025

LangGraph is a powerful, open-source framework for building and managing complex, stateful, and long-running AI agents. It provides a flexible and controllable way to create sophisticated AI workflows by representing them as graphs. At its…

Perceptual loss

by Kurious Fox
September 17, 2025September 17, 2025

Perceptual loss is a type of loss function used in AI, especially for tasks like creating or changing images. Instead of comparing two images pixel by pixel, it measures the difference between them based on…

crewAI: Orchestrating Collaborative AI Agents for Complex Task: example in Ollama

by Kurious Fox
September 16, 2025September 17, 2025

Developed by João Moura, crewAI provides a structured environment for orchestrating autonomous AI agents, enabling them to collaborate and tackle complex tasks that would be challenging for a single AI model to handle alone. At…

Fixed: crewai 0.186.1 requires litellm==1.74.9, but you have litellm 1.77.1 which is incompatible.

by Kurious Fox
September 16, 2025September 16, 2025

This is a common dependency conflict. So, sometimes, when you successfully install crewAI, you may encounter problems when using it. To resolve this, you need to install the specific version of litellm that crewai requires.…

Ollama models that can be run on a laptop

by Kurious Fox
September 16, 2025September 16, 2025

Running large language models locally on a laptop is becoming increasingly feasible, and Ollama makes it accessible. The key to a good experience is choosing a model that matches your laptop’s hardware, primarily its RAM…

pywin32

by Kurious Fox
September 15, 2025September 15, 2025

pywin32 lets your Python scripts directly control the Windows operating system and its applications. It acts as a bridge, giving you access to the vast Windows Application Programming Interface (API) from within Python. 🤖 Think…

RAG

by Kurious Fox
September 15, 2025September 15, 2025

RAG stands for Retrieval-Augmented Generation. It’s a powerful technique used in artificial intelligence to make Large Language Models (LLMs) like me more accurate, up-to-date, and trustworthy. The Simple Analogy: An “Open-Book Exam” Think of a…

LangChain: The Powerhouse Behind Intelligent Language Applications

by Kurious Fox
September 15, 2025September 15, 2025

LangChain is an open-source framework designed to simplify the creation of applications powered by large language models (LLMs). Available in both Python and JavaScript, it provides a modular and extensible architecture that allows developers to…

LlamaIndex: Bridging the Gap Between Your Data and Large Language Models

by Kurious Fox
September 15, 2025September 15, 2025

LlamaIndex is a powerful and flexible open-source data framework designed to connect custom data sources to large language models (LLMs). In essence, it acts as a crucial bridge, enabling developers to build applications that can…

The Specter in the Machine: Understanding Hallucinations in Large Language Models

by Kurious Fox
September 6, 2025September 6, 2025

In the rapidly advancing world of artificial intelligence, Large Language Models (LLMs) have emerged as powerful tools capable of generating human-like text, translating languages, and answering questions with remarkable fluency. However, these sophisticated models are…

selective focus photography of brown camel

RAG with EmbeddingGemma with Python Code using Ollama

by Kurious Fox
September 5, 2025September 5, 2025

Retrieval-Augmented Generation (RAG) is a powerful technique that enhances the capabilities of Large Language Models (LLMs) by connecting them to external knowledge sources. According to Google Developer website, EmbeddingGemma is a compact, open‑source embedding model…

AdamW optimization and implementation in PyTorch

by Kurious Fox
September 1, 2025October 12, 2025

The AdamW method was proposed in the paper “Decoupled Weight Decay Regularization” by Ilya Loshchilov and Frank Hutter. While the paper was officially published at the prestigious International Conference on Learning Representations (ICLR) in 2019,…

Train AI Models Faster and Better: The Power of Progressive Resizing

by Kurious Fox
August 27, 2025October 12, 2025

In the world of computer vision, we’re always chasing two things: better accuracy and faster training. The conventional wisdom is to use the largest, highest-quality images you can from the very beginning. But what if…

Types of Pooling operations

by Kurious Fox
August 23, 2025August 24, 2025

“You said pooling operations in Convolutional Neural Networks (CNNs) are like the magical zoom-out buttons.” “They reduce the size of feature maps while keeping the juicy bits of information. But how?” Peter asked. “There are…

Why CNNs are so effective

by Kurious Fox
August 23, 2025August 24, 2025

“Professor, why is CNN so effective?” “CNNs don’t just look at the whole image like a confused tourist—they zoom in on tiny patches (called kernels) and analyze them like Sherlock Holmes inspecting clues.” “Ok. This…

The CNN Workflow

by Kurious Fox
August 23, 2025August 24, 2025

“What’s CNN workflow?” Alex asked. Peter replied, “If we have an input image represented as a tensor, like a 32×32 pixel image with 3 color channels (Red, Green, Blue) would have a shape of 32x32x3.”…

ResNet – Residual Network

by Kurious Fox
August 21, 2025August 21, 2025

I’m building a super tall tower out of Lego blocks. Each block is a layer in a neural network. The taller the tower, the more complex patterns it can learn. But the problem is “Tall…

AlexNet: The CNN That Changed Everything

by Kurious Fox
August 21, 2025August 21, 2025

“Hey Alex, do you know what AlexNet is?” The little spirit asked Alex. “AlexNet is a game changer. Many years ago, everyone were using basic machine learning models to recognize images — and they were…

VGGNet in the Magic Canvas

by Kurious Fox
August 21, 2025August 21, 2025

“Wow. So this is the canvas that can do image classification and object detection?” Vixel asked. “Yes, I am VGG. VGG stands for Visual Geometry Group.” the Canvas replied. “More exactly, I’m VGG19, which means…

convolutional operations and convolutional neural networks (CNNs) — the backbone of modern computer vision

by Kurious Fox
August 20, 2025August 20, 2025

“Hey, Kernel. You work for Mr. Convolution, right? What do you do there?” The pixelated giant asked, to which the young Kernel response, “A convolution is a mathematical operation that blends two functions to produce…

Object Detection

by Kurious Fox
August 20, 2025August 24, 2025

“I love sortering, especially beautiful mushrooms like this.” Jon thought “But I heard something on object detection trying to micmic human ability. It combines object localization to create bounding boxes around each object and then…

Segmentation by Thresholding: Techniques and Python Implementation

by Kurious Fox
August 20, 2025September 19, 2025

“The cat is so cute, but not the carpet! I want to grab the cat area only.” “What should I do now, Mr. Crystal?” Kevin asked, to which the magic crystal replied, “You can do…

Convolution & Filtering

by Kurious Fox
August 20, 2025August 20, 2025

“The world was a dazzling mosaic of colors and shapes, but I wonder if the magical computer to see it differently.” The little fairy thought and flew to the house of the Great Wizard. “It’s…

Image Transformations for Data Augmentation

by Kurious Fox
August 19, 2025August 20, 2025

“Professor Elara,” chirped the little kid, “Can show me the magic of Image Transformations?” and Professor Elara blinked slowly. “Ok. We begin with Scaling.” With a gentle wave of her wing, Professor Elara conjured a…

Pixel Operations

by Kurious Fox
August 19, 2025August 19, 2025

Pixels are the smallest units of a digital image — think of them as the individual tiles in a mosaic. Each pixel holds color and intensity information, and by manipulating these values, we can transform…

Image Formation: Pixels & Color Spaces

by Kurious Fox
August 19, 2025August 19, 2025

In a realm painted with light and shadow, there lived tiny sprites of light called Pixels. They were the weavers of the visual world, each a tiny, glowing dot of energy. The more Pixels that…

« Previous
1
2
3
4
5
6
…
11
Next »