Forem

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Unlocking AI's Universal Secrets: Do Neural Networks Think in Fractals?

Unlocking AI's Universal Secrets: Do Neural Networks Think in Fractals?

Comments
2 min read
How I Built a 6B Image Model That Runs on a 16GB GPU (Z-Image)
Cover image for How I Built a 6B Image Model That Runs on a 16GB GPU (Z-Image)

How I Built a 6B Image Model That Runs on a 16GB GPU (Z-Image)

Comments
2 min read
🧑‍🚀 LLM Engine Telemetry: How to Profile Models and See Where Performance is Lost
Cover image for 🧑‍🚀 LLM Engine Telemetry: How to Profile Models and See Where Performance is Lost

🧑‍🚀 LLM Engine Telemetry: How to Profile Models and See Where Performance is Lost

Comments
5 min read
How Neural Networks Learn – A Simple Guide to Machine Learning & Deep Learning
Cover image for How Neural Networks Learn – A Simple Guide to Machine Learning & Deep Learning

How Neural Networks Learn – A Simple Guide to Machine Learning & Deep Learning

Comments
6 min read
Unlocking AI's Inner Geometry: Scale-Agnostic Structures in Neural Networks

Unlocking AI's Inner Geometry: Scale-Agnostic Structures in Neural Networks

Comments
2 min read
The Hidden Geometry of AI: A Scale-Free Secret to Smarter Networks

The Hidden Geometry of AI: A Scale-Free Secret to Smarter Networks

Comments
2 min read
Open-Weight AI for High-Quality Image Generation & Editing
Cover image for Open-Weight AI for High-Quality Image Generation & Editing

Open-Weight AI for High-Quality Image Generation & Editing

Comments
4 min read
Tame Your LLMs: A New Optimizer for Robust Deep Learning

Tame Your LLMs: A New Optimizer for Robust Deep Learning

Comments
2 min read
Surgical Precision with AI: A New Era in Lung Cancer Staging

Surgical Precision with AI: A New Era in Lung Cancer Staging

Comments
2 min read
Anon: The Adaptive Optimizer Bridging SGD and Adam for Peak AI Performance

Anon: The Adaptive Optimizer Bridging SGD and Adam for Peak AI Performance

Comments
2 min read
Turbocharge Your LLMs: A Breakthrough in Neural Network Optimization

Turbocharge Your LLMs: A Breakthrough in Neural Network Optimization

Comments
2 min read
Introducing PQNT — A New Power-Law Quantization Method
Cover image for Introducing PQNT — A New Power-Law Quantization Method

Introducing PQNT — A New Power-Law Quantization Method

Comments
1 min read
Unveiling the Hidden Geometry That Supercharges Neural Nets

Unveiling the Hidden Geometry That Supercharges Neural Nets

Comments
2 min read
How Search Engines Actually Answer Your Questions
Cover image for How Search Engines Actually Answer Your Questions

How Search Engines Actually Answer Your Questions

Comments
11 min read
BATCHNORM IN LANGUAGE MODELS
Cover image for BATCHNORM IN LANGUAGE MODELS

BATCHNORM IN LANGUAGE MODELS

Comments
16 min read
Giving AI Eyes: Multi-Modal LLMs

Giving AI Eyes: Multi-Modal LLMs

Comments
9 min read
Tokenization in NLP: The Foundational Step That Turns Language Into Data
Cover image for Tokenization in NLP: The Foundational Step That Turns Language Into Data

Tokenization in NLP: The Foundational Step That Turns Language Into Data

Comments
3 min read
Unlocking Data's Hidden Geometry: A New Era for Neural Networks by Arvind Sundararajan

Unlocking Data's Hidden Geometry: A New Era for Neural Networks by Arvind Sundararajan

Comments
2 min read
Linear Algebra for AI
Cover image for Linear Algebra for AI

Linear Algebra for AI

1
Comments
2 min read
Cross-Modal Embeddings: Bridging AI Modalities

Cross-Modal Embeddings: Bridging AI Modalities

Comments
11 min read
Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Comments
3 min read
Stock Price Prediction by ML Models

Stock Price Prediction by ML Models

Comments
1 min read
AI vs ML vs DL vs GenAI: Demystifying the Buzzwords
Cover image for AI vs ML vs DL vs GenAI: Demystifying the Buzzwords

AI vs ML vs DL vs GenAI: Demystifying the Buzzwords

1
Comments 2
3 min read
Fixing Identity Drift in AI Image Generation with a Deterministic Constraint Layer (Minimal PoC Inside)

Fixing Identity Drift in AI Image Generation with a Deterministic Constraint Layer (Minimal PoC Inside)

Comments
2 min read
How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)
Cover image for How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

Comments
2 min read
loading...