October 2024 – Insights into World Wide Web

Deep Learning Model Precision: FP32, BF16, INT8 and INT4

Dr. Anjing Wang October 31, 2024 0 Comments

When training or deploying deep learning models, precision isn’t just about getting accurate predictions—it’s also about finding the right balance between performance, memory usage, and speed. Choosing the optimal precision…

LoRA vs. QLoRA: Fine-Tuning Your Model’s Superpowers on a Budget

Dr. Anjing Wang October 31, 2024 0 Comments

Have you ever tried running a colossal language model on a GPU that feels more like a toaster than a supercomputer? Enter LoRA and QLoRA—two magical spells for squeezing every…

AI Infra

How to Run a CUDA Docker Image on an AWS Ubuntu LTS Instance

Dr. Anjing Wang October 29, 2024 0 Comments

Running a CUDA Docker image on an AWS Ubuntu instance enables you to leverage GPU-accelerated computations directly within Docker containers. In this guide, we’ll walk through the process of installing…

AI Infra

How to Install NVIDIA driver on AWS Ubuntu LTS 24.04

Dr. Anjing Wang October 29, 2024 1 Comments

Installing the NVIDIA driver on an AWS EC2 instance running Ubuntu 24.04 can sometimes be challenging due to AWS’s custom environment and kernel. Although the ubuntu-drivers tool is the recommended…

Infra

Deep Learning Model Precision: FP32, BF16, INT8 and INT4

LoRA vs. QLoRA: Fine-Tuning Your Model’s Superpowers on a Budget

How to Run a CUDA Docker Image on an AWS Ubuntu LTS Instance

How to Install NVIDIA driver on AWS Ubuntu LTS 24.04

Running One Linux Distribution Inside Another Using Docker: A Deep Dive

Fine-Tuning Qwen Models: Understanding the Key Parameters

Why PyTorch Uses Its Own Tensor Library Instead of NumPy