LoRA vs. QLoRA: Fine-Tuning Your Model’s Superpowers on a Budget
Have you ever tried running a colossal language model on a GPU that feels more like a toaster than a supercomputer? Enter LoRA and QLoRA—two magical spells for squeezing every…
Have you ever tried running a colossal language model on a GPU that feels more like a toaster than a supercomputer? Enter LoRA and QLoRA—two magical spells for squeezing every…
Running a CUDA Docker image on an AWS Ubuntu instance enables you to leverage GPU-accelerated computations directly within Docker containers. In this guide, we’ll walk through the process of installing…
Installing the NVIDIA driver on an AWS EC2 instance running Ubuntu 24.04 can sometimes be challenging due to AWS’s custom environment and kernel. Although the ubuntu-drivers tool is the recommended…
Fine-tuning large language models (LLMs) can be a challenging process due to the variety of parameters and configurations involved. In this blog, we’ll break down key parameters used to fine-tune…
If you’re just starting out with Python and have heard of NumPy, you probably know it’s a fantastic library for handling numbers, arrays, and matrices. So, why would PyTorch, a…
In the world of deep learning, images are a critical form of data. Whether you’re building a computer vision model, training on image datasets, or working on image processing tasks,…
When exploring your system’s GPU settings, you may encounter various memory types that can initially be confusing. Terms like Total Available Graphics Memory, Dedicated Video Memory, System Video Memory, and…
As the use of WSL (Windows Subsystem for Linux) for development becomes increasingly popular among programmers, many AI engineers are adopting WSL and Visual Studio Code as their preferred environment…
Upgrade to Tesseract OCR 5 on Ubuntu with a simple PPA addition, unlocking advanced text recognition features and accuracy.
This guide will walk you through installing PPOCRLabel on Windows, ensuring you can start your OCR projects quickly.
Microsoft Azure provides excellent AI speech service, but how it organizes its documents and SDK drags me back to the bad memories when I read Microsoft Visual C++ MSDN documents…
We'll go step by step to bring up a model on runpod.io and chat with it using text-generation-webui.
we'll explore how to use ChatGPT plugins in a general way with VoxScript, WebPilot, and AskYourPdf as examples.
We discuss similarity metrics such as Cosine Similarity, and Inner Product, especially why Inner Product can be used as a similarity metric.
We'll take a look at how OpenAI enables text embedding and use Python to do an exercise to cluster some texts.