LoRA Adapter Size Calculator

Trainable parameters and file size of LoRA adapters from rank, target modules and model architecture.

LayersHidden sizeLoRA rankTarget modulesFFN size (for all-linear)

—

Trainable params (M)

—

Adapter file (FP16) (MB)

—

Of a 7B base (%)

Defaults (7B-class, r=16, q/k/v/o) give a 33.5M-parameter, 67 MB adapter — 0.48% of the base model. Doubling rank doubles size; adding MLP targets roughly quadruples it and usually helps more than raising rank.

Formula

params per adapted matrix = r × (d_in + d_out) · total = layers × Σ adapted matrices — file size = params × 2 B (FP16)

References: Hu et al. (2021), LoRA: Low-Rank Adaptation of Large Language Models; Dettmers et al. (2023), QLoRA (all-linear targeting)

About LoRA Adapter Size Calculator

LoRA's economics live in one formula: each adapted weight matrix gains two skinny factors costing rank × (in + out) parameters — typically a fraction of a percent of the base model. This calculator turns rank and target-module choices into exact trainable-parameter counts and the MB your adapter file will weigh on the Hub. The defaults match the most common 7B recipe; switch to all-linear targeting to see why QLoRA's authors recommend it (4× the adapter, still under 2% of base) or to q,v-only to reproduce the original paper's minimal setup.

How to use LoRA Adapter Size Calculator

1Enter your values into LoRA Adapter Size Calculator — sensible, domain-typical defaults are pre-filled so you see a real result immediately.
2The result recomputes live using the formula shown on the page; there is no button to press.
3Adjust any input to compare scenarios, then read the worked example to see the substituted numbers.

Why use LoRA Adapter Size Calculator?

✓Computes LoRA Adapter Size instantly in your browser — no sign-up, no upload, no server round-trip.
✓100% free and unlimited, with the exact formula shown: params per adapted matrix = r × (d_in + d_out).
✓Runs entirely client-side, so every value you enter stays private on your device.
✓Live recompute as you type, with a worked example and authoritative references for trust.

Frequently asked questions

What rank should I use?+

r=8–16 handles most style/format/domain fine-tunes; r=32–64 for harder behavioral shifts or multi-task adapters. The QLoRA ablation found TARGET COVERAGE (adapting all linear layers) mattered more than rank — broaden targets before raising r.

Why are LoRA files sometimes bigger than this estimate?+

Checkpoints may store optimizer states (3× larger), keep FP32 copies, or include the merged base modules. A clean save_pretrained adapter at FP16 should match this calculator within a few MB; anything 10× bigger is carrying training baggage.

Does a bigger adapter slow inference?+

Unmerged, each adapted layer adds a small bypass matmul — a few percent latency. Merged (W + BA baked in), inference cost is exactly the base model's: zero overhead. Merge for deployment; keep adapters separate when hot-swapping many customers' tunes.

What is alpha and does it change the size?+

Alpha is a scalar scaling (effective update = α/r · BA) — zero parameters, zero size impact. Convention sets α = 2r or α = r; what matters is consistency between training and loading, or generations subtly weaken/strengthen.

Related tools

Related ML & AI tools

🧠

ROC-AUC Calculator (from TPR/FPR points)

Trapezoidal area under the ROC curve from your (FPR, TPR) operating points — the threshold-independent ranking score.

● Live

🧠

Classification Threshold Cost Calculator

Find the probability cutoff that minimizes expected cost given your false-positive and false-negative penalties.

● Live

🧠

Silhouette Score Calculator

Cluster cohesion vs separation for one point — the building block of the silhouette metric for choosing K.

● Live