AhanaZoo is a model repository where every model is stored in losslessly compressed .aarm format. Browse, download, and run LLMs at 51% smaller size, with full bit-perfect fidelity.
Every model in AhanaZoo is pre-compressed using a WeightTransformer trained on the distribution of neural network parameters across hundreds of architectures, not generic data compression.
Our WeightTransformer is trained specifically on the statistical patterns of neural network weights, combining DPCM delta coding, per-block quantization, and zigzag encoding optimized for float16 tensors.
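To make the pipeline concrete, here is a minimal sketch of DPCM delta coding plus zigzag mapping on a float16 tensor. This illustrates the general technique only: the actual .aarm codec, its trained WeightTransformer, and its per-block quantization layout are not shown here, and the function names are ours.

```python
# Illustrative only -- NOT the .aarm codec. Shows why delta + zigzag helps:
# neighboring weights have similar bit patterns, so deltas are small, and
# zigzag maps signed deltas to small unsigned codes an entropy coder likes.
import numpy as np

def dpcm_zigzag(weights: np.ndarray) -> np.ndarray:
    bits = weights.ravel().view(np.uint16).astype(np.int32)    # raw float16 bit patterns
    deltas = np.diff(bits, prepend=0)                          # DPCM: store differences
    return ((deltas << 1) ^ (deltas >> 31)).astype(np.uint32)  # zigzag: signed -> unsigned

def inverse_dpcm_zigzag(codes: np.ndarray, shape) -> np.ndarray:
    z = codes.astype(np.int64)
    deltas = (z >> 1) ^ -(z & 1)                  # undo zigzag
    bits = np.cumsum(deltas).astype(np.uint16)    # integrate deltas back to raw bits
    return bits.view(np.float16).reshape(shape)   # reinterpret as float16

w = np.random.randn(4, 8).astype(np.float16)
assert np.array_equal(w, inverse_dpcm_zigzag(dpcm_zigzag(w), w.shape))  # bit-perfect
```

Because every step operates on raw bit patterns rather than float values, the round trip is exact, which is what lossless means here.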
Load any single layer from a compressed 70B model in milliseconds. The .aarm JSON table of contents maps every tensor to its exact byte offset, so no sequential scan is required.
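As a sketch of why random access is cheap, the reader below parses a JSON table of contents and seeks directly to one tensor. The actual .aarm header layout is not documented here, so the length prefix and the "offset"/"size" field names are assumptions.

```python
# Hypothetical .aarm reader sketch -- header layout and field names assumed.
import json
import struct

def read_tensor_bytes(path: str, tensor_name: str) -> bytes:
    with open(path, "rb") as f:
        (toc_len,) = struct.unpack("<Q", f.read(8))  # assumed u64 TOC-length prefix
        toc = json.loads(f.read(toc_len))            # {"tensor.name": {"offset": ..., "size": ...}}
        entry = toc[tensor_name]
        f.seek(8 + toc_len + entry["offset"])        # jump straight to the tensor...
        return f.read(entry["size"])                 # ...and read only its bytes

q_proj = read_tensor_bytes("llama-70b.aarm", "model.layers.0.self_attn.q_proj.weight")
```

One seek and one read per tensor, regardless of model size, is what makes single-layer loads take milliseconds even on a 70B archive.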
AhanaZoo models load transparently with the standard HuggingFace API via our ACP5LazyStateDict adapter. Your existing code works; you just download 51% less data.
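A usage sketch of what that might look like in practice. Only ACP5LazyStateDict is named above; the ahanazoo package name, import path, and exact call shape are our assumptions.

```python
# Assumed usage -- package name and API shape are illustrative.
from transformers import AutoModelForCausalLM
from ahanazoo import ACP5LazyStateDict  # hypothetical import path

# Tensors decompress lazily as HuggingFace requests them.
state_dict = ACP5LazyStateDict("ahanazoo/Llama-3-8B.aarm")  # hypothetical repo path
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-8B",  # unchanged model ID and config
    state_dict=state_dict,         # weights come from the compressed archive
)
```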
A 32B-parameter model (60 GB in SafeTensors) compresses to ~29 GB in .aarm. Multiply across a rack of model servers and the savings accumulate fast.
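A quick back-of-the-envelope check of those numbers (the rack size below is illustrative):

```python
safetensors_gb, aarm_gb = 60, 29
savings = 1 - aarm_gb / safetensors_gb   # ~0.517, i.e. the ~51% headline figure
servers = 16                             # illustrative rack of model servers
print(f"{savings:.1%} smaller; ~{(safetensors_gb - aarm_gb) * servers} GB saved per rack")
# -> 51.7% smaller; ~496 GB saved per rack
```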
AhanaZoo hosts a curated set of state-of-the-art open models: Llama, Qwen, Mistral, Phi, and others, all pre-compressed and verified for exact weight fidelity.
Models from AhanaZoo are pre-indexed for CAB (Compressed Activation Buffer) inference, enabling 70B models to run on consumer GPUs via layer streaming, without quantization.
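The CAB machinery itself is not shown here, but the layer-streaming pattern it relies on is simple: keep full weights off the GPU and hold only one layer (plus activations) in device memory at a time. A runnable toy version, with plain Linear layers standing in for decompressed transformer blocks:

```python
# Toy layer-streaming loop -- stand-in layers, not the CAB implementation.
import torch

layers = [torch.nn.Linear(4096, 4096) for _ in range(8)]  # pretend: decompressed blocks
device = "cuda" if torch.cuda.is_available() else "cpu"
hidden = torch.randn(1, 4096).to(device)

with torch.no_grad():
    for layer in layers:   # in .aarm terms: decompress layer i on demand via the TOC
        layer.to(device)   # only this layer's weights occupy GPU memory
        hidden = layer(hidden)
        layer.to("cpu")    # evict before touching the next layer
print(hidden.shape)        # torch.Size([1, 4096])
```

Peak weight memory is one layer instead of the whole model, which is the trade that lets a 70B model run on a consumer GPU at full precision.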
AhanaZoo is in beta. Join early access to be among the first to browse and download compressed models.
Get Early Access →