Asses Performance

Unbeatable Performance
for Your Next AI App

Our state of the art AHNSW Algorithim, built from the ground up for excellent Accuracy and Performance

Optimized for AI

Performance comparison across vector databases for 100k dbpedia dataset
Antarys, Chroma, Qdrant and Milvus

Write Performance (WPS)
Vectors written per second across databases for 1k batches (excluding pinecone since it has a batch size limit, the result is in 100 batches, see request limits)
Query Performance (QPS)
Queries processed per second across databases with different batch sizes
Recall Performance
Search accuracy at top 100 results
Query Latency
Average response time per query

Antarys Scales At Multimodal

Antarys is built from the ground up specific optimisations made for certain vector dimensions, enabling unparalleled performance

Check Image Benchmarks
Query Performance (ms)
The Lower The Better
Antarys
1.66ms
Fastest
Chroma
2.94ms
1.8x slower
Pinecone
13.09ms
7.8x slower
Qdrant
51.47ms
31x slower
Milvus
220.46ms
133x slower

Test Configuration - 1,000 vectors, 512 dimensions, ResNet34 model, Afghan Hound query image

Popular Image Embedding Dimensions

Common embedding dimensions for popular image models and Antarys specific optimization support

Model / FrameworkEmbedding DimensionGPU OptimizedCPU Optimized
CLIP (ViT-B/32)
512
CLIP (ViT-B/16)
512
CLIP (ViT-L/14)
768
OpenCLIP (ViT-H/14)
1024
ResNet-based embeddings
2048 (e.g., ResNet-50)
EfficientNet (e.g., B0-B7)
1280 to 2560
DINOv2 (ViT-G)
1536+
MobileNet
1024 (varies by config)

Vector Database Cost Analysis

Compare operational costs across vector databases based on query performance from our benchmark, this is just an estimated model

Query Volume
Number of queries to analyze (100K - 10M)
100K10M
Pricing Model
Cost per 10,000 queries batch
Total batches: 100
Cost Comparison
Estimated operational costs including performance overhead
Performance & Cost Breakdown
Detailed analysis of query performance and associated costs
DatabaseQPSLatencyEstimated Costvs Cheapest
AntarysMost Cost-Effective
75.713.2ms$2.0004Baseline
Qdrant
26.837.3ms$2.0425
+$0.0421
(1.0x)
Milvus
14.569.1ms$2.098
+$0.0977
(1.0x)
Chroma
9.1109.9ms$2.1694
+$0.1691
(1.1x)
Pinecone
1.7585.1ms$3.0002
+$0.9998
(1.5x)
Cost = (Queries ÷ 10K) × Price per batch × Performance efficiency multiplier (latency-based overhead up to 50%)
SMART
DIMENSION AWARE

Built to handle any embeddings We fine tuned our vector database to specific dimensions for maximum performance

placeholder
NATIVE
DEPLOY ANYWHERE

We built the database from the ground up So you can make AI work anywhere

AHNSW
ALGORITHMIC EFFICIENCY

Strategic Algorithmic Optimisation For Unbeatable Performance and Accuracy

Benchmark Summary

Comprehensive performance comparison across vector databases

MetricAntarysChromaPineconeQdrantMilvus
Write Performance
Throughput (vectors/sec)
2220
vec/s
2214
vec/s
2507
vec/s
743
vec/s
1961
vec/s
Avg Batch Time (ms)
450.4
ms
451.7
ms
39.9
ms
1345.6
ms
509.9
ms
P99 Latency (ms)
493.2
ms
475.4
ms
60.0
ms
1461.1
ms
830.6
ms
Read Performance (1000 queries)
Throughput (queries/sec)
75.7
q/s
9.1
q/s
1.7
q/s
26.8
q/s
14.5
q/s
Avg Query Time (ms)
13.2
ms
109.9
ms
585.1
ms
37.3
ms
69.1
ms
P99 Latency (ms)
19.2
ms
240.7
ms
861.6
ms
63.3
ms
337.8
ms
Accuracy
Recall@100 (%)
100.00
100.00
100.00
100.00
99.96
Recall Dev
0.0000
0.0000
0.0000
0.0000
0.0028

Antarys was benchmarked with HTTP/1.1 while every other system used gRPC, eventual shift to gRPC will greatly increase overall performance by a factor of 1.6x (expected)

FAQ

Common Questions & Answers

Find out how did we benchmark and what optimisations led to this results

1

Why didn't we use other open source benchmarks?

Other famous tests like qdrant's benchmark or zilliz VectorDBBench doesn't really support async python, we had to use loop.run_until_complete which made our python runtime run slow and until we have a working gRPC client and server, we will be using this simple benchmark which uses dbpedia dataset from huggingface

2

What hardware did we use to run the benchmarks

An Apple M1 Pro with 16GP RAM and 512GB SSD with 8 cores (6 performance and 2 efficiency)

3

What optimisations make Antarys compete with industry standard databases?

Antarys was built from the ground up. And not only did we reimagine how HNSW indexing works, we built an architecture around it to support our algorithm to run faster, consume less memory and yield faster responses with accuracy.

4

What is the A-HNSW Algorithm?

AHNSW stands for Async HNSW, the proprietary algorithm that powers our vector database. At the heart of AHNSW, we have made critical optimisations by leveraging parallel execution without thread lock, allowing much faster graph navigation while producing standard accuracy for your LLM apps

5

When will we release Antarys Cloud?

We are a young company working continuously to improve our vector databse engine, Antarys's vector database is a byproduct of our AHNSW and we are constantly researching on making AI faster, accessible and native, we are expecting to see Antarys Cloud by Jan 2026!

6

Mission of Antarys

Our mission is to make AI hardware and software accessible to everyone and everywhere, make AI deployable and run anywhere. There was a time when having computers at home was nothing short of a vision, a bold idea. We envision the same thing for AI. The heart of Antarys is to make AI run anywhere with our local first party AI infrastructre

Get Started in Minutes

Antarys can be self hosted and work offline with your existing LLM stack, get started now for your system with this one command, requires curl

curl -fsSL http://antarys.ai/start.sh | bash

Windows

Universal Windows binary for amdx64 platforms

Coming soon

Free for offline forever.


  • Antarys Database
  • Vertically Scaled by default
  • Supports every feature
  • Best for your hobby LLM project

Apple Intel

Apple binaries for Intel and ARM mac for everybody

Coming soon

Free for offline forever.


  • Antarys Database
  • Vertically Scaled by default
  • Supports every feature
  • Best for your hobby LLM project

Apple ARM

Apple binaries for Intel and ARM mac for everybody

Manual Install Guide

Free for offline forever.


  • Antarys Database
  • Vertically Scaled by default
  • Supports every feature
  • Best for your hobby LLM project

Linux x64

Universal Linux binary for amdx64 machines

Manual Install Guide

Free for offline forever.


  • Antarys Database
  • Vertically Scaled by default
  • Supports every feature
  • Best for your hobby LLM project
AI APP
LLM MODELS
RAG PIPELINES
CLOUD DEPLOY
Adib

Want to work with us?

Feel free to book a call

Antarys AIAntarys AIAntarys AI