Infinity
Infinity is an AI-native, open‑source database designed for LLM (Large Language Model) applications. It delivers ultra-fast hybrid search—combining dense/sparse vectors, tensor, and full-text—on structured and unstructured data. It supports RAG (Retrieval‑Augmented Generation) workflows while ensuring strong performance and scalability.
Incredibly Fast
~0.1 ms query latency with 15 K+ QPS on million-row vector datasets.
Hybrid Search (Vector + Full‑text + Tensor)
Supports dense embedding, sparse embedding, full-text, tensor search, plus fused reranking (e.g., RRF, ColBERT).
Rich Data Types & Indexes
Handles strings, numerics, vectors (multi-vector too), tensors. Built‑in indexes: columnar, ANN (IVF/HNSW), inverted (BM25).
AI‑Native Architecture
Supports structured queries, vector retrieval, full-text search, and filtering in unified execution via a push‑based pipeline.
Ease-of-Use
Single-binary deployment, no external dependencies, and intuitive Python and HTTP APIs.
ACID-Compliant Engine
Columnar storage with full ACID support, zonemap/Bloomfilter, secondary indexes, background GC, WAL replay.
from infinity import connect, NetworkAddress
inf = connect(NetworkAddress("127.0.0.1", 23817))
db = inf.get_database("default_db")
tbl = db.create_table("my_table", {
"num": {"type": "integer"},
"body": {"type": "varchar"},
"vec": {"type": "vector,4,float"}
})
tbl.insert([{"num":1,"body":"hi","vec":[0.1,0.2,0.3,0.4]}])
res = tbl.output(["*"]).match_dense("vec",[0.1,0.2,0.3,0.4],"float","ip",5).to_pl()
print(res)