The market & the fit

Who needs this, and why now.

A plain-language guide to the market CRBRL serves: what a vector database is, how it differs from the databases that came before it, and where a compression-native one pays off across real industries.

From the history of storage to a working use case in your sector.

Start with the market → Jump to your industry ↓

▲ 01 / The marketsandbox → production

AI is moving from the sandbox into production — and storage is the line that scales the wrong way.

For two years most companies kept AI in trials: small experiments, limited data, a handful of users. That phase is ending. Teams are now putting AI into the products and workflows that customers and staff rely on every day, which turns every document, conversation, and record into searchable memory.

Mass adoption means real scale, and at scale the cost of holding that memory grows faster than anything else. It is the line that climbs the wrong way as a project succeeds — and the one that decides which projects keep running.

~$170B

Projected AI-infrastructure market by 2029 — the layer a vector database sits in.

~31%

Compound annual growth rate of that AI-infrastructure market.

55–80%

Share of a typical vector-database bill that is storage and the I/O around it.

// the storage share is precisely where CRBRL acts — it reduces the data itself, then keeps it fast

▲ 02 / The road herea short history of data storage

How we got from rows and columns to vector search.

Each generation of database answered the question its era was asking. Vector databases are the latest step, built for a new kind of question: "what is similar in meaning?"

1970s →

Relational / SQL databases

Data is organised into tables of rows and columns with a fixed schema. Queries are exact: find the row where the customer ID equals this value, or sum the orders in this date range. Built for structured records and precise, repeatable answers — still the backbone of most business systems.

2000s →

NoSQL — document & key-value stores

As web applications grew, teams needed to store flexible, semi-structured data and scale it horizontally. Document stores keep records as JSON-like documents; key-value stores map a key directly to a blob. The trade was a looser schema and simpler lookups in exchange for scale and developer speed.

2000s →

Search engines — inverted indexes

To search large bodies of text, search engines build an inverted index: a map from each word to the documents that contain it. This makes full-text and keyword search fast. It matches the words you typed — but it does not understand that "car" and "automobile" mean the same thing.

2020s →

Vector databases — embeddings & similarity

AI models turn text, images, and audio into embeddings: lists of numbers that place items with similar meaning near each other. A vector database stores those embeddings and searches by closeness, so it can find results that mean the same thing even when the words differ. This is the layer behind semantic search, recommendations, and AI memory.

▲ 03 / The ideawhat a vector database is

A vector database is AI memory — searched by meaning.

An AI model reads a piece of content and produces an embedding: a long list of numbers — a high-dimensional vector — that captures its meaning. Items that mean similar things land close together in that space; items that mean different things land far apart. A common embedding has 1,536 dimensions.

To answer a question, the database embeds the query the same way and looks for the stored vectors nearest to it. This is nearest-neighbour search. At scale it is usually done as approximate-nearest-neighbour (ANN) search, which trades a tiny amount of exactness for a large gain in speed — close enough, far faster.

01 · Embed

Turn content into vectors

Each document, message, or image becomes a vector that encodes its meaning. This is the memory the AI will later search.

02 · Store

Hold the vectors

The database keeps every vector, organised so that similar ones can be found quickly without scanning everything.

03 · Search

Find what is closest

A query becomes a vector too; the database returns the nearest matches by meaning — the foundation of semantic search and AI memory.

▲ 04 / The difference, part 1vs. relational databases

Similarity, not exact match. Meaning, not rows.

A relational database and a vector database answer different questions. They often sit side by side — one for the facts, one for the meaning.

Relational / SQL

Exact answers over structured data

Stores structured rows and columns with a fixed schema.
Queries match exactly: equals, greater-than, joins on keys.
Ideal for transactions, ledgers, inventory — anything that must be precise.
Cannot tell that two differently worded sentences mean the same thing.

Vector

Similarity over unstructured meaning

Stores embeddings — the meaning of unstructured text, images, and audio.
Queries match by closeness: find the most similar, not the exact value.
Ideal for semantic search, recommendations, and AI memory.
Returns relevant results even when the wording is completely different.

▲ 05 / The difference, part 2vs. graph databases

Graph databases model explicit relationships. Vector databases model meaning.

A graph database stores entities as nodes and the relationships between them as edges — "this person works at that company," "this account paid that account." Its strength is traversal: following those edges to answer questions about paths, connections, and the shape of a network.

Where a vector database asks "what is similar?", a graph database asks "what is connected, and how?" The two are not competitors so much as complements — they are good at different questions.

Graph excels at

Traversal, paths & relationships

Following chains of edges quickly: who is connected to whom, the shortest path between two entities, which records sit two or three hops away from a starting point.

Ideal use cases

Where explicit links matter

Knowledge graphs, fraud rings detected by shared connections, recommendations driven by relationships, and network or dependency analysis — wherever the link itself is the answer.

// a graph database is the right tool when the relationship is explicit and known; a vector database is right when the relevant similarity is implicit in the content

▲ 06 / Using bothgraph + vector hybrid retrieval

The richest answers use similarity and relationships together.

Many real questions need both kinds of retrieval. Vector search finds the passages that are relevant by meaning; a graph supplies the explicit relationships that connect them — which document cites which, which account is linked to which, which step depends on which.

Combining them grounds an answer in two ways at once: it is on-topic because it is semantically similar, and it is well-supported because the relationships hold up. That is the basis of hybrid retrieval — and CRBRL ships vector, graph, and hybrid search so teams can use the right one, or both.

Semantic layer

Find what is relevant

Vector similarity surfaces the content that matches the question's meaning, even when the wording differs.

Relationship layer

Connect what is found

The graph adds explicit links between those results — citations, ownership, dependencies — so the answer is traceable.

Together

Grounded, richer answers

Hybrid retrieval blends both, giving answers that are both on-topic and supported by real, checkable relationships.

▲ 07 / The codecsTurboQuant + RaBitQ

Two compression codecs, selectable — because one size rarely fits every workload.

A codec is the method that compresses a vector down to a smaller form while keeping it searchable. CRBRL ships two, and the operator chooses per workload.

Codec · TurboQuant

Peer-reviewed compression

Built on peer-reviewed mathematics (arXiv:2504.19874).
Stores roughly 8× fewer bytes per vector at ≈0.98 cosine fidelity.
Searches in the compressed domain — no slow unpack step on every query.
No retraining or calibration; it works on the first record loaded.

Codec · RaBitQ

A robust, well-studied alternative

A widely studied quantisation method offered alongside TurboQuant.
Gives operators a second proven option to match different data.
Lets teams pick the codec that behaves best on their workload.
Broadens robustness across the range of vectors teams actually store.

// why both: real workloads differ — having a selectable codec means operator choice and steadier results across the spread of data, rather than one fixed trade-off for everyone

▲ 08 / The feature setfeature → benefit

Every capability, and what it does for you.

The full feature set, paired with the plain benefit each one delivers.

Architecture

Compression-native design

Data is stored compressed from the moment it arrives, not bolted on afterwards — so the saving applies to every record, copy, and backup by default.

Compressed-domain search

Queries run directly on compressed data, with no slow decompress step on every search. That is the part older bolt-on approaches never solved.

Density

8× fewer bytes per vector

A 1,536-dimension float32 vector drops from 6,144 B to 768 B. At 100M vectors that is 614 GB down to 77 GB — the same memory in a fraction of the footprint.

Fidelity

≈0.98 cosine fidelity

Retrieval quality stays effectively identical to full precision, so the cost saving does not come at the expense of the answers.

Adoption

Zero retraining or calibration

It works on the first record you load — no model to retrain, no calibration pass, no migration project before you see the benefit.

Compatibility

Chroma-compatible API

A standalone API that matches the Chroma interface, so teams can adopt CRBRL without rewriting how their application talks to the database.

Compatibility

crbrl-pg Postgres extension

Run the same compression-native engine inside Postgres, keeping AI memory next to the relational data teams already manage.

Retrieval

Semantic, full-text & hybrid search

Search by meaning, by keyword, or both at once — so each query uses the method that fits it best, in one system.

Tiering

Hot / warm / cold tiering

Place data across fast and economical tiers by how often it is used, controlling cost without giving up reach over the full archive.

Flexibility

Provider-neutral embeddings

Bring vectors from any embedding provider. You are not locked to one model, and you can change providers without leaving the database.

Operations

Observability & snapshots

See how the system is behaving and capture point-in-time snapshots, so operating it at scale stays predictable and recoverable.

Multi-tenancy

Multi-tenancy & RBAC

Isolate customers cleanly and control access by role — the basis for serving many tenants from one deployment at healthy margins.

Governance

Audit logs

A record of who did what and when, supporting the accountability that regulated and security-reviewed environments require.

Codecs

Two selectable codecs

Choose TurboQuant or RaBitQ per workload, matching the compression method to the data instead of accepting one fixed trade-off.

▲ 09 / Use casespick your industry

Find your sector. See the use case, the pain, and the benefit.

Choose an industry to see a typical workload, the problem it runs into, and what changes with CRBRL.

Banking & Finance

Searchable records across long retention

With CRBRL

▲ Where to next

You have the shape of it. Now take it to your own numbers — or under the hood.

For your workload

Talk it through

Bring your data sizes and retention needs. We will walk through what compression-native storage does to your specific AI-memory bill, and where it fits in your stack.

Discuss your workload →

For the engineers

See the technology

The codecs, the compressed-domain search, the Chroma-compatible API and crbrl-pg extension — the mechanics behind the figures on this page.

See the technology →