Skip to content

Instantly share code, notes, and snippets.

@surister
Last active November 21, 2024 11:16
Show Gist options
  • Select an option

  • Save surister/33a5768fe5c5b91179f2f20424e1f856 to your computer and use it in GitHub Desktop.

Select an option

Save surister/33a5768fe5c5b91179f2f20424e1f856 to your computer and use it in GitHub Desktop.
FOSDEM 2025

Hybrid Index: The secret to blazingly fast queries on any data structure @ CrateDB

One of the most effective ways to improve query performance is through indexing. At CrateDB, we said, what's faster than one index? everything indexed! - We took the bold approach: indexing every column by default. But we didn't stop there—we leverage multiple data structures for every indexed column. At query time, CrateDB intelligently selects the optimal index based on the query type, enabling faster and more efficient results.

But you probably have many questions. Does this actually work? How did you do it? Isn't there a performance penalty on write speed? And updates? How about storage size?

In this talk we will tell you all about Hybrid Idexes, one of the fundamental aspects of CrateDB: an Open-source distributed SQL Database for Real-Time Analytics and Hybrid Search.


There are many ways of improving your query performance like building indexes or choosing the right data structure, in CrateDB we solve all this complexity for you -- We took the bold approach: indexing every column by default and creating optimized columnar and row store at insert time. When you query, CrateDB will leverage the index and either use the columnar or row store for the most optimal performance; no configuration, only fast and efficient queries from the get go!

But you probably have many questions. Does this actually work? How did you do it? Isn't there a performance penalty on write speed? And updates? How about storage size?

In this talk we will tell you all about Hybrid Index, one of the fundamental aspects of CrateDB: an Open-source distributed SQL Database for Real-Time Analytics and Hybrid Search.


Hybrid Index: The secret to blazingly fast queries on any data structure @ CrateDB

Improving query performance often involves tasks like building indexes or selecting the right data structures. With CrateDB, we eliminate that complexity for you. We took the bold approach: CrateDB indexes every column by default and creates both columnar and row storage during data insertion. When you run a query, CrateDB automatically leverages the index and selects the most efficient storage format -- ensuring optimal performance. No configurations needed, just fast and efficient queries right out of the box!

But you probably have many questions. Does this actually work? How did you do it? Isn't there a performance penalty on write speed? And updates? How about storage size?

In this talk we will tell you all about Hybrid Index, one of the fundamental aspects of CrateDB: an Open-source distributed SQL Database for Real-Time Analytics and Hybrid Search.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment