What is Dnaerys Variant Store?

Dnaerys is a state-of-the-art, distributed, horizontally scalable, in-memory, low-latency genome variant store: a specialised analytical DBMS designed to store genetic variations and execute genomic algorithms at scale.


Why a Specialised Variant Store?

On a small scale, genomic data can be managed well in almost any modern DBMS. However, as datasets grow to hundreds of thousands of whole-genome sequencing (WGS) samples, DBMS response times become the primary bottleneck for downstream applications.

Historically, horizontally scalable variant stores relied on general-purpose, disk-based architectures (Data Warehouses and Data Lakes). This disk-oriented design, often coupled with remote storage, puts them at a three-order-of-magnitude disadvantage compared to purpose-built, in-memory systems.
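One way to read the three-order-of-magnitude figure, assuming it refers to raw data access latency, is a back-of-the-envelope comparison of storage media. The numbers in this sketch are commonly cited order-of-magnitude estimates, not Dnaerys measurements, and the names `LATENCY_SECONDS` and `orders_of_magnitude_slower` are made up for illustration.

```python
import math

# Rough, commonly cited access latencies in seconds.
# These are order-of-magnitude assumptions, not Dnaerys benchmarks.
LATENCY_SECONDS = {
    "dram": 100e-9,           # main-memory reference, about 100 ns
    "nvme_ssd": 100e-6,       # random read from local flash, about 100 us
    "remote_storage": 10e-3,  # networked or object-store read, about 10 ms
}


def orders_of_magnitude_slower(medium: str, baseline: str = "dram") -> float:
    """Return log10 of the latency ratio between `medium` and `baseline`."""
    return math.log10(LATENCY_SECONDS[medium] / LATENCY_SECONDS[baseline])


# Gap of every medium relative to DRAM, rounded to one decimal place.
gaps = {m: round(orders_of_magnitude_slower(m), 1) for m in LATENCY_SECONDS}
print(gaps)
```

Under these assumed figures, local flash sits about three orders of magnitude behind main memory, and remote storage is slower still. Real gaps depend on hardware, access pattern, and caching.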

Thesis

General-purpose DBMS architectures are sub-optimal for low-latency, scalable variant stores.


I. Query Plan Optimization and Code Generation
Specialized systems avoid the heavy execution stages mandatory in general-purpose engines:

II. Storage Model Efficiency
Traditional Relational DBMS models face significant scaling hurdles in genomics:

III. Primary Data Allocation and Access Latency


The Solution

Dnaerys provides a specialised environment for high-throughput genomics:


Design Principles

Dnaerys is engineered for predictable latency and high availability in distributed environments.

I. Distributed Architecture

II. High-Performance Execution & Specialization

III. Resilience & The PACELC Model
Dnaerys operates primarily as a PA/EC system (Availability over Consistency during partitions; Consistency over Latency during normal operation).
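The PACELC label can be read as two independent choices: what to favour during a network partition, and what to favour otherwise. The sketch below only encodes that classification. It is not Dnaerys code, and the names `Choice`, `PacelcProfile`, and `PA_EC` are hypothetical.

```python
from dataclasses import dataclass
from enum import Enum


class Choice(Enum):
    AVAILABILITY = "availability"
    CONSISTENCY = "consistency"
    LATENCY = "latency"


@dataclass(frozen=True)
class PacelcProfile:
    """A PACELC classification: one choice under Partition, one Else."""

    on_partition: Choice  # Availability or Consistency
    otherwise: Choice     # Latency or Consistency

    def label(self) -> str:
        """Render the conventional short form, e.g. 'PA/EC'."""
        p = "A" if self.on_partition is Choice.AVAILABILITY else "C"
        e = "L" if self.otherwise is Choice.LATENCY else "C"
        return f"P{p}/E{e}"

    def priority(self, partitioned: bool) -> Choice:
        """Return which property is favoured in the given network state."""
        return self.on_partition if partitioned else self.otherwise


# The profile described in the text: availability during partitions,
# consistency during normal operation.
PA_EC = PacelcProfile(
    on_partition=Choice.AVAILABILITY,
    otherwise=Choice.CONSISTENCY,
)
```

Calling `PA_EC.priority(partitioned=True)` yields `Choice.AVAILABILITY`, and `PA_EC.priority(partitioned=False)` yields `Choice.CONSISTENCY`, matching the parenthetical description above.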

IV. Cloud-Scale Operations