The Complete Guide to Peer-to-Peer Data Replication in Harper

Replication is the engine that powers Harper's global data fabric. It transforms distributed nodes into a seamless network of synchronized peers—no leader nodes, no single points of failure, just fast, reliable, and secure data movement.

This guide breaks down the architecture, shows you how to configure it, and explains what makes it special, with practical examples and code along the way.

‍

The Basics: What is Harper Replication?

Replication is Harper's mechanism for creating and maintaining multiple copies of data across distributed nodes. This ensures low-latency reads, high availability, and fault tolerance through eventual consistency.

Cluster: A group of Harper instances connected through replication.
Node: An individual Harper instance (VM, container, IoT device, etc.).
Peer-to-Peer: Every node can both publish and subscribe. No leader, no coordinator.

Real-World Example

A retailer deploys Harper on distributed servers near every user population center. Product catalog and inventory updates made in Denver are replicated in real-time to Chicago and Boston, ensuring all locations share consistent data without routing everything through a central server.

‍

How Harper Keeps Clusters in Sync

Harper's support for high-throughput, low-latency, geographically distributed deployments is made possible by three key ingredients:

‍

Audit Log Streaming

Rather than duplicating full data sets, Harper logs each change to an audit log, a compact, transaction-level journal optimized for minimum resource use.

If Node A inserts { id: 1, name: "Tiger" } into the animals table, that change becomes an audit entry that is replicated to other nodes.

‍

The hdb_nodes Table

Cluster configuration is driven by a system table called hdb_nodes. Every node has a record with its hostname, connection URL, and metadata. This table is replicated, so when a new node joins, it inherits the full topology of the cluster.

‍

Mutual TLS for Security

All connections are secured using mutual TLS (mTLS). Each node must present a certificate matching its hostname. Certificates are validated against trusted certificate authorities and cross-checked with hdb_nodes.

‍

A client writes data to Node A.
Node A records that transaction in its audit log.
That log entry is replicated securely to peers via websockets
Peers apply the transaction to their local table.

‍

How to Configure Replication

Option 1: Declarative YAML Config

The most robust and production-ready approach:

replication:
  hostname: node-a.harper.local
  routes:
    - node-b.harper.local
    - node-c.harper.local

When Harper initializes, it uses this process to establish connections and populate the hdb_nodes table. Note: for this to function correctly, valid certificates must be configured to enable the mTLS connection.

‍

Option 2: Dynamic API Call

For quick tests or automation:

{'

curl -X POST https://your-harperdb-endpoint.com \
  -u admin:password \
  -H "Content-Type: application/json" \
  -d '{
    "operation": "add_node",
    "hostname": "server-two",
    "verify_tls": false,
    "authorization": {
       "username": "admin",
        "password": "password"
     }
  }'

‍

Control and Flexibility

Harper's replication system offers powerful and granular control over how data is shared across nodes. Whether you're managing compliance constraints, optimizing for performance, or just fine-tuning your architecture, these capabilities let you shape replication to your exact needs.

Database Filtering

By default, Harper replicates all user-created databases. However, you can override this behavior in the harperdb-config.yaml file by explicitly listing only the databases you want replicated. This is useful when some data is sensitive, irrelevant across regions, or simply not needed outside of a specific node.

replication:
  databases:
    - customers
    - analytics

‍

Table Filtering

You can also control replication at the table level. Within your GraphQL schema, set the replicate: false flag to prevent specific tables from being included in replication.

type Product @table(replicate: false) {
 id: ID!
 name: String!
}

This is ideal for caching tables, temporary storage, or data that is large and local-only.

Directionality with Subscriptions

For more advanced scenarios, the add_node and set_node API calls support directional replication. This means you can:

Subscribe to a table without publishing.
Publish to a table without subscribing.
Do both (the default behavior).

This opens the door for sophisticated topologies, like selective upstream syncing or isolating write-heavy tables to specific nodes.

"subscriptions": [
  { "database": "data", "table": "metrics", "publish": false, "subscribe": true }
]

‍

Real-World Example

A multi-tenant SaaS platform deploys Harper nodes in major cloud regions: US-East, EU-West, and APAC. Each edge node holds customer data specific to that geography. Using table-level subscriptions, the system replicates only the relevant tenants’ records to each region. This minimizes cross-region bandwidth costs, improves latency for customers, and ensures compliance with local data residency regulations.

For instance, a Harper node in Frankfurt might only subscribe to customer_eu and invoices_eu, while ignoring customer_us and invoices_apac. Meanwhile, a central analytics node can be configured to subscribe to all tenant databases for cross-region reporting—without publishing any of its own data back.

This kind of targeted replication is what enables Harper to power modern, distributed apps that are both globally performant and locally compliant.

‍

Managing Node Lifecycles

Harper's replication system is built to accommodate the dynamic nature of distributed systems. Whether you're scaling up, handling failure recovery, or optimizing storage, Harper handles the lifecycle of each node with resilience and flexibility.

‍

New Nodes

When a new node is added to a cluster, Harper performs an initial sync, downloading the full contents of the relevant databases from its first connected peer. This ensures the new node becomes a fully consistent member of the cluster without requiring manual intervention.

This process is automatic and includes replicating database and tables and hdb_nodes entries, allowing the new node to discover and connect with other peers in the cluster.

‍

`startTime` Option

If you don’t want a new node to sync the entire history, you can specify a startTime parameter. This tells Harper to only replicate changes from that point forward.

Use case: spinning up a temporary analytics node that only cares about current and future data, not historical records.

replication:
  hostname: node-a.harper.local
  routes:
    - hostname: node-b.harper.local
      startTime: 2024-02-06T15:30:00Z

‍

Offline Resync

If a node goes offline—due to a network issue, reboot, or planned maintenance—it will automatically resync upon reconnecting. Harper compares audit log sequences and replays any missed transactions to bring the node up to date.

This happens without needing to reinitialize or manually trigger sync processes, making Harper well-suited for edge or intermittently connected environments.

‍

Audit Log Pruning

Over time, the audit log can grow significantly, especially in high-write environments. Harper allows you to purge old entries to free up storage. If a new or returning node requests a transaction that has already been purged, Harper will fall back to a full table copy to fill in the gaps.

Admins can configure log retention periods or trigger pruning operations based on time or size thresholds.


logging:
  auditRetention: 1d
  maxSize: 100K

‍

Conflict Resolution

Replication conflicts are rare but inevitable in distributed systems. Harper avoids duplicate inserts by identifying records with the same primary key and only applying the latest transaction using a last-write-wins strategy unless a Conflict-free Data Replication Type (CRDT) is used.

This ensures eventual consistency even if two nodes attempt to write the same record simultaneously.

‍

Monitoring and Troubleshooting

Need a quick health-check on your Harper cluster?

Call the cluster_status operation, and Harper returns a tidy JSON snapshot of every node—showing current WebSocket connections, subscribed tables, per-database latency, and a flag that tells you if clustering is enabled. This single endpoint gives ops teams immediate, actionable insight into replication health and connection quality, making it easy to spot broken links or lagging databases before they disrupt your app.

{
   "operation": "cluster_status"
}

‍

What This Enables

Harper’s replication architecture makes global low-latency experiences possible. Whether you're building applications at the edge, running global infrastructure, or scaling dynamically without downtime, Harper gives you the flexibility and resilience to do it right.

Edge Intelligence

Harper is well-suited for deployment on resource-constrained, distributed hardware commonly found in environments like warehouses, retail locations, delivery vehicles, and remote sensor networks.

Each node holds just the data it needs.
If it loses connection, it keeps running locally.
When it's back online, it syncs up automatically—no manual recovery needed.

Real-World Example: A retail chain rolls out Harper to POS devices in 300 stores. Each store gets real-time pricing updates and logs its own sales data locally. Even if the network goes down, the POS system keeps working. Once the connection is restored, the sales data syncs back to HQ without missing a beat.

‍

Multi-Region Deployments

Need to serve customers across continents while respecting data sovereignty laws and reducing latency? Harper has you covered.

Securely connect regions with mTLS.
Control exactly what gets replicated where.
Keep data close to users and inside legal boundaries.

Real-World Example: A SaaS company runs Harper clusters in the US, EU, and APAC. EU customer data never leaves the Frankfurt region to stay GDPR-compliant. Meanwhile, global metrics are rolled up to a central analytics cluster. It's fast, efficient, and compliant—all at once.

‍

Zero-Downtime Scale-Out

Scaling your infrastructure should be smooth—not stressful.

Add new nodes without restarts.
New nodes join and auto-sync with the cluster without manual setup.
Just declare them in the config.

Real-World Example: On Black Friday, an e-commerce platform adds five new edge nodes to handle the traffic spike. Each one boots up, connects securely to the cluster, downloads the necessary data, and starts serving traffic—no downtime, no disruption.

‍

Try It Yourself

Set up 3 Harper nodes with the YAML config.
Insert data into one—watch it appear on others.
Simulate an offline node, then bring it back.

‍

Final Thought

Harper’s peer-to-peer replication, declarative design, and secure architecture make it a powerful tool for building distributed systems that feel like one.

Once you understand how it works, you’ll start designing systems that let Harper do the heavy lifting, so your data is always where it needs to be, fast and safe.

‍

Our Story

Dev Center

The Complete Guide to Peer-to-Peer Data Replication in Harper

The Basics: What is Harper Replication?

Real-World Example

How Harper Keeps Clusters in Sync

The hdb_nodes Table

Mutual TLS for Security

How to Configure Replication

Option 1: Declarative YAML Config

Option 2: Dynamic API Call

Control and Flexibility

Database Filtering

Table Filtering

Directionality with Subscriptions

Real-World Example

Managing Node Lifecycles

New Nodes

`startTime` Option

Offline Resync

Audit Log Pruning

Conflict Resolution

Monitoring and Troubleshooting

What This Enables

Edge Intelligence

Multi-Region Deployments

Zero-Downtime Scale-Out

Try It Yourself

Final Thought

Dev Center

The Complete Guide to Peer-to-Peer Data Replication in Harper

The Basics: What is Harper Replication?

Real-World Example

How Harper Keeps Clusters in Sync

The hdb_nodes Table

Mutual TLS for Security

How to Configure Replication

Option 1: Declarative YAML Config

Option 2: Dynamic API Call

Control and Flexibility

Database Filtering

Table Filtering

Directionality with Subscriptions

Real-World Example

Managing Node Lifecycles

New Nodes

startTime Option

Offline Resync

Audit Log Pruning

Conflict Resolution

Monitoring and Troubleshooting

What This Enables

Edge Intelligence

Multi-Region Deployments

Zero-Downtime Scale-Out

Try It Yourself

Final Thought

`startTime` Option