ClickHouse DB::Exception: Table schema differs too much

The "DB::Exception: Table schema differs too much" error is raised when ClickHouse detects that a proposed schema change deviates too significantly from the current table structure. The error code TABLE_DIFFERS_TOO_MUCH typically appears with replicated tables when a replica's local schema diverges so far from the schema stored in ZooKeeper that ClickHouse refuses to automatically reconcile the difference.

Impact

The ALTER TABLE or replica synchronization operation fails. The table continues to function with its current schema, but the desired change is not applied. In a replicated setup, the affected replica may fall out of sync with others until the schema discrepancy is resolved.

Common Causes

Replica schema desynchronization -- a replica was offline during multiple ALTER TABLE operations, and now its schema is several steps behind.
Manual metadata edits -- someone modified table metadata files directly on disk, creating a mismatch with ZooKeeper.
Restoring a replica from an old backup -- the backed-up schema does not match the current cluster schema.
Incompatible ALTER operations -- attempting a change that ClickHouse considers too drastic to apply safely (e.g., changing the primary key or ORDER BY expression in certain contexts).
Different ClickHouse versions across replicas -- version skew can lead to different interpretations of the schema.

Troubleshooting and Resolution Steps

Compare the local schema with the ZooKeeper schema. Get the local schema:

SHOW CREATE TABLE my_database.my_table;

Then check what ZooKeeper has:

SELECT value FROM system.zookeeper
WHERE path = '/clickhouse/tables/shard1/my_table' AND name = 'metadata';

If the local replica is behind, try forcing a schema sync:
```
SYSTEM RESTART REPLICA my_database.my_table;
```
This reinitializes the replica's connection to ZooKeeper and may resolve minor discrepancies.
For more significant divergence, re-create the replica. Drop the local table and let it re-replicate from healthy replicas:
```
DROP TABLE my_database.my_table SYNC;
CREATE TABLE my_database.my_table (...) -- use the schema from a healthy replica
ENGINE = ReplicatedMergeTree(...)
ORDER BY ...;
```
The new replica will fetch data from other replicas automatically.

If the issue is a blocked ALTER, break it into smaller steps. Instead of one large schema change, apply incremental modifications:

ALTER TABLE my_database.my_table ADD COLUMN col1 String;
ALTER TABLE my_database.my_table ADD COLUMN col2 UInt64;
-- Instead of trying to change everything at once

Ensure all replicas are running the same ClickHouse version before performing schema changes:
```
SELECT host_name, version()
FROM system.clusters
WHERE cluster = 'my_cluster';
```

Check the replication queue for stuck entries:

SELECT * FROM system.replication_queue
WHERE table = 'my_table'
ORDER BY create_time;

Best Practices

Keep all replicas on the same ClickHouse version to prevent schema interpretation differences.
Apply ALTER TABLE operations when all replicas are online and healthy.
Avoid editing table metadata files on disk -- always use DDL statements.
When restoring from backups, verify the schema matches the current cluster state before reattaching.
Monitor the replication queue for long-running or stuck entries that could indicate schema drift.

Frequently Asked Questions

Q: Can I force ClickHouse to accept a large schema difference?
A: Not directly. The safest approach is to drop the divergent replica and recreate it with the correct schema. ClickHouse will then replicate the data from healthy replicas.

Q: How do I prevent this error when upgrading ClickHouse across a cluster?
A: Upgrade all replicas within a short time window. Avoid running ALTER TABLE statements while replicas are on different versions.

Q: Does SYSTEM RESTART REPLICA cause downtime?
A: Briefly, yes. The table is unavailable for a short period while the replica reinitializes. On a healthy replica, this is typically a matter of seconds.

Q: What if all replicas have divergent schemas?
A: This is a more serious situation. You may need to pick one replica as the source of truth, update its schema in ZooKeeper, and then resync the others. This is an advanced operation best done with guidance from the ClickHouse documentation or community.