Spanner lets you make schema updates with no downtime. You can update the schema of an existing database in several ways:
In the Google Cloud console
Submit an
ALTER TABLEcommand on the Spanner Studio page.To access the Spanner Studio page, click Spanner Studio from the Database overview or Table overview page.
Using the
gcloud spannercommand-line toolSubmit an
ALTER TABLEcommand by using thegcloud spanner databases ddl updatecommand.Using the client libraries
Using the
projects.instances.databases.updateDdlREST APIUsing the
UpdateDatabaseDdlRPC API
Supported schema updates
Spanner supports the following schema updates of an existing database:
- Add or drop a named schema.
- Create a new table. Columns in new tables can be
NOT NULL. - Delete a table, if no other tables are interleaved within it, and it has no secondary indexes.
- Create or delete a table with a foreign key.
- Add or remove a foreign key from an existing table.
- Add a non-key column to any table. New non-key columns cannot be
NOT NULL.- Drop a non-key column from any table, unless it is used by a secondary index, foreign key, stored generated column, or check constraint.
- Add
NOT NULLto a non-key column, excludingARRAYcolumns. - Remove
NOT NULLfrom a non-key column. - Change a
STRINGcolumn to aBYTEScolumn or aBYTEScolumn to aSTRINGcolumn. - Change a
PROTOcolumn to aBYTEScolumn or aBYTEScolumn to aPROTOcolumn. - Change the proto message type of a
PROTOcolumn. - Add new values to an
ENUMdefinition and rename existing values usingALTER PROTO BUNDLE. - Change messages defined in a
PROTO BUNDLEin arbitrary ways, provided that modified fields of those messages are not used as keys in any table and that the existing data satisfies the new constraints. - Increase or decrease the length limit for a
STRINGorBYTEStype (including toMAX), unless it is a primary key column inherited by one or more child tables. - Increase or decrease the length limit for an
ARRAY<STRING>,ARRAY<BYTES>, orARRAY<PROTO>column to the maximum allowed. - Enable or disable commit timestamps in value and primary key columns.
- Add or remove a secondary index.
- Add or remove a check constraint from an existing table.
- Add or remove a stored generated column from an existing table.
- Construct a new optimizer statistics package.
- Create and manage views.
- Create and manage sequences.
- Create database roles and grant privileges.
- Set, change, or drop the default value of a column.
- Change the database options (
default_leaderorversion_retention_periodfor example). - Create and manage change streams.
- Create and manage ML models.
Unsupported schema updates
Spanner doesn't support the following schema updates of an existing database:
If there is a
PROTOfield of theENUMtype that is referenced by a table or index key, you can't removeENUMvalues from the proto enums. (Removal ofENUMvalues from enums used byENUM<>columns is supported, including when those columns are used as keys.)Change a
STRING(36)column to aUUIDcolumn or aUUIDcolumn to aSTRING(36)column.
Schema update performance
Schema updates in Spanner don't require downtime. When you issue a batch of DDL statements to a Spanner database, you can continue writing and reading from the database without interruption while Spanner applies the update as a long-running operation.
The time it takes to execute a DDL statement depends on whether the update
requires validation of the existing data or backfill of any data. For example,
if you add the NOT NULL annotation to an existing column,
Spanner must read all the values in the column to make sure that
the column does not contain any NULL values. This step can take a long time if
there is a lot of data to validate. Another example is if you're adding an index
to a database: Spanner backfills the index using existing data,
and that process can take a long time depending on how the index's definition
and the size of the corresponding base table. However, if you add a new column
to a table, there is no existing data to validate, so Spanner can
make the update more quickly.
In summary, schema updates that don't require Spanner to validate existing data can happen in minutes. Schema updates that require validation can take longer, depending on the amount of existing data that needs to be validated, but data validation happens in the background at a lower priority than production traffic. Schema updates that require data validation are discussed in more detail in the next section.
Schema updates validated against view definitions
When you make a schema update, Spanner validates that the update won't invalidate the queries used to define existing views. If validation is successful, the schema update succeeds. If validation is not successful, the schema update fails. Check Best practices when creating views for details.
Schema updates that require data validation
You can make schema updates that require validating that the existing data meets the new constraints. When a schema update requires data validation, Spanner disallows conflicting schema updates to the affected schema entities and validates the data in the background. If validation is successful, the schema update succeeds. If validation is not successful, the schema update does not succeed. Validation operations are executed as long-running operations. You can check the status of these operations to determine if they succeeded or failed.
For example, suppose you have defined the following music.proto file with a
RecordLabel enum and Songwriter protocol message:
enum RecordLabel {
COOL_MUSIC_INC = 0;
PACIFIC_ENTERTAINMENT = 1;
XYZ_RECORDS = 2;
}
message Songwriter {
required string nationality = 1;
optional int64 year_of_birth = 2;
}
To add a Songwriters table in your schema:
GoogleSQL
CREATE PROTO BUNDLE (
googlesql.example.music.Songwriter,
googlesql.example.music.RecordLabel,
);
CREATE TABLE Songwriters (
Id INT64 NOT NULL,
FirstName STRING(1024),
LastName STRING(1024),
Nickname STRING(MAX),
OpaqueData BYTES(MAX),
SongWriter googlesql.example.music.Songwriter
) PRIMARY KEY (Id);
CREATE TABLE Albums (
SongwriterId INT64 NOT NULL,
AlbumId INT64 NOT NULL,
AlbumTitle STRING(MAX),
Label INT32
) PRIMARY KEY (SongwriterId, AlbumId);
The following schema updates are allowed, but they require validation and might take longer to complete, depending on the amount of existing data:
Adding the
NOT NULLannotation to a non-key column. For example:ALTER TABLE Songwriters ALTER COLUMN Nickname STRING(MAX) NOT NULL;Reducing the length of a column. For example:
ALTER TABLE Songwriters ALTER COLUMN FirstName STRING(10);Altering from
BYTEStoSTRING. For example:ALTER TABLE Songwriters ALTER COLUMN OpaqueData STRING(MAX);Altering from
INT64/INT32toENUM. For example:ALTER TABLE Albums ALTER COLUMN Label googlesql.example.music.RecordLabel;Removing existing values from the
RecordLabelenum definition.Enabling commit timestamps on an existing
TIMESTAMPcolumn. For example:ALTER TABLE Albums ALTER COLUMN LastUpdateTime SET OPTIONS (allow_commit_timestamp = true);Adding a check constraint to an existing table.
Adding a stored generated column to an existing table.
Creating a new table with a foreign key.
Adding a foreign key to an existing table.
These schema updates fail if the underlying data does not satisfy the new
constraints. For example, the ALTER TABLE Songwriters ALTER COLUMN Nickname
STRING(MAX) NOT NULL statement fails if any value in the Nickname column is
NULL, because the existing data does not meet the NOT NULL constraint of the
new definition.
Data validation can take from several minutes to many hours. The time to complete data validation depends on:
- The size of the dataset
- The compute capacity of the instance
- The load on the instance
Some schema updates can change the behavior of requests to the database before
the schema update completes. For example, if you're adding NOT NULL to a
column, Spanner almost immediately begins rejecting writes for
new requests that use NULL for the column. If the new schema update ultimately
fails for data validation, there will have been a period of time when writes
were blocked, even if they would have been accepted by the old schema.
You can cancel a long-running data validation operation using the
projects.instances.databases.operations.cancel
method or using gcloud spanner
operations.
Schema versions created during schema updates
Spanner uses schema versioning so that there is no downtime during a schema update to a large database. Spanner maintains the older schema version to support reads while the schema update is processed. Spanner then creates one or more new versions of the schema to process the schema update. Each version contains the result of a collection of statements in a single atomic change.
The schema versions don't necessarily correspond one-to-one with either batches of DDL statements or individual DDL statements. Some individual DDL statements, such as index creation for existing base tables or statements that require data validation, result in multiple schema versions. In other cases, several DDL statements can be batched together in a single version. Old schema versions can consume significant server and storage resources, and they are retained until they expire (no longer needed to serve reads of earlier versions of data).
The following table shows how long it takes Spanner to update a schema.
| Schema operation | Estimated duration |
|---|---|
CREATE TABLE |
Minutes |
CREATE INDEX |
Minutes to hours, if the base table is created before the index. Minutes, if the statement is executed at the same time as the |
DROP TABLE |
Minutes |
DROP INDEX |
Minutes |
ALTER TABLE ... ADD COLUMN |
Minutes |
ALTER TABLE ... ALTER COLUMN |
Minutes to hours, if background validation is required. Minutes, if background validation is not required. |
ALTER TABLE ... DROP COLUMN |
Minutes |
ANALYZE |
Minutes to hours, depending on the database size. |
Data type changes and change streams
If you change the data type of a column that a change
stream watches, the column_types field of
relevant subsequent change stream
records reflects its
new type, as does the old_values JSON data within the records' mods field.
The new_values of a change stream record's mods field always matches a
column's current type. Changing a watched column's data type does not affect any
change stream records predating that change.
In the particular case of a BYTES-to-STRING change, Spanner
validates the column's old values as part of
the schema update. As a result, Spanner has safely decoded the
old BYTES-type values into strings by the time it writes any subsequent change
stream records.