Serialize complex ZCL attribute types for database persistence by TheJulianJES · Pull Request #1792 · zigpy/zigpy

TheJulianJES · 2026-03-16T21:46:42Z

Background

This is somewhat needed for Ubisys devices that we re-interview (LD6). We (obviously) clear the whole attribute cache, including local attributes, so we need to actually read the device-specific attribute and parse that after the (re-)interview.

This already works fine at runtime. Between HA restarts, we rely on the local attribute, so we don't technically need this for that, but we log an error about failing to save this when reading the attribute. We could also just ignore those errors and not log them...?

EDIT: Actually, we already seem to only log these errors at debug level, so we don't need to push through with this PR now if we don't like it.

For this solution, see the summary below. The code changes are relatively small.

AI summary

Attribute values of complex ZCL types (e.g. LVList[LVBytes]) could not be persisted to the SQLite database, raising sqlite3.ProgrammingError: type 'AnonymousLVList' is not supported. This affected custom clusters like the ubisys 0xFC00 cluster that define array-typed attributes.
On save, values that aren't natively supported by SQLite (None, int, float, str, bytes) are now serialized to bytes via their .serialize() method before storage.
On load, serialized bytes values are deserialized back into their original typed form using the attribute definition's type, so cluster.get(attr) returns the correct type after restart.

Note: pre-existing type fidelity gap

There is a pre-existing issue (not introduced by this PR) where the type of simple attribute values is not preserved across restarts. For example, a uint16_t attribute read live returns a t.uint16_t instance, but after DB reload it comes back as a plain Python int, because SQLite stores both identically as INTEGER. This PR does not attempt to fix that — it only addresses the crash for types SQLite cannot store at all.

Note

We could also guard the the _save_attribute_cache call that happens for all attributes when zigpy is shutting down, so that it doesn't fail completely, but only for attributes that can't be serialized (of which there should be none..?).
However, because of the code below, we never call _save_attribute_cache for quirked devices anyway, purely relying on the runtime/event attributes updates for DB persistence. Since non-quirked devices using pure ZCL attributes mostly/only use "simple" types that can be saved directly into the DB, we don't really need the extra guard?

Either way, we could revert this commit to add that guard: 40ac143

Code preventing quirked devices from calling _save_attribute_cache:

zigpy/zigpy/appdb.py

Lines 408 to 420 in 7d1ea41

    
           if device.node_desc is not None: 
        
               await self._save_node_descriptor(device) 
        
           if isinstance(device, zigpy.quirks.BaseCustomDevice): 
        
               await self._db.commit() 
        
               return 
        
           await self._save_endpoints(device) 
        
           for ep in device.non_zdo_endpoints: 
        
               await self._save_clusters(ep) 
        
               await self._save_attribute_cache(ep) 
        
               await self._save_unsupported_attributes(ep) 
        
           await self._db.commit()

Copilot

Pull request overview

This PR improves Zigpy’s SQLite persistence of ZCL attribute cache entries by adding a serialization/deserialization layer for complex ZCL attribute value types that SQLite can’t store directly (e.g., LVList[...]). This addresses persistence failures seen with some custom/manufacturer clusters (notably ubisys-style array/list attributes) across restarts.

Changes:

Add _serialize_for_db() to convert non-SQLite-native attribute values into bytes via .serialize() before storing.
Add _deserialize_from_db() to restore serialized bytes into typed ZCL values using the attribute definition’s type on DB load.
Add tests covering native passthrough, complex type round-tripping, and persistence for a custom cluster attribute.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
`zigpy/appdb.py`	Serializes complex attribute values for SQLite storage and deserializes them on cache load.
`tests/test_appdb.py`	Adds unit/integration tests validating serialization behavior and DB round-trip for complex types.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

TheJulianJES · 2026-03-18T02:58:08Z

+    if hasattr(value, "serialize"):
+        return value.serialize()
+
+    raise ValueError(


I think it would be fine to log exceptions when we can't save something to the DB now? (not just in debug)
It shouldn't happen, except maybe because of the issue where we have quirks return datetime objects for attributes defined as CharacterString? Backlog issue:

Broken timestamp sensors due to datetime objects and zigpy conversion to str zha#519

Yeah, these errors would now no longer be confined to the debug log level. It's generally good IMO, but as long as we still have this fundamentally broken behavior for these quirks, we may want to consider just raising a custom SerializationError, which we can then only log at debug level again – just catching all ValueErrors might be a bit too much(?)

diff --git a/zigpy/appdb.py b/zigpy/appdb.py index e86a5dc9..efcb15d9 100644 --- a/zigpy/appdb.py +++ b/zigpy/appdb.py @@ -113,6 +113,10 @@ def aiosqlite_connect( _SQLITE_TYPES = (type(None), int, float, str, bytes) +class SerializationError(Exception): + """Raised when an attribute value cannot be serialized for database storage.""" + + def _serialize_for_db(value: Any) -> None | int | float | str | bytes: """Convert a ZCL attribute value to a type SQLite can store natively.""" if isinstance(value, _SQLITE_TYPES): @@ -121,7 +125,7 @@ def _serialize_for_db(value: Any) -> None | int | float | str | bytes: if hasattr(value, "serialize"): return value.serialize() - raise ValueError( + raise SerializationError( f"Cannot persist attribute value of type {type(value).__name__!r} to the " f"database: {value!r}" ) @@ -229,7 +233,7 @@ class PersistingListener(zigpy.util.CatchingTaskMixin): assert handler try: await handler(*args) - except sqlite3.Error as exc: + except (sqlite3.Error, SerializationError) as exc: LOGGER.debug( "Error handling '%s' event with %s params: %s", cb_name,

That kind of keeps the old behavior. We should later revert that if we no longer have any known non-matching types in attribute cache (compared to attribute definition type).

TheJulianJES · 2026-03-16T21:53:59Z

                attrid,
                manufacturer_code,
                Status.SUCCESS,
-                cache_item.value,
+                _serialize_for_db(cache_item.value),
                cache_item.last_updated.timestamp(),


Yeah, this is what I mentioned in the PR description. We could re-add that but it's not really used at all at the moment, as this never runs for quirked devices (40ac143).

TheJulianJES · 2026-03-16T22:38:30Z

The only test that's failing is this:

zigpy/tests/test_appdb_migration.py

Lines 705 to 726 in 7d1ea41

    
                       # Unmigrated row from v13->v14 (disambiguated cluster) 
        
                       (str(dev.ieee), 1, 0, 0xFC01, 0x0010, -1, b"\x42", 0), 
        
                       # Unmigrated row from v13->v14 (ambiguous cluster) 
        
                       (str(dev.ieee), 1, 0, 0xFC02, 0x0020, -1, b"\x99", 0), 
        
                       # Manually read through the UI after v13->v14, duplicating the above 
        
                       # but with a newer value 
        
                       (str(dev.ieee), 1, 0, 0xFC01, 0x0010, 0xABCD, b"\x43", 1.0), 
        
                   ], 
        
               ) 
        
               conn.commit() 
        
           with patch("zigpy.quirks.DEVICE_REGISTRY", registry): 
        
               app = await make_app_with_db(db_path) 
        
               dev = app.get_device(ieee=t.EUI64.convert("aa:bb:cc:dd:11:22:33:44")) 
        
           # Migration runs during load 
        
           disambiguated = dev.endpoints[1].in_clusters[0xFC01] 
        
           ambiguous = dev.endpoints[1].in_clusters[0xFC02] 
        
           # 2 candidates (1 manuf + 1 non-manuf): picked manuf-specific. 
        
           # Uses the newer value from the already-resolved row, not stale/unmigrated value. 
        
           assert disambiguated.get("manuf_attr") == b"\x43"

manuf_attr is defined as t.uint8_t, yet we store bytes in the DB. This doesn't happen in practice.

We could prevent deserialization if or issubclass(attr_type, (*_SQLITE_TYPES,)), but I feel like this is exclusively an issue with the existing test. If the bytes are replaced with integers, the issue no longer happens.

puddly · 2026-03-16T22:53:31Z

Awesome! What do you think about completely skipping the implicit storage done by sqlite and always serializing types, including integers and strings?

TheJulianJES · 2026-03-16T23:16:44Z

That would be the fourth open PR with a migration from v14 to v15 😄

But hmm... I'm not sure. It would make the serialization/deserialization added here less special. That's nice but it's also not complicated code to exclude some types. When peeking into a DB for debugging purposes, you'd also get slightly less readable values. And there's some DB tests that would need to be updated to use the serialized values (likely both queries and their output). No big issues though.

I think it would be fine..? It does feels a bit like using a DB and then just storing JSON in it (ignoring our backups 😅).
Alternatively, we could deserialize/convert "SQLite types" to zigpy objects when restoring attribute values, so we have the correct types at runtime (in a future PR). We'd have most of the same benefits, except that we still use the SQLite types when possible.

But yeah, I don't have a real opinion here. Both ideas seem fine.

This reverts commit 8ec3b5b.

codecov · 2026-03-16T23:35:41Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.51%. Comparing base (7d1ea41) to head (6949152).

Additional details and impacted files

@@           Coverage Diff           @@
##              dev    #1792   +/-   ##
=======================================
  Coverage   99.51%   99.51%           
=======================================
  Files          63       63           
  Lines       12792    12808   +16     
=======================================
+ Hits        12730    12746   +16     
  Misses         62       62

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

puddly · 2026-03-17T23:45:48Z

Alternatively, we could deserialize/convert "SQLite types" to zigpy objects when restoring attribute values, so we have the correct types at runtime (in a future PR). We'd have most of the same benefits, except that we still use the SQLite types when possible.

I think that's a good approach.

The only cases where it would matter what objects make it into SQLite are:

Invalid integers for the data type (handled by serialize/deserialize on load)
SQLite's REAL is a double so it should store double, single, and half precision without any loss.
Bad character strings with null bytes or other weirdness. Do these survive a database round trip?
Everything else is opaque and doesn't benefit from being stored as a readable type.

TheJulianJES · 2026-03-18T00:53:29Z

Those are good points. From some quick tests, they should be non-issues.
Except maybe the first point with invalid ints:

If an invalid int (e.g. 999 for uint8_t) makes it into the DB (e.g. from a bad quirk calling update_attribute), and we just uint8_t(999) from the SQLite type with that approach when restoring from the DB, we would now catch the issue when loading (ValueError), instead of blindly restoring the int to 999 in runtime attribute cache.
If we serialized everything before saving, we would catch it when saving to the DB, but still keep the int in runtime cache.
So, before we normalize the types to zigpy ones when loading, we should probably have update_attribute validate the values (for defined types) that quirks call it with. Only read_attributes does some type validation at the moment.
- Related fun fact: Fix reading/writing attributes with mixed manufacturer codes #1719 removed the guards for failed normalization, so we now fail an entire cluster handler setup if one attribute definition's type is incorrect. This may be something to look into.

Also, we were apparently storing model and manufacturer strings as raw bytes in the DB before #147. Should we eventually migrate any such potential entries? I don't think we did so far and just keep converting those very legacy entries during appdb load?

zigpy/zigpy/appdb.py

Lines 113 to 117 in 5cb5429

    
           def decode_str_attribute(value: str | bytes) -> str: 
        
               if isinstance(value, str): 
        
                   return value 
        
               return value.split(b"\x00", 1)[0].decode("utf-8")

zigpy/zigpy/appdb.py

Lines 860 to 870 in 5cb5429

    
           # Populate the device's manufacturer and model attributes 
        
           if ( 
        
               row.cluster_id == Basic.cluster_id 
        
               and attr_def == Basic.AttributeDefs.manufacturer 
        
           ): 
        
               dev.manufacturer = decode_str_attribute(row.value) 
        
           elif ( 
        
               row.cluster_id == Basic.cluster_id 
        
               and attr_def == Basic.AttributeDefs.model 
        
           ): 
        
               dev.model = decode_str_attribute(row.value)

Technically, we could also use _deserialize_from_db a bit earlier and have it handle those bytes -> CharacterString cases for now, also when restoring to device.manufacturer and device.model. Either approach is something for another PR anyway.

Though currently, _deserialize_from_db leaves these untouched when trying to deserialize those legacy bytes to a CharacterString (for the attribute cache only), as it expects a length-prefixed "string" already. These are just raw bytes though (likely present in few instance today).

With this PR, we will debug log "Failed to deserialize cached value for type CharacterString, using raw bytes" for those legacy values (and just leave them untouched). I think this is fine?
Or should we explicitly not try to convert the manufacturer and model values for now, instead of letting deserialization fail for such legacy values and debug log? Or just ignore deserialization of bytes for all attr definitions that are str-based (issubclass(attr_type, str)) (or CharacterString directly). IMO, the current approach with debug logging is fine here.. We can fix existing raw bytes string values in a future migration.

Draft of a possible patch where `_deserialize_from_db` deals with this for all strings, but with no DB migration (CLICK TO EXPAND)

diff --git a/zigpy/appdb.py b/zigpy/appdb.py
index e86a5dc9..f2dad413 100644
--- a/zigpy/appdb.py
+++ b/zigpy/appdb.py
@@ -142,6 +142,10 @@ def _deserialize_from_db(value: Any, attr_type: type | None) -> Any:
     try:
         result, _ = attr_type.deserialize(value)  # type: ignore[attr-defined]
     except (ValueError, KeyError):
+        # Legacy bytes values for string types: decode as UTF-8 with null stripping
+        if issubclass(attr_type, str):
+            return value.split(b"\x00", 1)[0].decode("utf-8", errors="replace")
+
         LOGGER.debug(
             "Failed to deserialize cached value for type %s, using raw bytes",
             attr_type,
@@ -151,13 +155,6 @@ def _deserialize_from_db(value: Any, attr_type: type | None) -> Any:
     return result
 
 
-def decode_str_attribute(value: str | bytes) -> str:
-    if isinstance(value, str):
-        return value
-
-    return value.split(b"\x00", 1)[0].decode("utf-8")
-
-
 class PersistingListener(zigpy.util.CatchingTaskMixin):
     def __init__(
         self,
@@ -888,27 +885,29 @@ class PersistingListener(zigpy.util.CatchingTaskMixin):
 
                 continue
 
-            if row.status == Status.SUCCESS:
-                cluster._attr_cache.set_value(
-                    attr_def,
-                    _deserialize_from_db(row.value, attr_def.type),
-                    last_updated=datetime.fromtimestamp(row.last_updated, UTC),
-                )
-            else:
+            if row.status != Status.SUCCESS:
                 cluster._attr_cache.mark_unsupported(attr_def)
                 continue
 
+            value = _deserialize_from_db(row.value, attr_def.type)
+
+            cluster._attr_cache.set_value(
+                attr_def,
+                value,
+                last_updated=datetime.fromtimestamp(row.last_updated, UTC),
+            )
+
             # Populate the device's manufacturer and model attributes
             if (
                 row.cluster_id == Basic.cluster_id
                 and attr_def == Basic.AttributeDefs.manufacturer
             ):
-                dev.manufacturer = decode_str_attribute(row.value)
+                dev.manufacturer = value
             elif (
                 row.cluster_id == Basic.cluster_id
                 and attr_def == Basic.AttributeDefs.model
             ):
-                dev.model = decode_str_attribute(row.value)
+                dev.model = value
 
     @staticmethod
     def _resolve_unmigrated_attribute(

TheJulianJES added 6 commits March 16, 2026 22:05

Serialize non-basic types to bytes before saving to DB

081743d

Add tests

9aaf879

Also deserialize objects after restart

22a0983

Revert serialization guard in _save_attribute_cache for now

40ac143

Add test for serialization in _save_attribute_cache

75c00cc

Add test for deserialization only happening when needed

cea0a6b

TheJulianJES requested a review from Copilot March 16, 2026 21:46

Copilot started reviewing on behalf of TheJulianJES March 16, 2026 21:47 View session

Copilot AI reviewed Mar 16, 2026

View reviewed changes

TheJulianJES added 3 commits March 17, 2026 00:18

TEST: Do not deserialize SQLite-compatible attr types

8ec3b5b

Revert "TEST: Do not deserialize SQLite-compatible attr types"

357cff8

This reverts commit 8ec3b5b.

Fix existing DB test to use ints for int attr def

6949152

TheJulianJES force-pushed the tjj/serialize_non_sqlite_objects branch from 2a81410 to 6949152 Compare March 16, 2026 23:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Serialize complex ZCL attribute types for database persistence#1792

Serialize complex ZCL attribute types for database persistence#1792
TheJulianJES wants to merge 9 commits intozigpy:devfrom
TheJulianJES:tjj/serialize_non_sqlite_objects

TheJulianJES commented Mar 16, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

TheJulianJES Mar 18, 2026

Uh oh!

TheJulianJES Mar 18, 2026

Uh oh!

TheJulianJES Mar 16, 2026

Uh oh!

TheJulianJES commented Mar 16, 2026

Uh oh!

puddly commented Mar 16, 2026

Uh oh!

TheJulianJES commented Mar 16, 2026 •

edited

Loading

Uh oh!

codecov bot commented Mar 16, 2026

Uh oh!

puddly commented Mar 17, 2026

Uh oh!

TheJulianJES commented Mar 18, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	if device.node_desc is not None:
	await self._save_node_descriptor(device)

	if isinstance(device, zigpy.quirks.BaseCustomDevice):
	await self._db.commit()
	return

	await self._save_endpoints(device)
	for ep in device.non_zdo_endpoints:
	await self._save_clusters(ep)
	await self._save_attribute_cache(ep)
	await self._save_unsupported_attributes(ep)
	await self._db.commit()

Conversation

TheJulianJES commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Background

AI summary

Note: pre-existing type fidelity gap

Note

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

TheJulianJES Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

TheJulianJES Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

TheJulianJES Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

TheJulianJES commented Mar 16, 2026

Uh oh!

puddly commented Mar 16, 2026

Uh oh!

TheJulianJES commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Mar 16, 2026

Codecov Report

Uh oh!

puddly commented Mar 17, 2026

Uh oh!

TheJulianJES commented Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

TheJulianJES commented Mar 16, 2026 •

edited

Loading

TheJulianJES commented Mar 16, 2026 •

edited

Loading

TheJulianJES commented Mar 18, 2026 •

edited

Loading