the signatures must be fully retained, but the content can be compressed down to a series of diffs in the actual database storage format, if the concern is about efficiency of data storage... a compression algorithm will also automatically do a lot of this, but is more expensive than unpacking diffs