The hover is present:

Mixing them make perfect sense to me since they are in a single conversation between *two* people (groups/rooms, what do they have to do here?) that can use different clients; and so in this NIP-4-deprecation phase they can have a coherent and continuous conversation.
