I think a relay-based model may work better.
MOD-REQ
This skips the overhead of signatures for each moderated event, and moderation may not be instant (so servers can wait a bit before responding).
And you can, for example, blur media or hide events while the timeout is still running.
Types of flags are specified in NIP-11.
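
To make the idea a bit more concrete, here is a rough client-side sketch of the timeout behaviour. Everything in it is an assumption for illustration: the `["MOD-REQ", <sub id>, <event ids>]` layout (borrowed from the JSON array shape of other Nostr client messages), the `PendingModeration` class, the `timeout_s` parameter, and the choice to show an event once the timeout passes without an answer. None of this is a defined message format.

```python
import json
from dataclasses import dataclass
from typing import Optional


def build_mod_req(sub_id: str, event_ids: list[str]) -> str:
    """Serialize a hypothetical MOD-REQ message (layout is assumed, not specified)."""
    return json.dumps(["MOD-REQ", sub_id, event_ids])


@dataclass
class PendingModeration:
    """Tracks one event while the client waits for the relay's moderation answer."""
    requested_at: float                # when the MOD-REQ was sent (seconds)
    timeout_s: float                   # how long the client waits (assumed parameter)
    flags: Optional[list[str]] = None  # None until the relay has answered

    def display_state(self, now: float) -> str:
        # The relay answered: hide flagged events, show unflagged ones.
        if self.flags is not None:
            return "hidden" if self.flags else "visible"
        # No answer yet and still inside the timeout: blur as a precaution.
        if now - self.requested_at < self.timeout_s:
            return "blurred"
        # Timeout elapsed without an answer: fall back to showing the event.
        return "visible"
```

A client could call `display_state` on each render, so an event goes from blurred to either visible or hidden without any per-event signature being checked. Falling back to "visible" after the timeout is just one possible policy; a stricter client could keep the event hidden instead.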