I think these are two separate things; the e vs a tag is not really related to kind:1s replying to non-kind1s, which is the context issue.
e vs a is mainly about the version an event is tagging, which I think it's blown out of proportion honestly, but the NIP already recommends e tagging NIP-33s too 😅
