This is a misconception. Images and videos "on Nostr" are simply uploaded to normal HTTP servers, and native Nostr notes only contain links to them.
What a client does or does not render inline (so it looks like it's part of the Nostr note) depends entirely on the client.