If I remember right, there's also a command line script in neo4j-admin or something that's for even more data.
Discussion
Yes, you’re right. I’ve never used anything other than simple Cypher MERGE statements or the LOAD CSV method.
Looks like the neo4j-admin import command would be better for massive data loads, like initializing a neo4j db using a preexisting relay with lots of data.
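For anyone curious what preparing a bulk load might look like, here is a rough sketch that turns nostr follow lists into the two CSV files that neo4j-admin import reads. The header fields (`:ID`, `:LABEL`, `:START_ID`, `:END_ID`, `:TYPE`) follow the importer's CSV header format; everything else is an assumption made for illustration: the `NostrUser` label, the `FOLLOWS` relationship type, the function name, and the idea that events arrive as already-parsed dicts with only the latest kind 3 event per author.

```python
import csv
import io


def write_import_csvs(events, nodes_out, rels_out):
    """Write node and relationship CSVs using neo4j-admin import style headers.

    Assumptions (not from the thread): events are dicts shaped like nostr
    events, only kind 3 (follow list) events matter, and the caller has
    already kept just the newest kind 3 event per author.
    Returns the number of distinct pubkeys written as nodes.
    """
    node_writer = csv.writer(nodes_out)
    rel_writer = csv.writer(rels_out)

    # Header rows tell the importer how to interpret each column.
    node_writer.writerow(["pubkey:ID", ":LABEL"])
    rel_writer.writerow([":START_ID", ":END_ID", ":TYPE"])

    seen_nodes = set()
    seen_edges = set()

    def add_node(pubkey):
        # The importer expects each node ID once, so deduplicate here.
        if pubkey not in seen_nodes:
            seen_nodes.add(pubkey)
            node_writer.writerow([pubkey, "NostrUser"])  # hypothetical label

    for event in events:
        if event.get("kind") != 3:
            continue  # skip everything that is not a follow list
        author = event["pubkey"]
        add_node(author)
        for tag in event.get("tags", []):
            # "p" tags carry the followed pubkey in the second position.
            if len(tag) >= 2 and tag[0] == "p":
                edge = (author, tag[1])
                if edge in seen_edges:
                    continue  # ignore repeated "p" tags for the same pubkey
                seen_edges.add(edge)
                add_node(tag[1])
                rel_writer.writerow([author, tag[1], "FOLLOWS"])  # hypothetical type

    return len(seen_nodes)


if __name__ == "__main__":
    sample = [
        {"kind": 3, "pubkey": "alice", "tags": [["p", "bob"], ["p", "carol"]]},
        {"kind": 1, "pubkey": "bob", "tags": []},
    ]
    nodes_buf, rels_buf = io.StringIO(), io.StringIO()
    write_import_csvs(sample, nodes_buf, rels_buf)
    print(nodes_buf.getvalue())
    print(rels_buf.getvalue())
```

The resulting files would then be handed to the importer through its `--nodes` and `--relationships` options. The exact command differs by version (`neo4j-admin import` in 4.x, `neo4j-admin database import full` in 5.x), so check the docs for whichever one you run.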
And a fourth option: APOC procedures to load from “diverse sources like json, xml, and other databases.” I’ve used graph data science for PageRank but don’t think I’ve ever used APOC. Probably worth looking into.
neo4j-admin import is the most performant option, in my experience.
Do you have any experience with LMDB? Most nostr relays use LMDB (either exclusively, like strfry, or as one of several options, like khatru), so a set of performant ETL pipeline tools from LMDB to neo4j would be valuable to the nostr ecosystem.
Unfortunately, no experience whatsoever.
Wanna learn? 😄💜
Yes, but my time is limited, so I am not sure how much I can contribute. I have been lurking for a while to see what you have been up to. Definitely seems interesting!
Feel free to chime in with thoughts and ideas! Gonna take lots of work to make personalized nostr knowledge graphs a thing; the more relevant talent and experience we can apply to the task, the better 🧠