If I remember right, there's also a command-line tool, neo4j-admin import or something like that, meant for even larger data loads.

Discussion

Yes, you’re right. I’ve never used anything other than simple Cypher MERGE statements or the LOAD CSV method.

Looks like the neo4j-admin import command would be better for massive data loads, like initializing a Neo4j database from an existing relay that already holds lots of data.
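For anyone curious what that bulk path looks like, here is a rough sketch of preparing the input files. neo4j-admin import reads CSV files whose headers mark the ID column (`:ID`), the label (`:LABEL`), and the relationship endpoints and type (`:START_ID`, `:END_ID`, `:TYPE`). Everything else here is an assumption for illustration: the sample pubkeys, the `NostrUser` label, the `FOLLOWS` type, and the `build_import_csvs` helper are all made up, and a real pipeline would read follow lists out of the relay instead of a hard-coded list.

```python
import csv
import io

# Hypothetical sample data: follow-list events reduced to (author, followed) pairs.
follows = [
    ("alice_pubkey", "bob_pubkey"),
    ("alice_pubkey", "carol_pubkey"),
    ("bob_pubkey", "carol_pubkey"),
]


def build_import_csvs(follow_pairs):
    """Return (nodes_csv, rels_csv) as text, using neo4j-admin import style headers."""
    # Every pubkey that appears on either side of a follow becomes one node row.
    pubkeys = sorted({pk for pair in follow_pairs for pk in pair})

    nodes = io.StringIO()
    writer = csv.writer(nodes, lineterminator="\n")
    writer.writerow(["pubkey:ID", ":LABEL"])  # "NostrUser" label is an assumption
    for pk in pubkeys:
        writer.writerow([pk, "NostrUser"])

    rels = io.StringIO()
    writer = csv.writer(rels, lineterminator="\n")
    writer.writerow([":START_ID", ":END_ID", ":TYPE"])  # "FOLLOWS" type is an assumption
    for src, dst in follow_pairs:
        writer.writerow([src, dst, "FOLLOWS"])

    return nodes.getvalue(), rels.getvalue()


nodes_csv, rels_csv = build_import_csvs(follows)
```

The two strings would be written to files and passed to the import tool through its `--nodes` and `--relationships` options. The exact command name and the remaining flags differ between Neo4j versions, so check the docs for the version you run.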

And a fourth option: APOC procedures to load from “diverse sources like json, xml, and other databases.” I’ve used the Graph Data Science library for PageRank, but I don’t think I’ve ever used APOC. Probably worth looking into.

The neo4j-admin import is the most performant, in my experience.

Do you have any experience with LMDB? Most nostr relays use LMDB (either exclusively, like strfry, or as one of several options, like khatru), so a set of performant ETL pipeline tools from LMDB to Neo4j would be valuable to the nostr ecosystem.

Unfortunately, no experience whatsoever.

Wanna learn? 😄💜

Yes, but my time is limited, so I am not sure how much I can contribute. I have been lurking for a while to see what you have been up to. Definitely seems interesting!

Feel free to chime in with thoughts and ideas! Gonna take lots of work to make personalized nostr knowledge graphs a thing; the more relevant talent and experience we can apply to the task, the better 🧠

It's also possible to write your own custom APOC functions and procedures in Java to expand upon the existing ones...

Do you have any experience using the Graph Data Science Pregel API to write a custom centrality algo? I’m thinking of doing that for GrapeRank at some point.
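For context on what "Pregel" means here, a minimal sketch of the vertex-centric model in plain Python: in each superstep, every node sends messages along its outgoing edges, then every node recomputes its value from the messages it received. This is not the Graph Data Science Pregel API (which is Java), and it is not GrapeRank. It is ordinary PageRank written in that style, with a made-up `pregel_pagerank` function and a toy follow graph, just to show where a custom scoring rule would plug in.

```python
def pregel_pagerank(out_edges, damping=0.85, supersteps=20):
    """PageRank as a superstep loop: send messages, then update each node."""
    nodes = set(out_edges) | {t for targets in out_edges.values() for t in targets}
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}

    for _ in range(supersteps):
        # Send phase: each node splits its current rank evenly across its out-edges.
        inbox = {v: 0.0 for v in nodes}
        for v in nodes:
            targets = out_edges.get(v, [])
            if targets:
                share = rank[v] / len(targets)
                for t in targets:
                    inbox[t] += share

        # Compute phase: each node updates its value from the messages it received.
        # A custom centrality would swap in its own update rule here.
        rank = {v: (1 - damping) / n + damping * inbox[v] for v in nodes}

    return rank


# Toy follow graph (hypothetical names): key follows each entry in its list.
follows = {"alice": ["bob", "carol"], "bob": ["carol"], "carol": ["alice"]}
ranks = pregel_pagerank(follows)
```

Nodes with no outgoing edges simply drop their rank in this sketch, so the scores only sum to 1 when every node follows at least one other node. A real implementation would need to decide how to handle that.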