I have a neo4j instance that maintains a graph of about 200k user nodes and the follow and mute connections between them. It can calculate personalized PageRank in about 15 seconds and is (as you would expect) better than any non-graph method at generating a list of followers (or muters).

I’m thinking my next step is to pair neo4j with strfry and build an ETL pipeline from strfry’s LMDB to neo4j. neo4j could then calculate reputation scores in real time, and we could use those scores to adjust the strfry filters and keep out spam.

In theory, an ETL pipeline that reads straight from LMDB should be pretty performant, but I haven’t run across any LMDB-to-neo4j tools. Relational-DB-to-neo4j tools, on the other hand, are easy to find.
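To make the transform step concrete, here is a rough sketch of what I have in mind. It sidesteps LMDB entirely and assumes the events arrive as JSON lines (for example from strfry's export command), so it says nothing about how fast a direct LMDB reader would be. It relies on follow lists being kind 3 events (NIP-02) and mute lists being kind 10000 events (NIP-51), with the targets in public `p` tags. The `User` label, the `pubkey` property, the `FOLLOWS`/`MUTES` relationship names, and all function names are placeholders I made up, not anything strfry or neo4j defines.

```python
import json

FOLLOW_KIND = 3      # NIP-02 follow list
MUTE_KIND = 10000    # NIP-51 mute list
REL_BY_KIND = {FOLLOW_KIND: "FOLLOWS", MUTE_KIND: "MUTES"}


def edges_from_event(event):
    """Turn one follow-list or mute-list event into edge rows.

    Only public "p" tags are read; other kinds yield no rows.
    """
    rel = REL_BY_KIND.get(event.get("kind"))
    if rel is None:
        return []
    seen = set()
    rows = []
    for tag in event.get("tags", []):
        if len(tag) >= 2 and tag[0] == "p" and tag[1] not in seen:
            seen.add(tag[1])
            rows.append({"src": event["pubkey"], "rel": rel, "dst": tag[1]})
    return rows


def edges_from_jsonl(lines):
    """Parse an iterable of JSON-encoded events, skipping blank lines."""
    rows = []
    for line in lines:
        line = line.strip()
        if line:
            rows.extend(edges_from_event(json.loads(line)))
    return rows


# Cypher cannot take a relationship type as a parameter,
# so there is one batched statement per relationship type.
MERGE_QUERIES = {
    rel: (
        "UNWIND $rows AS row "
        "MERGE (a:User {pubkey: row.src}) "
        "MERGE (b:User {pubkey: row.dst}) "
        "MERGE (a)-[:" + rel + "]->(b)"
    )
    for rel in REL_BY_KIND.values()
}


def load_edges(driver, rows):
    """Write edge rows using an already-open neo4j Python driver."""
    with driver.session() as session:
        for rel, query in MERGE_QUERIES.items():
            batch = [r for r in rows if r["rel"] == rel]
            if batch:
                session.run(query, rows=batch).consume()
```

Two things this sketch does not handle: follow and mute lists are replaceable events, so a newer list should also delete edges that are no longer present, and any private mute entries encrypted in the event content are ignored.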

Any thoughts?
