> A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network

https://github.com/IllDepence/unarXive/tree/master

Reply to this note

Please Login to reply.

Discussion

Generated it from this dataset