lundi 16 février 2015

Dump of arXiv metadata


Prompted by discussion in this post on Meta.MathOverflow.net I got interested in comparing usage of tags from MathOverflow to submissions in the respective disciplines of arXiv.


The question is, how to get such data?


There is arXiv API, but for bulk downloads of metadata they recommend Open Archives Initiative (OAI). Yet, as I see, it can query one article at a time, and one needs to know its id. So without knowing arXiv ids beforehand, it turns into a guessing game.


There are some plots in arXiv usage statistics, yet I don't see this exact data.


Also, one can get total submission to math from links in Mamthematics -> Article statistics by year, but it misses the splitting into subdisciplines.





Aucun commentaire:

Enregistrer un commentaire