Prompted by discussion in this post on Meta.MathOverflow.net I got interested in comparing usage of tags from MathOverflow to submissions in the respective disciplines of arXiv. (Vide a similar idea of language popularity, GitHub vs StackOverflow (or one from 20015).) Moreover, as people often use their real names on MO, it may be interesting to check the overlap of mathematicians.
The question is, how to get such data?
There is arXiv API, but for bulk downloads of metadata they recommend Open Archives Initiative (OAI). Yet, as I see, it can query one article at a time, and one needs to know its id. So without knowing arXiv ids beforehand, it turns into a guessing game.
There are some plots in arXiv usage statistics, yet I don't see this exact data.
Also, one can get total submission to math
from links in Mathematics -> Article statistics by year, but it misses the splitting into subdisciplines.
Aucun commentaire:
Enregistrer un commentaire