This visualisation allows one to view the past, present, and (estimated) future gender ratio of authors on academic publications listed on PubMed. The four buttons at the top allow subsetting of the data by journal, research discipline, the author's country of affiliation, and position in the author list (where 'overall' includes all authors).
The lefthand plot shows the estimated author gender ratio for each subset of the data (e.g. a journal, or a scientific discipline) in a certain year. The year can be controlled via the slider. The gender ratio was estimated by fitting a curve to the data, as described in the accompanying paper.
Clicking on a data point in the left plot will bring up a curve showing our estimate of the past, present, and future gender ratio, as well as the author gender ratio and its 95% confidence limits (shown by the error bars, which can be toggled on or off).
Hovering the mouse cursor over a data point shows the sample size in terms of the number of men and women authors, and the number of papers.
The data were collected by downloading all the ~27 million records on PubMed, and attempting to identify the gender of the authors by matching their given names against the genderize.io database. We assigned each of the journals on PubMed to a research discipline, using PubMed's own categorisation scheme where possible, and tried to identify the country in which each author was based from the address they provided. For clarity, the data accessible through this web app are limited to combinations for which we had a sufficiently large sample size in terms of the number of papers (at least 100), years (at least 5), and authors (at least 50 per year for 5 or more years).
The R scripts used to collect and analyse the PubMed data are archived ADD_URL_HERE.
This visualisation accompanies an article published in ADD_CITATION_HERE.