Time-to-Publication Data#

Publication times for 244 clinical trials funded by the National Heart, Lung, and Blood Institute.

  • posres: Did the trial produce a positive (significant) result? 1=Yes, 0=No.

  • multi: Did the trial involve multiple centers? 1=Yes, 0=No.

  • clinend: Did the trial focus on a clinical endpoint? 1=Yes, 0=No.

  • mech: Funding mechanism within National Institute of Health: a qualitative variable.

  • sampsize: Sample size for the trial.

  • budget: Budget of the trial, in millions of dollars.

  • impact: Impact of the trial; this is related to the number of publications.

  • time: Time to publication, in months.

  • status: Whether or not the trial was published at time: 1=Published, 0=Not yet published.

Source#

  • Gordon, Taddei-Peters, Mascette, Antman, Kaufmann, and Lauer. Publication of trials funded by the National Heart, Lung, and Blood Institute. New England Journal of Medicine, 369(20):1926-1934, 2013.

from ISLP import load_data
Publication = load_data('Publication')
Publication.columns
Index(['posres', 'multi', 'clinend', 'mech', 'sampsize', 'budget', 'impact',
       'time', 'status'],
      dtype='object')
Publication.shape
(244, 9)
Publication.columns
Index(['posres', 'multi', 'clinend', 'mech', 'sampsize', 'budget', 'impact',
       'time', 'status'],
      dtype='object')
Publication.describe().iloc[:,:4]
posres multi clinend sampsize
count 244.000000 244.000000 244.000000 244.000000
mean 0.401639 0.151639 0.184426 2000.573770
std 0.491237 0.359408 0.388629 6206.057536
min 0.000000 0.000000 0.000000 18.000000
25% 0.000000 0.000000 0.000000 150.000000
50% 0.000000 0.000000 0.000000 300.000000
75% 1.000000 0.000000 0.000000 810.250000
max 1.000000 1.000000 1.000000 48835.000000