Page 66 - Technology and Innovation Journal - 19-1
P. 66

404                           DEMPWOLF & SHNEIDERMAN



      new insights and frame better hypotheses. Event-  product idea was first conceived. Limiting our anal-
      Flow provides tools for visually simplifying event  ysis to drugs and medical devices, we take the date of
      sequences to reveal common and rare patterns (8,9).  final FDA approval as the end date and a reasonable
        Theory Formation (Modeling): A key goal for   proxy for product launch date. Neither the dates that
      researchers is to develop and test theories so as to   commercial ideas were originally conceived nor the
      guide future activities. The well-established linear   actual product launch dates are reliably recorded or
      model of innovation (basic research leads to applied   made publicly available, thus the need for proxies.
      research, then product development, culminating in     The datasets available for modeling STI processes
      commercialization) has its followers, as well as many  (see Table 3) have several current limitations, and
      critics. Comparisons with alternative models such   much of the work yet to be done under this study
      as the ABC principle (applied and basic combined)   involves cleaning, matching, transforming, and link-
      could advance understanding of what leads to more   ing existing datasets. We present two preliminary
      frequently successful outcomes (10). It is fairly com-  examples that demonstrate some of the event ana-
      mon practice in articles and presentations to show   lytics capabilities of EventFlow (www.cs.umd.edu/
      the linear model because of its simplicity and then   hcil/ eventflow) and suggest the methods and kinds
      immediately state that, in practice, innovation rarely   of final results we might expect when all of the data
      follows the linear model. The popular understanding   cleaning and matching is completed.
      of innovation might be improved by documenting the     The first example models and analyzes the trajec-
      prevalence of the linear model and its alternatives.  tories starting with clinical trials and ending with last
        Hypothesis Testing: Event analytics can be as   FDA approval for 2,402 medical devices. Clinical
      simple as seeing if event type A occurs more fre-  trial success is typically a necessary input for final
      quently before or after event type B. For example, do  FDA approval. In certain cases, successful results in
      patents precede or follow founding of companies?   early-stage trials may be sufficient for provisional,
      Another simple question is: How soon after found-  temporary approval, allowing the drug or device
      ing a company do companies release a product? A   to be deployed prior to completion of the full set of
      refined version of this question might examine the   clinical trials. The preliminary results of this second
      distribution of times between founding a company   analysis demonstrate EventFlow’s ability to simplify
      and releasing a product. There are more sophisticated   the visualization of the dataset in ways that suggest
      questions that can also be posed in event analytic  overarching patterns in the data and allow researchers
      tools, such as: Do companies with three or more  to pose clear, simple questions for further investiga-
      patents before product launches have more successful   tion. In this case, the visualization shows two distinct
      outcomes than companies with fewer patents?  groups in the data: one in which the FDA approval is
                                                  received after clinical trials are completed and one in
      MODELING AND MEASURING INNOVATION           which FDA approval is received during the clinical
      TRAJECTORIES: DATA AND EXAMPLES             trials (see Figures 3 and 4). The visualizations suggest
        The following examples are based on a dataset   several additional research questions, demonstrating
      comprising 34,331 records, each representing a spe-  EventFlow’s usefulness as a tool for data exploration.
      cific drug or medical device. Each record contains     The second example analyzes drug innovation
      the events—research, patents, clinical trials and FDA   trajectories from first patent to last FDA approval for
      approvals—associated with that product. In total, the   884 drugs, resulting in mean, median, and standard
      model includes 85,690 events. The list of event types   deviation metrics for these trajectories (see Figures
      and the count of each type is shown at the bottom of   5 and 6).
      the left EventFlow panel shown in Figure 2.
        As a practical matter, answering the question of   Data Gathering for Innovation Trajectories
      how long innovation takes requires identifying start     We use the EventFlow software to model innova-
      and end points. In our first example, we take the   tion trajectories in drugs and medical devices from
      date of first patent application as the starting point   multiple datasets (Table 3):
      and a reasonable proxy for the date that the initial
   61   62   63   64   65   66   67   68   69   70   71