Brain Reading for Human-Machine Interfaces
ML Analysis. All ML evaluations were performed using the open source signal processing and classification environment pySPACE [66]. Data processing was as follows: Windowing and preprocessing were performed directly on the raw data from the recording device. To prevent preprocessing artifacts, such as filter border artifacts, from influencing classification performance, we performed the complete preprocessing (including decimation and filtering) on a larger window from −200 to 1400 ms relative to stimulus onset. We chose the following preprocessing based on the rationale given above (see Sec. "Average ERP Analysis" and [64]): the data were baseline-corrected (using the 100 ms window prior to stimulus onset), decimated to 25 Hz and subsequently lowpass filtered with a cut-off frequency of 4 Hz.
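For concreteness, the following minimal Python sketch reproduces this preprocessing chain outside of pySPACE. It is illustrative only: the raw sampling rate of 1000 Hz, the epoch layout, and the Butterworth filter design are assumptions, not taken from the paper.

```python
"""Illustrative preprocessing sketch (not the authors' pySPACE flow).
Assumed input: one epoch of shape (n_channels, n_samples), sampled at
1000 Hz and spanning -200 to 1400 ms around stimulus onset."""
import numpy as np
from scipy import signal

FS_RAW = 1000          # assumed raw sampling rate in Hz (not from the paper)
FS_TARGET = 25         # decimation target stated in the paper
EPOCH_START_MS = -200  # start of the preprocessing window stated in the paper

def preprocess_epoch(epoch: np.ndarray) -> np.ndarray:
    """Baseline-correct, decimate to 25 Hz, and lowpass filter at 4 Hz."""
    # Baseline correction: subtract the mean of the 100 ms before stimulus onset.
    onset_idx = int(-EPOCH_START_MS / 1000 * FS_RAW)
    n_baseline = int(0.100 * FS_RAW)
    baseline = epoch[:, onset_idx - n_baseline:onset_idx].mean(axis=1, keepdims=True)
    epoch = epoch - baseline

    # Decimate from 1000 Hz to 25 Hz in two stages (8 x 5 = 40); scipy's
    # decimate applies an anti-aliasing filter at each stage.
    epoch = signal.decimate(epoch, 8, axis=1, zero_phase=True)
    epoch = signal.decimate(epoch, 5, axis=1, zero_phase=True)

    # Lowpass filter with a 4 Hz cut-off at the new 25 Hz sampling rate.
    b, a = signal.butter(4, 4.0, btype="low", fs=FS_TARGET)
    return signal.filtfilt(b, a, epoch, axis=1)
```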
As in the ERP analysis, runs 2, 3 and 4 of both sessions were used for training and testing. In contrast to the average ERP analysis described above, we included the early time window of 0–300 ms in the ML analysis. This was done to control for the fact that early time windows may still contribute to the classification of the different classes (standards, targets, missed targets), even though we hypothesized that the main differences are caused by the P300 effect (see Sec. "Relevant Averaged ERP Activity").
It is important to note here that several dependencies have to be kept in mind when evaluating the results: First, performance depends on window size, since a larger window contains more features and thus a higher dimensionality of the signal. Second, the classifier parameters depend on the underlying data (and its dimensionality). However, the purpose of this investigation was to compare different windows (and sizes) with respect to their suitability for classification, so we always assessed the results relative to window size and starting point of the window, and performed statistical analysis only on windows of equal length.
Furthermore, the classifier parameters were not tuned but set to a fixed value to exclude data-dependent effects: in the entire analysis, we used a support vector machine (SVM) as implemented in LIBSVM [67] (C-SVC with a linear kernel) with a fixed complexity of 100, simulating a hard margin. We then cut different windows by varying the starting point (0 ms to 700 ms) and the window size (200 ms to 800 ms) in steps of 100 ms. Data used for training and testing were different, as outlined above: we trained on standards and targets of one experimental run and tested missed targets versus targets of another run within the same session. All possible combinations of the above-mentioned runs within one session were tested. Classifier features were the preprocessed time-channel values, i.e., the amplitudes.
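The window sweep and classifier transfer scheme can be sketched as follows. This is not the authors' pySPACE node chain; scikit-learn's SVC (a LIBSVM wrapper) stands in, and the array layout is an assumption.

```python
"""Sketch of the fixed-parameter, sliding-window evaluation (illustrative).
Epoch arrays are assumed to have shape (n_trials, n_channels, n_samples)
at 25 Hz, with sample 0 corresponding to stimulus onset."""
import numpy as np
from sklearn.svm import SVC
from sklearn.metrics import roc_auc_score

FS = 25  # sampling rate after decimation (Hz)

def window_features(epochs: np.ndarray, start_ms: int, size_ms: int) -> np.ndarray:
    """Cut one time window and flatten it into time-channel amplitude features."""
    a = int(start_ms * FS / 1000)
    b = int((start_ms + size_ms) * FS / 1000)
    return epochs[:, :, a:b].reshape(len(epochs), -1)

def evaluate_window(train_epochs, train_labels, test_epochs, test_labels,
                    start_ms, size_ms):
    """Train a linear SVM with fixed complexity C=100 (quasi hard margin) on
    one run and report the AUC on another run (classifier transfer)."""
    clf = SVC(kernel="linear", C=100.0)  # scikit-learn's SVC wraps LIBSVM
    clf.fit(window_features(train_epochs, start_ms, size_ms), train_labels)
    scores = clf.decision_function(window_features(test_epochs, start_ms, size_ms))
    return roc_auc_score(test_labels, scores)

def sweep_windows(train_epochs, train_labels, test_epochs, test_labels):
    """Evaluate every window used in the paper: start 0-700 ms and size
    200-800 ms, both in 100 ms steps (combinations are assumed to fit the epoch)."""
    return {
        (start, size): evaluate_window(train_epochs, train_labels,
                                       test_epochs, test_labels, start, size)
        for start in range(0, 701, 100)
        for size in range(200, 801, 100)
    }
```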
The corresponding classification performance was computed using the area under the curve (AUC) [68], which is an indicator of the general separability of the two classes in the data. The AUC is the area under the receiver operating characteristic curve. This curve maps the different true positive rates (TPR) against the false positive rates (1 − TNR) obtained when the decision boundary is varied from −∞ to +∞. The AUC is then computed as the integral of the resulting function. In this way, we investigated the linear separability of the data essentially independently of the applied classifier.
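This description corresponds to the following from-scratch computation, shown only to make the threshold sweep explicit; in practice a library routine such as sklearn.metrics.roc_auc_score yields the same value.

```python
"""From-scratch AUC illustration: sweep the decision boundary over all score
values, record (FPR, TPR) pairs, and integrate the resulting ROC curve."""
import numpy as np

def auc_from_scores(labels: np.ndarray, scores: np.ndarray) -> float:
    """labels: 1 for the positive class, 0 for the negative class;
    scores: real-valued classifier outputs (e.g. SVM decision values)."""
    thresholds = np.concatenate(([np.inf], np.sort(scores)[::-1], [-np.inf]))
    n_pos = (labels == 1).sum()
    n_neg = (labels == 0).sum()
    tpr, fpr = [], []
    for t in thresholds:
        pred = scores >= t
        tpr.append((pred & (labels == 1)).sum() / n_pos)  # true positive rate
        fpr.append((pred & (labels == 0)).sum() / n_neg)  # 1 - TNR
    tpr, fpr = np.array(tpr), np.array(fpr)
    # Trapezoidal integration of TPR over FPR = area under the ROC curve.
    return float(np.sum(np.diff(fpr) * (tpr[1:] + tpr[:-1]) / 2.0))
```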
For statistical inference, we chose three time windows from the aforementioned temporal segmentation that match the later time windows chosen for the ERP analysis (300–600 ms and 600–900 ms, see Sec. "Average ERP Analysis") and the early time window of 0–300 ms. This procedure relates the results of the classifier performance-based approach to the results of the ERP analysis. Classification performances for the different window sizes were statistically analyzed using repeated measures ANOVA with the within-subjects factors time window (0–300 ms, 300–600 ms, and 600–900 ms) and subject. Corrections were applied where necessary. Classification performances after optimizing the classifier were analyzed using repeated measures ANOVA with subject as within-subjects factor. Where necessary, the Greenhouse–Geisser correction was applied and the corrected p-value is reported. For multiple comparisons, the Bonferroni correction was applied.
In a further analysis we investigated the possibility of improving classification performance by combining the information from two windows. We combined the middle time window (300 to 600 ms) with each of the two other time windows (early: 0 to 300 ms; late: 600 to 900 ms) separately.
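The paper does not spell out how the information of the two windows is combined; one plausible realization, shown here purely as an assumption, is to concatenate the flattened features of both windows before training.

```python
"""Assumed realization of the two-window combination (the mechanism is not
specified in the text): feature-level concatenation."""
import numpy as np

def combine_windows(features_middle: np.ndarray, features_other: np.ndarray) -> np.ndarray:
    """Concatenate the features of the 300-600 ms window with those of an
    early (0-300 ms) or late (600-900 ms) window along the feature axis."""
    return np.concatenate([features_middle, features_other], axis=1)
```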
To determine the classification performance that can be achieved under optimized conditions, we finally performed an additional analysis with the goal of obtaining a better estimate of the applicability of our approach of classifier transfer between the classes standard and missed target. The processing window was chosen based on the results of the systematic ML analysis explained above. We optimized the SVM complexity parameter using 5-fold cross validation in combination with a pattern search algorithm [69] to evaluate the overall performance in the application with an adjusted classifier.
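The following sketch illustrates such a complexity optimization with 5-fold cross validation. A plain grid over C is used as a stand-in for the pattern search algorithm of [69], and the search range is an assumption.

```python
"""Illustrative sketch of tuning the SVM complexity parameter with 5-fold
cross validation; a grid search replaces the pattern search of [69]."""
import numpy as np
from sklearn.svm import SVC
from sklearn.model_selection import GridSearchCV, StratifiedKFold

def optimize_complexity(features: np.ndarray, labels: np.ndarray) -> SVC:
    """Return a linear C-SVC with the complexity chosen by 5-fold CV."""
    grid = {"C": np.logspace(-3, 3, 13)}  # assumed search range, not from the paper
    search = GridSearchCV(
        SVC(kernel="linear"),
        param_grid=grid,
        cv=StratifiedKFold(n_splits=5),
        scoring="balanced_accuracy",  # matches the BA measure reported below
    )
    search.fit(features, labels)
    return search.best_estimator_
```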
In this subsequent investigation we used the optimized SVM on the chosen time window. Further, we computed the balanced accuracy (BA) as a performance measure for that window. The balanced accuracy [70] is the arithmetic mean of the true positive rate (TPR) and the true negative rate (TNR) and is calculated as

BA = 1/2 (TPR + TNR).    (1)

Both performance measures used (AUC and BA) are insensitive to unbalanced or even changing ratios of the two classes (positive class P and negative class N), which is most important in the application, where we have an oddball-like situation with frequent and infrequent examples. For both metrics, a value of 0.5 means guessing and 1 means perfect classification.
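Equation (1) translates directly into code; sklearn.metrics.balanced_accuracy_score computes the same quantity. Labels are assumed to be coded 1 (positive) and 0 (negative).

```python
"""Balanced accuracy (Eq. 1): the mean of TPR and TNR, robust to the class ratio."""
import numpy as np

def balanced_accuracy(labels: np.ndarray, predictions: np.ndarray) -> float:
    pos = labels == 1
    neg = labels == 0
    tpr = float((predictions[pos] == 1).mean())  # true positive rate
    tnr = float((predictions[neg] == 0).mean())  # true negative rate
    return 0.5 * (tpr + tnr)
```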
Results

Behavior. In total, 724 omission errors (575.2 ± 82.23) occurred; thus 724 missed targets were observed, and correct responses were found for 2876 target stimuli. No commission errors (i.e., responses to standard stimuli) were found.
Figure 7 shows the median response time for each subject across the two sessions. Based on the buzzer press event, responses occurred 837 ms after the target stimuli (mean of the subjects' medians). The median of the minimal response times was 597 ms and the median of the maximal response times was 1783 ms. The difference between the minimal and maximal response time ranged between 686 ms and 1657 ms (median: 1131 ms). EMG onsets began even earlier in time (mean of the subjects' medians: 551 ms). Here, the median of the minimal response times was 336 ms and the median of the maximal response times was 1466.5 ms. The difference between the minimal and maximal response time ranged between 563 ms and 1621 ms (median: 1130.5 ms). There is no difference between the median difference in response time based on the buzzer event (median: 1131 ms) and the median difference in response time based on the EMG onset (median: 1130.5 ms).
Relevant Averaged ERP Activity. The grand average over all subjects of the standard, target and missed target ERP patterns at the centro-parietal electrode (Pz) is depicted in Fig. 4. Significant differences between standards and targets (i.e., the P300 effect) were observed [F(2,50) = 65.27, p < 0.001, pairwise com-

