## Block Segmentation

Figure 12.1 shows a generalized representation of block segmentation for PDF projection-based models. The input data is segmented in various ways. In the figure is divided into 1, 2, 3, and 4 equal segments. In general, the segments do not need to be of uniform size. What distiguishes block segmentation from other types is the fact that the segments are non-overlapping and no window function is used (rectangular weighting). It is also important that all samples of are included in the totality of the segments. This means that for uniform-sized segments, the input size must be an integer multiple of the segment size. For this reason, it is prudent to truncate all input data to a multiple of some highly divisible number, such as .

In Figure 12.1, each branch uses a different uniform segment size, and the same feature transformation is used for all segments in each branch. The uniform segmentation and uniform feature extraction is the most common because it lends itself to PDF estimation using HMM (Section 13.3). In general, the segmentation can be non-uniform and different feature transformations can be used within a branch. Furthermore, some branches could use the same segmentation, but different feature transformations.

Block segmentation is necessary for strict interpretation of the PDF projection theorem in the presence of segmented data. Let's talk about one branch in Figure 12.1. Let be the segment size, so , where is the number of segments. Then,

Let be the reference hypothesis, in which the samples of are iid, either exponential (3.9) or Gaussian (3.11). It follows that the segments are independent,

 (12.1)

and if each segment feature is computed from the corresponding data segment, , we have

 (12.2)

Then, PDF projection (2.2) is applied using (12.1) and (12.2). The joint feature PDF is obtained from the HMM forward procedure (Section 13.3). Thus, block segmentation allows the exact application of PDF projection (2.2) through the strict independence of the segments under . There are, however, significant disadvantages of block segmentation. It is not guaranteed that the segment boundaries fall at appropriate times. The occurrences of events in the data may get split into several segments, and extreme Gibbs-effect will be seen in the frequency domain. The result can be poor and erratic feature extraction. For these reasons, it may be prefereable to use overlapped/windowed segments.

Baggenstoss 2017-05-19