posted on 2008-01-01, 00:00authored byRoger B Dannenberg, Masataka Goto
Music is full of structure, including sections, sequences of distinct musical textures, and the repetition of
phrases or entire sections. The analysis of music audio relies upon feature vectors that convey information
about music texture or pitch content. Texture generally refers to the average spectral shape and statistical
fluctuation, often reflecting the set of sounding instruments, e.g. strings, vocal, or drums. Pitch content
reflects melody and harmony, which is often independent of texture. Structure is found in several ways.
Segment boundaries can be detected by observing marked changes in locally averaged texture. Similar
sections of music can be detected by clustering segments with similar average textures. The repetition of a
sequence of music often marks a logical segment. Repeated phrases and hierarchical structures can be
discovered by finding similar sequences of feature vectors within a piece of music. Structure analysis can
be used to construct music summaries and to assist music browsing.