
<mods xmlns="http://www.loc.gov/mods/v3" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-4.xsd">
    
    <titleInfo>
        <title>Quantitative Analysis of a Common Audio Similarity Measure</title>
    </titleInfo>
    <name type="personal">
        <namePart type="family">Jensen</namePart>
        <namePart type="given">Jesper Hojvang</namePart>
        <role>
            <roleTerm type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="family">Christensen</namePart>
        <namePart type="given">Mads Gaesboll</namePart>
        <role>
            <roleTerm type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal" ID="de171">
        <namePart type="family">Ellis</namePart>
        <namePart type="given">Daniel P. W.</namePart>
        <role>
            <roleTerm type="text">author</roleTerm>
        </role>
        <affiliation>Columbia University. Electrical Engineering</affiliation>
    </name>
    <name type="personal">
        <namePart type="family">Jensen</namePart>
        <namePart type="given">Soren Holdt</namePart>
        <role>
            <roleTerm type="text">author</roleTerm>
        </role>
    </name>
    <name type="corporate">
        <namePart>Columbia University. Electrical Engineering</namePart>
        <role>
            <roleTerm type="text">originator</roleTerm>
        </role>
    </name>
    <typeOfResource>text</typeOfResource>
    <genre>Articles</genre>
    
    <originInfo>
        <dateIssued encoding="w3cdtf" keyDate="yes">2009</dateIssued>
    </originInfo>
    
    <language>
        <languageTerm type="text">English</languageTerm>
    </language>
    <abstract>For music information retrieval tasks, a nearest neighbor classifier using the Kullback-Leibler divergence between Gaussian mixture models of songs&apos; mel-frequency cepstral coefficients is commonly used to match songs by timbre. In this paper, we analyze this distance measure analytically and experimentally by the use of synthesized MIDI files, and we find that it is highly sensitive to different instrument realizations. Despite the lack of theoretical foundation, it handles the multipitch case quite well when all pitches originate from the same instrument, but it has some weaknesses when different instruments play simultaneously. As a proof of concept, we demonstrate that a source separation frontend can improve performance. Furthermore, we have evaluated the robustness to changes in key, sample rate, and bitrate.</abstract>
    <subject>
        <topic>Electrical engineering</topic>
    </subject>
    <subject>
        <topic>Acoustics</topic>
    </subject>
    <relatedItem type="host">
        <titleInfo>
            <title>IEEE Transactions on Audio, Speech, and Language Processing</title>
        </titleInfo>
        <part>
            <detail type="volume">
                <number>17</number>
            </detail>
            <detail type="issue">
                <number>4</number>
            </detail>
            <extent unit="page">
                <start>693</start>
                <end>703</end>
            </extent>
            <date>2009-05</date>
        </part>
        <identifier type="doi">http://dx.doi.org/10.1109/TASL.2008.2012314</identifier>
    </relatedItem>
    <identifier type="hdl">http://hdl.handle.net/10022/AC:P:11816</identifier>
    
    <location>
        <physicalLocation authority="marcorg">NNC</physicalLocation>
    </location>
    
    <recordInfo>
        <recordContentSource authority="marcorg">NNC</recordContentSource>
        <recordCreationDate encoding="w3cdtf">2011-11-18T12:33:10-05:00</recordCreationDate>
        <recordChangeDate encoding="w3cdtf">2012-12-18T00:48:51-05:00</recordChangeDate>
        <recordIdentifier>5799</recordIdentifier>
        <languageOfCataloging>
            <languageTerm authority="iso639-2b">eng</languageTerm>
        </languageOfCataloging>
    </recordInfo>
    
</mods>
