<p>A video editing device (100) includes a registration section (91) and a clip section (71). The registration section (91) registers, as a search key in a management section (51), a key candidate that consists of the feature vector of an acoustic signal and that has been judged, on the basis of a co-occurrence score, to be one that should be registered. The clip section (71) obtains an integrated score for each block from the degrees of similarity between that block and the registered search keys, and clips a group of blocks whose integrated scores exceed an integrated threshold as one video scene.</p>
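<p>The registration and clipping steps described above might be sketched as follows. This is only an illustrative sketch under stated assumptions, not the device's actual method: the abstract does not say how similarity is measured, how per-key similarities are combined, or how a block group is delimited. Here cosine similarity, a plain sum over search keys, and runs of consecutive blocks are assumed, and all function names (<code>register_keys</code>, <code>integrated_scores</code>, <code>clip_scenes</code>) are hypothetical.</p>

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (assumed similarity measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def register_keys(candidates, cooccurrence_threshold):
    """Keep key candidates whose co-occurrence score reaches the threshold.

    candidates: list of (feature_vector, cooccurrence_score) pairs.
    The thresholding rule itself is an assumption; the abstract only says
    the judgement is based on the co-occurrence score.
    """
    return [vec for vec, score in candidates if score >= cooccurrence_threshold]


def integrated_scores(block_features, search_keys):
    """Integrated score per block: sum of similarities to every search key (assumed)."""
    return [
        sum(cosine_similarity(block, key) for key in search_keys)
        for block in block_features
    ]


def clip_scenes(scores, integrated_threshold):
    """Group consecutive blocks whose score exceeds the threshold into scenes.

    Returns a list of (first_block_index, last_block_index) pairs, inclusive.
    """
    scenes = []
    start = None
    for i, score in enumerate(scores):
        if score > integrated_threshold:
            if start is None:
                start = i  # a new scene begins at this block
        elif start is not None:
            scenes.append((start, i - 1))  # the scene ended at the previous block
            start = None
    if start is not None:
        scenes.append((start, len(scores) - 1))  # scene runs to the final block
    return scenes
```

<p>In this sketch, a caller would first pass the key candidates through <code>register_keys</code>, then compute <code>integrated_scores</code> for the per-block acoustic feature vectors, and finally hand those scores to <code>clip_scenes</code> to obtain the block ranges treated as video scenes. A weighted sum or a maximum over keys could be substituted for the plain sum without changing the overall structure.</p>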