Abstract |
<p>Software clones in a large software package, e.g. DMS operating software, are identified from the physical layout of the source code and/or from the occurrence frequency and distribution of keywords in the code. These features define multicomponent vectors, which are input to a neural network. The output of the network comprises a histogram analysis of the various code sequences (potential clones) in the source code.</p> |
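<p>As a rough illustration of how a code sequence might be turned into a multicomponent vector, the sketch below combines a few layout measurements with keyword counts. The function name <code>feature_vector</code>, the keyword list, and the particular layout features (line count, blank-line count, mean indentation) are assumptions chosen for illustration; the abstract does not specify them, and the neural network stage is not shown.</p>

```python
import re
from collections import Counter

# Hypothetical keyword list: the source does not say which keywords are counted.
KEYWORDS = ("if", "else", "while", "for", "return")


def feature_vector(code: str) -> list[float]:
    """Build a simple feature vector from layout and keyword frequency.

    Components (all assumed for illustration):
      [0] total number of lines
      [1] number of blank lines
      [2] mean leading-space indentation of non-blank lines
      [3:] occurrence count of each keyword in KEYWORDS
    """
    lines = code.splitlines()
    nonblank = [ln for ln in lines if ln.strip()]

    # Layout features derived from the physical arrangement of the text.
    n_lines = len(lines)
    n_blank = n_lines - len(nonblank)
    if nonblank:
        indents = [len(ln) - len(ln.lstrip(" ")) for ln in nonblank]
        mean_indent = sum(indents) / len(nonblank)
    else:
        mean_indent = 0.0

    # Keyword features: count identifier-like tokens, then look up each keyword.
    tokens = Counter(re.findall(r"[A-Za-z_]\w*", code))
    keyword_counts = [float(tokens[k]) for k in KEYWORDS]

    return [float(n_lines), float(n_blank), mean_indent] + keyword_counts
```

<p>Vectors of this kind, computed for each candidate code sequence, could then be passed to a classifier or clustering network; sequences with similar vectors would be candidates for clones.</p>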