摘要 |
The present invention mainly relates to a method determining passages and forming index. An application of this method is information retrieval. The method of the present invention is to form passages by merging each N consecutive paragraphs, Wherein N is a number greater than 1. Among the passages formed by the method, adjacent passages have N-1 paragraphs to overlap. The rules that people write articles are to express a topic or thought in a paragraph. But people generally can not delimit paragraph precisely. Several paragraphs (for example N paragraphs) are supposed to include a whole thought (or topic). The method of the present invention is that each N consecutive paragraphs in a document forms a passage. If N paragraphs include a topic (or thought), then each topic (or thought) included in the document should have passage that contains it. This is a method making use of people's writing rules to form passages. This method improves the retrieval precision.
|