Abstract |
An information processing apparatus is provided that includes a data storage unit, a selecting unit, and an extracting unit. The data storage unit stores at least two rules for extracting information from a document written in a markup language. The selecting unit selects, from the rules stored in the data storage unit, a rule to be applied to at least one part of an input document written in the markup language, in accordance with the appearance frequency of a specific character string in that part. The extracting unit extracts information from the part using the rule selected by the selecting unit.
|
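
The arrangement described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and does not come from the source: the two rules (`extract_table_cells`, `extract_paragraphs`), the specific character string `"<td"`, the threshold of 2, and the use of regular expressions in place of a real markup parser. The sketch only shows the general idea of storing several rules, counting how often a string appears in a part of the document, and choosing the rule accordingly.

```python
import re
from typing import Callable, Dict, List

# Hypothetical rule type: takes a markup fragment, returns extracted strings.
Rule = Callable[[str], List[str]]


def extract_table_cells(part: str) -> List[str]:
    """Illustrative rule A: return the text inside each <td> cell."""
    cells = re.findall(r"<td[^>]*>(.*?)</td>", part, flags=re.S | re.I)
    return [c.strip() for c in cells]


def extract_paragraphs(part: str) -> List[str]:
    """Illustrative rule B: return the text inside each <p> element."""
    paras = re.findall(r"<p[^>]*>(.*?)</p>", part, flags=re.S | re.I)
    return [p.strip() for p in paras]


# Stand-in for the "data storage unit": holds at least two extraction rules.
RULES: Dict[str, Rule] = {
    "table": extract_table_cells,
    "paragraph": extract_paragraphs,
}


def select_rule(part: str, marker: str = "<td", threshold: int = 2) -> Rule:
    """Stand-in for the "selecting unit".

    Counts how often `marker` appears in the part and picks a rule.
    The marker and threshold are assumed values, not taken from the source.
    """
    frequency = part.lower().count(marker)
    return RULES["table"] if frequency >= threshold else RULES["paragraph"]


def extract(part: str) -> List[str]:
    """Stand-in for the "extracting unit": apply the selected rule."""
    rule = select_rule(part)
    return rule(part)
```

With these assumed rules, `extract("<table><tr><td>Name</td><td>Price</td></tr></table>")` returns `["Name", "Price"]`, because `"<td"` appears twice and the table rule is selected, while `extract("<p>Hello</p><p>World</p>")` returns `["Hello", "World"]` via the paragraph rule.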