Principal claim |
1. A system for identifying anchors for fields using optical character recognition data, the system comprising:
one or more processors; and a non-transitory computer readable medium storing a plurality of instructions, which when executed, cause the one or more processors to:
identify a first collection of characters comprising a first set of characters at a first position relative to a first field in a first document and a second set of characters at a second position relative to the first field in the first document, wherein the first set of characters is associated with a first word and the second set of characters is associated with a second word;
create a first anchor in the first document based on the first collection of characters, wherein the first anchor is at a third position relative to the first field in the first document, and wherein the first anchor is associated with a second field in the first document;
identify a second collection of characters comprising a third set of characters at a fourth position relative to a third field in a second document and a fourth set of characters at a fifth position relative to the third field in the second document, wherein the third set of characters is associated with a third word and the fourth set of characters is associated with a fourth word;
determine a location of a second anchor in the second document by calculating a vector based on the first, second, third and fourth sets of characters; and
identify a fourth field in the second document that corresponds to the second field in the first document based on the location of the second anchor in the second document. |
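
The claim does not say how the vector is calculated, so the following is only an illustrative sketch of one possible reading, not the claimed method. It assumes each set of characters has been reduced to the centre point of its OCR bounding box, that the first and third words (and the second and fourth words) are the same labels in the two documents, and that the second document differs from the first by a shift, uniform scale, and rotation. The names `Word`, `locate_second_anchor`, and `nearest_field` are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """A word from OCR output, reduced to the centre of its bounding box."""
    text: str
    x: float
    y: float


def locate_second_anchor(first, second, first_anchor, third, fourth):
    """Estimate the anchor location in document 2 (assumed approach).

    The vector from `first` to `second` in document 1 is compared with the
    vector from `third` to `fourth` in document 2. Their ratio, taken as a
    complex number, encodes the scale and rotation between the documents.
    """
    p1, p2 = complex(first.x, first.y), complex(second.x, second.y)
    q1, q2 = complex(third.x, third.y), complex(fourth.x, fourth.y)
    if p1 == p2:
        raise ValueError("the two reference words must be at distinct positions")
    ratio = (q2 - q1) / (p2 - p1)
    # Offset of the anchor from the first word, carried into document 2.
    offset = complex(*first_anchor) - p1
    anchor = q1 + ratio * offset
    return (anchor.real, anchor.imag)


def nearest_field(anchor, fields):
    """Return the name of the field whose position is closest to the anchor."""
    return min(fields, key=lambda name: math.dist(anchor, fields[name]))


# Document 1: two label words and an anchor placed relative to them.
doc1_first = Word("Invoice", 100.0, 200.0)
doc1_second = Word("Number", 300.0, 200.0)
doc1_anchor = (200.0, 250.0)

# Document 2: the same words, shifted and scanned at twice the scale.
doc2_third = Word("Invoice", 50.0, 50.0)
doc2_fourth = Word("Number", 450.0, 50.0)

second_anchor = locate_second_anchor(
    doc1_first, doc1_second, doc1_anchor, doc2_third, doc2_fourth
)
# Hypothetical field positions detected in document 2.
doc2_fields = {"invoice_no": (255.0, 155.0), "total": (400.0, 900.0)}
matched_field = nearest_field(second_anchor, doc2_fields)
```

With these sample coordinates the anchor lands at (250.0, 150.0) in the second document, and the closest field is `invoice_no`. A production system would likely use more than two reference words and a more robust matching rule; the nearest-distance lookup here is an assumption made for brevity.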