主权项 |
1. A system for harvesting electronic content by custodian, comprising:
a processor to execute the following:
a collaboration environment to maintain content associated with user names for one or more custodians;a mapper to receive from a client user a custodian list comprising custodian names of at least a portion of the custodians, each custodian name comprising at least one of a full name, legal name, partial name, and nickname, to obtain an access report comprising the user names and associated unique identifiers for the custodians with access to the content within the collaboration environment, to compare the custodian names to the user names in the access report, to determine a similarity between at least one of the custodian names in the custodian list and one or more of the user names in the access report, and to identify for at least one of the custodians one or more of the user names that partially match the custodian name for that custodian based on the similarity as user name options;a comparison module to select the user name option most similar to the custodian name for the at least one custodian and to compare the similarity of the most similar user name option to a confidence threshold;an option module to, upon the confidence threshold being satisfied by the similarity of the most similar user name option, provide the most similar user name option as a user name suggestion for the at least one custodian to the client user;a confirmation module to receive a confirmation of the user name suggestion being the user name for the at least one custodian from the client user;a harvester to identify the content associated with the at least one custodian using the confirmed user name and to make a list of the identified content;an exporting module to export the identified content to a storage external to the collaboration environment;an application module to apply the list of the identified content to the external storage and to determine whether all of the identified content has been exported to the external storage; andan identifying module to, upon the determination that at least some of the identified content has not been exported, identify and export the unexported content to the external storage. |