发明名称 Method for Extracting Useful Content from Setup Files of Mobile Applications
摘要 The presented method is a tool based on a vertical search engine that allows automatic extraction of useful content from setup files of mobile applications for further indexation, computerised data processing and storage of useful content of mobile applications on a server for subsequent searches.
申请公布号 US2016239510(A1) 申请公布日期 2016.08.18
申请号 US201615138965 申请日期 2016.04.26
申请人 Closed Joint-Stock Company "RIWW" 发明人 NAGORNY Alexei S.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method for extraction of useful content from setup files of b e applications for further computerised data processing, the method comprising: downloading from the Internet to a server an application setup file in a form of an archive; selecting an archiver for said file; if the archiver has been successfully selected, decompressing the setup file into a file directory; analysing the file directory and comprising a list of files located therein; selecting a file from the list of files for further analysis; selecting file reading software to read the file by searching through known formats; if the file reading software has been successfully selected, analysing the selected file via primary content search; compiling a list of primary content internal location addresses in a form of a row set; performing analysis of a next file as long as there are files in the directory; analysing the text content of the list of primary content internal location addresses and dividing the text of each row into a set of characters identifying the storage method for the relevant content unit, a set of characters identifying the document this content unit pertains to, and a set of characters identifying the type of this content unit; dividing the rows of content unit internal location addresses by storage method into utility content and useful content; removing of utility content; selecting row sets in a remaining list with content unit internal location addresses that have completely matching groups of characters reflecting the content storage method; statistically filtrating selected groups; analysing the text content of the address list rows by the set of document identifying characters and selecting the address groups of content units pertaining to each document of application useful content; extracting useful content pertaining to each document from the application into a separate file, thus generating the application documents; indexing the obtained document files of the application, thus generating a description of its content; storing the application name, link, and description in the database; downloading the setup file of a new application and performing of all the above mentioned sequences; performing computerised processing of the database; performing created indexed array of the database on a server; and using results for users' search queries coming in via the Internet.
地址 Moscow RU