Title of invention: Video-voice preparation of electronic tax return
Abstract: Methods, systems and computer program products for processing video of tax documents and associated verbal input provided by a user and populating at least a portion of an electronic tax return with processing results. A video/voice processor associated with a tax return preparation application executed by a computing apparatus such as a mobile communication device receives a video of a tax document and voice data. The document type is determined using video and/or voice data. Voice to text conversion can be used to determine what a user said about the document to determine the document type. Tax data determined from the video is used to populate a field of the electronic tax return. A front facing camera may be used to take a video of a tax document while a rear facing camera is used to detect a facial expression, which may result in certain dialogue with the user.
Publication number: US9406089(B2) Publication date: 2016.08.02
Application number: US201313874382 Filing date: 2013.04.30
Applicant: INTUIT INC. Inventors: Mori Kenichi; Marr Justin C.; Harriss Catherine M. H.
Classification: G06Q40/00 Main classification: G06Q40/00
Agency: Vista IP Law Group LLP Agent: Vista IP Law Group LLP; Lueck Gary D.
Principal claim: 1. A computer-implemented method for populating an electronic tax return, the computer-implemented method being executed by a mobile communication device comprising a data store comprising a tax return preparation application operable to prepare an electronic tax return, a first camera that is a front facing camera, a second camera that is a rear facing camera, a microphone and a video/voice processor, each of the data store, the first camera, the second camera and the microphone being in communication with the video/voice processor, the method comprising: the mobile communication device, by the first camera, recording a video of a tax document, the recorded video comprising a plurality of video frames and voice data generated based on a user of the mobile communication device speaking into the microphone during recording of the video, the voice data comprising a user-spoken description of how the tax document is relevant to the electronic tax return; converting, by the video/voice processor of the mobile communication device, the voice data from a voice format into a text format; analyzing, by the video/voice processor, at least one video frame of the video and the voice data in the text format to determine a document type and tax data contained within the at least one video frame; identifying, by the tax return preparation application executed by a processor of the mobile communication device, a field of the electronic tax return to be populated with determined tax data of the determined document type; populating, by the tax return preparation application, the field of the electronic tax return with the determined tax data to prepare at least a portion of the electronic tax return without the user typing tax data of the tax document that was captured in the video into the field of the electronic tax return; detecting, by the second camera, a facial expression or gesture of the user during preparation of the electronic tax return; determining, by the video/voice processor, a first response based at least in part on the detected facial expression or gesture; and presenting, by the tax return preparation application, the first response to the user during preparation of the electronic tax return.
Address: Mountain View, CA, US
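
The data flow recited in the principal claim (determine the document type from the spoken description and a video frame, identify the matching return field, populate it, and choose a response to a detected facial expression) can be illustrated with a minimal sketch. This is not the patented implementation. It assumes speech-to-text and frame text recognition have already run, so both inputs are plain strings. The keyword table, field names, expression labels, and every function name below are hypothetical and chosen only for illustration.

```python
import re

# Hypothetical keyword table: document type -> phrases that suggest it.
DOC_TYPE_KEYWORDS = {
    "W-2": ("w-2", "w2", "wage and tax statement"),
    "1099-INT": ("1099-int", "interest income"),
}

# Hypothetical mapping: (document type, label printed on the document)
# -> name of the electronic tax return field to populate.
FIELD_MAP = {
    ("W-2", "wages"): "wages_salaries_tips",
    ("W-2", "federal income tax withheld"): "federal_tax_withheld",
    ("1099-INT", "interest income"): "taxable_interest",
}

# Hypothetical expression labels and the dialogue they trigger.
EXPRESSION_RESPONSES = {
    "confused": "Would you like help with this step?",
    "frustrated": "We can come back to this document later.",
}


def determine_document_type(transcript, frame_text):
    """Score each type by keyword hits in the transcript plus the frame text."""
    combined = f"{transcript} {frame_text}".lower()
    scores = {
        doc_type: sum(combined.count(keyword) for keyword in keywords)
        for doc_type, keywords in DOC_TYPE_KEYWORDS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def extract_tax_data(doc_type, frame_text):
    """Find 'label ... amount' pairs for the given type in the frame text."""
    lowered = frame_text.lower()
    data = {}
    for (mapped_type, label), field in FIELD_MAP.items():
        if mapped_type != doc_type:
            continue
        # The amount is the first number with two decimals after the label.
        match = re.search(re.escape(label) + r"\D*?([\d,]+\.\d{2})", lowered)
        if match:
            data[field] = float(match.group(1).replace(",", ""))
    return data


def populate_return(tax_return, transcript, frame_text):
    """Return a copy of tax_return with fields filled from the frame text."""
    doc_type = determine_document_type(transcript, frame_text)
    if doc_type is None:
        return dict(tax_return)
    updated = dict(tax_return)
    updated.update(extract_tax_data(doc_type, frame_text))
    return updated


def response_for_expression(expression):
    """Look up a response for a detected expression; None if unrecognized."""
    return EXPRESSION_RESPONSES.get(expression)
```

For example, calling `populate_return({}, "This is my W-2 from my employer", "Wage and Tax Statement Wages 52,000.00 Federal income tax withheld 6,100.50")` fills the two hypothetical W-2 fields without any typed input. A real system would replace the keyword scoring and regular expression with proper speech recognition, optical character recognition, and expression detection.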