Title of invention |
SYSTEM AND METHOD OF ANNOTATING UTTERANCES BASED ON TAGS ASSIGNED BY UNMANAGED CROWDS |
Abstract |
A system and method of tagging utterances with Named Entity Recognition ("NER") labels using unmanaged crowds is provided. The system may generate various annotation jobs in which a user, among a crowd, is asked to tag which parts of an utterance, if any, relate to various entities associated with a domain. For a given domain that is associated with a number of entities that exceeds a threshold N value, multiple batches of jobs (each batch having jobs that have a limited number of entities for tagging) may be used to tag a given utterance from that domain. This reduces the cognitive load imposed on a user, and prevents the user from having to tag more than N entities. As such, a domain with a large number of entities may be tagged efficiently by crowd participants without overloading each crowd participant with too many entities to tag. |
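The batching idea in the abstract can be illustrated with a minimal sketch. It assumes the simplest possible split: the domain's entity list is cut into consecutive batches of at most N entities, and one annotation job is created per batch for a given utterance. The function names (`batch_entities`, `make_annotation_jobs`), the job dictionary layout, the example utterance, and the entity names are all hypothetical and are not taken from the application; the actual system may group entities differently.

```python
from typing import Dict, List


def batch_entities(entities: List[str], max_per_job: int) -> List[List[str]]:
    """Split a domain's entity list into consecutive batches of at most max_per_job."""
    if max_per_job < 1:
        raise ValueError("max_per_job must be at least 1")
    return [entities[i:i + max_per_job] for i in range(0, len(entities), max_per_job)]


def make_annotation_jobs(utterance: str, entities: List[str], max_per_job: int) -> List[Dict]:
    """Create one tagging job per entity batch for a single utterance."""
    return [
        {"utterance": utterance, "batch": index, "entities": batch}
        for index, batch in enumerate(batch_entities(entities, max_per_job))
    ]


# Hypothetical music domain with five entities and a threshold N of 2.
domain_entities = ["artist", "album", "song", "genre", "playlist"]
jobs = make_annotation_jobs("play jazz by Miles Davis", domain_entities, max_per_job=2)

for job in jobs:
    # Each crowd participant sees at most max_per_job entities per job.
    print(job["batch"], job["entities"])
```

With these assumed inputs the utterance is covered by three jobs, none of which asks a participant to tag more than two entities, while every entity in the domain still appears in exactly one job.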
Application publication number |
WO2017044409(A1) |
Application publication date |
2017.03.16 |
Application number |
WO2016US50373 |
Filing date |
2016.09.06 |
Applicant |
VOICEBOX TECHNOLOGIES CORPORATION |
Inventors |
ROTHWELL, Spencer, John; BRAGA, Daniela; ELSHENAWY, Ahmad, Khamis; CARTER, Stephen, Steele |
Classification number |
G10L15/00 |
Main classification number |
G10L15/00 |
Patent agency |
|
Agent |
|
Independent claim |
|
Address |
|