Methods for generating natural language processing systems

a natural language processing and model technology, applied in the field of processing data, can solve the problems of inability to program computers to process human-readable language, inability to accurately and accurately predict the process of human-readable language, and difficulty in generating the model, etc., and achieve the effect of low degree of certainty

Active Publication Date: 2016-06-09
100 CO GLOBAL HLDG LLC
View PDF24 Cites 281 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0018]In some embodiments, another method for updating a natural language model is presented. This method may include: utilizing the natural language model to identify topical content of untested data and to classify said untested data into at least two topical nodes of a hierarchical data structure according to the identified topical content of the untrained data, the hierarchical data structure comprising the at least two topical nodes, wherein the at least two topical nodes represent partitions organized by two or more topical themes among the topical content of the untested data within which the untested data is to be subdivided into; determining that the natural language model classifies at least a subset of the untested data into the at least two topical nodes with a low degree of certainty; and modifying the natural language model with updated data, the updated data comprising a subset of the untested data that the natural language model has classified with a low degree of certainty.
[0019]In some embodiments, a non-transitory computer readable medium is presented comprising instructions that, when executed by a processor, cause the processor to perform operations comprising: ingesting training data representative of documents to be analyzed by a natural...

Problems solved by technology

However, programming computers to process human-readable language has proven to be far more difficult than imagined, particularly as languages continue to change and evolve, and the meaning of words and phrases are more ambiguous and nuanced than assumed.
A ...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Methods for generating natural language processing systems
  • Methods for generating natural language processing systems
  • Methods for generating natural language processing systems

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039]Example methods, apparatuses, and systems (e.g., machines) are presented for generating natural language models.

[0040]The modes of human communications brought upon by digital technologies have created a deluge of information that can be difficult for human readers to handle alone. Companies and research groups may want to determine trends in the human communications to determine what people generally care about for any particular topic, whether it be what car features are being most expressed on Twitter®, what political topics are being most expressed on Facebook®, what people are saying about the customer's latest product in their customer feedback page, what are the key categories written about in a large body of legal documents, and so forth. In some cases, the companies or the research groups may want to determine what are the general topics being talked about, to begin with. It may be desirable for companies to aggregate and then synthesize the thousands or even millions...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Methods are presented for generating a natural language model. The method may comprise: ingesting training data representative of documents to be analyzed by the natural language model, generating a hierarchical data structure comprising at least two topical nodes within which the training data is to be subdivided into by the natural language model, selecting a plurality of documents among the training data to be annotated, generating an annotation prompt for each document configured to elicit an annotation about said document indicating which node among the at least two topical nodes said document is to be classified into, receiving the annotation based on the annotation prompt; and generating the natural language model using an adaptive machine learning process configured to determine patterns among the annotations for how the documents in the training data are to be subdivided according to the at least two topical nodes of the hierarchical data structure.

Description

CROSS REFERENCES TO RELATED APPLICATIONS[0001]This application claims the benefits of U.S. Provisional Application 62 / 089,736, filed Dec. 9, 2014, and titled, “METHODS AND SYSTEMS FOR ANNOTATING NATURAL LANGUAGE PROCESSING,” U.S. Provisional Application 62 / 089,742, filed Dec. 9, 2014, and titled, “METHODS AND SYSTEMS FOR IMPROVING MACHINE PERFORMANCE IN NATURAL LANGUAGE PROCESSING,” U.S. Provisional Application 62 / 089,745, filed Dec. 9, 2014, and titled, “METHODS AND SYSTEMS FOR IMPROVING FUNCTIONALITY IN NATURAL LANGUAGE PROCESSING,” and U.S. Provisional Application 62 / 089,747, filed Dec. 9, 2014, and titled, “METHODS AND SYSTEMS FOR SUPPORTING NATURAL LANGUAGE PROCESSING,” the disclosures of which are incorporated herein by reference in their entireties and for all purposes.[0002]This application is also related to US non provisional applications (Attorney Docket No. 1402805.00007_IDB007), titled “ARCHITECTURES FOR NATURAL LANGUAGE PROCESSING,” (Attorney Docket No. 1402805.00012_I...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/24G06F17/22G06F17/28
CPCG06F17/241G06F17/2241G06F17/28G06Q50/01G06F16/35G06F16/93G06F16/288G06F16/367G06F16/3329G06F40/169G06F40/30G06F16/243G06F16/285G06F16/951G06F16/24532G06F40/40G06F40/42G06F40/137G06F40/221G06N20/00G06F3/0482
Inventor MUNRO, ROBERT J.ERLE, SCHUYLER D.WALKER, CHRISTOPHERLUGER, SARAH K.BRENIER, JASONKING, GARY C.TEPPER, PAUL A.MECHANIC, ROSSGILCHRIST-SCOTT, ANDREWLONG, JESSICA D.ROBINSON, JAMES B.CALLAHAN, BRENDAN D.CASBON, MICHELLESARIN, UJJWALNAIR, ANEESHBASAVARAJ, VEENASAXENA, TRIPTINUNEZ, EDGARHINRICHS, MARTHA G.MOST, HALEYSCHNOEBELEN, TYLER J.
Owner 100 CO GLOBAL HLDG LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products