Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Interactive segment extraction in computer-human interactive learning

A machine learning and segment technology, applied in the field of interactive segment extraction in human-computer interaction learning, which can solve problems such as decreased accuracy and no automatic understanding of information.

Active Publication Date: 2016-03-09
MICROSOFT TECH LICENSING LLC
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although Internet search engines often do a reasonable job of targeting title queries and documents, accuracy quickly drops to low levels because the information is not automatically understood by the search engine

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Interactive segment extraction in computer-human interactive learning
  • Interactive segment extraction in computer-human interactive learning
  • Interactive segment extraction in computer-human interactive learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] The approach described herein creates several engineering and scientific challenges, which are discussed below. Said challenges include:

[0032] a) Active annotation exploration

[0033] b) Automatic regularization and cold start

[0034] c) scaling with the number of entries and the number of classifiers

[0035] d) Active characterization

[0036] e) Segmentation and schematization

[0037] In a first aspect, a computer readable medium embodying computer usable instructions is provided for implementing a method of segment extraction by a user for a machine learning system. A collection of data entries is stored, where each data entry includes a plurality of tokens. A segment extractor is provided, the segment extractor trainable to recognize a segment in a data entry as a concept instance, where the segment includes a set of tokens. On the user interface, a concept hierarchy representing the concept is presented, wherein the concept hierarchy includes sub-conce...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A collection of data that is extremely large can be difficult to search and / or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.

Description

Background technique [0001] Collections of very large data can be difficult to search and / or analyze. For example, in the case of the web, a large portion of the data is unstructured and the values ​​are locked within the data itself. It is not enough to store the web page of the service provider. In order for the information to be useful, the information needs to be understood. A string of numbers that could be a model number, bank account, or phone number depending on the context. For example, in the context of a snowboard product, the string "length: 170, 175, 180cm" refers to 3 different snowboard lengths, not 1700 kilometers of skis. Interpreting data incorrectly can yield useless information. [0002] As an example, if a user enters the two words "mtor" and "stock" into an Internet search engine, and the results consist primarily of web pages related to the drug mTor, the search engine fails to recognize the search as a stock quote query. As another example, if a us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N99/00G06F17/27G06N20/00
CPCG06N20/00G06F16/951G06F40/242G06F40/30H04L1/0079H04L1/0072G06N7/01G06F3/0482
Inventor P·Y·西马德D·M·奇克林D·G·格朗吉耶D·X·查理L·布特欧S·A·阿默诗A·拉克希米拉坦C·G·J·苏亚雷斯
Owner MICROSOFT TECH LICENSING LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products