811 results about "Semi-structured data indexing" patented technology

Visual and interactive wrapper generation, automated information extraction from web pages, and translation into XML

A method and a system for extracting information from Web pages formatted with markup languages such as HTML [8] are described. Information patterns of interest are specified interactively and visually on visualized sample Web pages [5,6,16-29], and a method and data structure are provided for representing and storing these patterns [1]. A further method and system extract the information corresponding to a set of previously defined patterns from Web pages [2], and a method for transforming the extracted data into XML is described. Each pattern is defined via the (interactive) specification of one or more filters. Two or more filters for the same pattern contribute disjunctively to the pattern definition [3]; that is, a pattern describes the set of all targets specified by any of its filters. Relevant elements are extracted from Web pages by interpreting and executing a previously defined wrapper program of the above form on an input Web page [9-14], producing as output the extracted elements represented in a suitable data structure. A method and system for automatically translating said output into XML format, by exploiting the hierarchical structure of the patterns and by using pattern names as XML tags, is also described.
Owner: LIXTO SOFTWARE
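
The disjunctive filter semantics and the pattern-name-to-XML-tag translation described in this abstract can be illustrated with a small sketch. It is not the patented wrapper language: the `Pattern` class, the `has_class` helper, the `extract` and `wrap` functions, and the sample page are all hypothetical, and the sketch assumes the input page is well-formed enough to be parsed as XML.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field
from typing import Callable, List

# A filter is any predicate over a page element (hypothetical representation).
Filter = Callable[[ET.Element], bool]


@dataclass
class Pattern:
    name: str                      # reused as the XML tag of the output
    filters: List[Filter]          # combined disjunctively
    children: List["Pattern"] = field(default_factory=list)

    def matches(self, element: ET.Element) -> bool:
        # A target belongs to the pattern if ANY of its filters accepts it.
        return any(f(element) for f in self.filters)


def has_class(tag: str, cls: str) -> Filter:
    """Hypothetical filter: match elements by tag name and class attribute."""
    return lambda e: e.tag == tag and e.get("class") == cls


def extract(pattern: Pattern, context: ET.Element) -> List[ET.Element]:
    """Return one output element per match of `pattern` below `context`."""
    results = []
    for element in context.iter():
        if element is context or not pattern.matches(element):
            continue
        node = ET.Element(pattern.name)
        if pattern.children:
            # The pattern hierarchy becomes the XML nesting.
            for child in pattern.children:
                node.extend(extract(child, element))
        else:
            node.text = "".join(element.itertext()).strip()
        results.append(node)
    return results


PAGE = """<html><body><table>
<tr class="item"><td class="title">Lamp</td><td class="price">12 EUR</td></tr>
<tr class="offer"><td class="title">Desk</td><td class="price">80 EUR</td></tr>
<tr class="header"><td>Name</td><td>Price</td></tr>
</table></body></html>"""

# Two filters for "record": rows of class "item" OR rows of class "offer".
record = Pattern(
    "record",
    [has_class("tr", "item"), has_class("tr", "offer")],
    children=[
        Pattern("title", [has_class("td", "title")]),
        Pattern("price", [has_class("td", "price")]),
    ],
)


def wrap(page: str) -> str:
    """Run the wrapper on a page and serialize the result as XML."""
    out = ET.Element("document")
    out.extend(extract(record, ET.fromstring(page)))
    return ET.tostring(out, encoding="unicode")


print(wrap(PAGE))
```

Both the `item` row and the `offer` row end up as `<record>` elements because either filter is sufficient, while the header row matches neither filter and is skipped.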

Keyword based evaluation expert intelligent search and recommendation method

The invention discloses a keyword-based intelligent search and recommendation method for evaluation experts. The method comprises: step 1, segmenting the main text of the expert information into substring sequences, performing word segmentation with ICTCLAS (the Chinese Academy of Sciences segmenter), and filtering stop words from the segmentation result to obtain the word collection; step 2, extracting feature words of the expert information according to fields; step 3, building an expert knowledge representation model based on the fields and the weights of the feature words, and establishing an expert information index database; step 4, prompting automatically from a search term thesaurus when a user inputs keywords, while updating the search term thesaurus in real time through a search term counter; step 5, calculating the search relevance between the keywords and the expert information based on semantic information and the like; step 6, listing relevant experts from high to low according to the matching degree. With this method, intelligent full-text search and recommendation of expert information can be achieved through keyword input, so that experts matching a pending science and technology project can be found accurately.
Owner: HANGZHOU DIANZI UNIV
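
The six steps in this abstract can be sketched roughly as follows. This is a simplified illustration under several assumptions: whitespace tokenization stands in for ICTCLAS, the stop word list and field weights are invented, and relevance is a plain sum of field-weighted term counts rather than the semantic measure the abstract mentions. The `ExpertIndex` class and all of its method names are hypothetical.

```python
from collections import Counter, defaultdict

# Assumed values for illustration only; the abstract does not specify them.
STOP_WORDS = {"the", "of", "and", "in", "for", "a"}
FIELD_WEIGHTS = {"research_area": 3.0, "publications": 1.0}


def tokenize(text):
    """Steps 1-2 stand-in: whitespace split plus stop word filtering."""
    return [w for w in text.lower().split() if w not in STOP_WORDS]


class ExpertIndex:
    def __init__(self):
        # Step 3: inverted index, term -> expert -> accumulated weight.
        self.postings = defaultdict(lambda: defaultdict(float))
        # Step 4: counter of terms users have searched for.
        self.search_terms = Counter()

    def add_expert(self, name, fields):
        for field_name, text in fields.items():
            weight = FIELD_WEIGHTS.get(field_name, 1.0)
            for term, count in Counter(tokenize(text)).items():
                self.postings[term][name] += weight * count

    def suggest(self, prefix, limit=5):
        """Step 4: prompt with the most frequently searched matching terms."""
        prefix = prefix.lower()
        hits = [(t, c) for t, c in self.search_terms.items() if t.startswith(prefix)]
        hits.sort(key=lambda tc: (-tc[1], tc[0]))
        return [t for t, _ in hits[:limit]]

    def search(self, query):
        """Steps 5-6: score experts and list them from high to low."""
        terms = tokenize(query)
        self.search_terms.update(terms)  # real-time thesaurus update
        scores = defaultdict(float)
        for term in terms:
            for expert, weight in self.postings.get(term, {}).items():
                scores[expert] += weight
        return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


index = ExpertIndex()
index.add_expert("Dr. Li", {
    "research_area": "semantic search indexing",
    "publications": "indexing of semi-structured data",
})
index.add_expert("Dr. Wang", {
    "research_area": "power electronics",
    "publications": "search for faults in converters",
})

print(index.search("semantic indexing"))  # [('Dr. Li', 7.0)]
print(index.search("search"))             # [('Dr. Li', 3.0), ('Dr. Wang', 1.0)]
print(index.suggest("se"))                # ['search', 'semantic']
```

Weighting by field means a term found in an expert's research area counts for more than the same term found in a publication title, which is one simple way to read "based on the fields and the weights of the feature words".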

Associating objects in databases by rate-based tagging

Embodiments of the present invention provide automatic systems and methods for associating objects in databases of a web site by rate-based tagging. The frequencies with which users enter specific tag terms for objects stored in the databases of the web site are used to determine hard associations between objects and tag terms, and between objects. When the frequency of a user tag exceeds an established threshold, a hard association between the object and the tag term is established. Objects that have a hard association with a tag term are considered to be more clearly associated with that term. They should therefore be highlighted or featured in more prominent locations on web pages of the web site to increase users' confidence in the content of the web site. Hard-associated objects can be assigned greater weight, which makes them more likely to be selected for display in prominent locations. In addition, objects that have hard associations with tag terms can also have hard associations with one another through the tag terms they share. The hard association relationship between objects can be displayed through links to associated objects when an object is selected for display.
Owner: R2 SOLUTIONS
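
The thresholding idea in this abstract can be sketched as below. This is a minimal illustration rather than the patented system: the `TagStore` class, its method names, the threshold value, and the `boost` factor for display weight are all assumptions made for the example.

```python
from collections import Counter, defaultdict
from itertools import combinations


class TagStore:
    def __init__(self, threshold):
        self.threshold = threshold   # assumed: one global threshold
        self.counts = Counter()      # (object, tag) -> times users entered it

    def record(self, obj, tag):
        """Register that a user tagged `obj` with `tag`."""
        self.counts[(obj, tag.lower())] += 1

    def hard_tags(self):
        """Map each tag to the objects whose tag frequency exceeds the threshold."""
        result = defaultdict(set)
        for (obj, tag), n in self.counts.items():
            if n > self.threshold:
                result[tag].add(obj)
        return result

    def related_objects(self):
        """Objects sharing a hard-associated tag are hard-associated with each other."""
        links = defaultdict(set)
        for objs in self.hard_tags().values():
            for a, b in combinations(sorted(objs), 2):
                links[a].add(b)
                links[b].add(a)
        return links

    def display_weight(self, obj, base=1.0, boost=2.0):
        """Hypothetical weighting: hard-associated objects get a higher weight."""
        is_hard = any(obj in objs for objs in self.hard_tags().values())
        return base * boost if is_hard else base


store = TagStore(threshold=2)
for _ in range(3):
    store.record("photo1", "sunset")
    store.record("photo2", "Sunset")
store.record("photo3", "sunset")

print(dict(store.hard_tags()))           # 'sunset' maps to photo1 and photo2
print(store.related_objects()["photo1"])  # {'photo2'}
print(store.display_weight("photo1"), store.display_weight("photo3"))  # 2.0 1.0
```

Here `photo3` was tagged only once, so it stays below the threshold: it gets no hard association, no link to the other photos, and only the base display weight.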