Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

93 results about "Context-free grammar" patented technology

In formal language theory, a context-free grammar (CFG) is a certain type of formal grammar: a set of production rules that describe all possible strings in a given formal language. Production rules are simple replacements. For example, the rule A → α replaces A with α. There can be multiple replacement rules for any given value. For example, A → α A → β means that A can be replaced with either α or β.

System for discovering data artifacts in an on-line data object

A system for discovering data artifacts in an on-line data object is described. One embodiment includes a data acquisition subsystem configured to parse the on-line data object into at least one string; a string pre-parser configured to divide each string into a set of separate characters; a lexical analyzer configured, for each set of separate characters, to aggregate the separate characters in that set of separate characters into a sequence of tokens, each token in the sequence of tokens being one of a word, a punctuation symbol, a HyperText-Markup-Language tag, and a number; a syntax analyzer configured, for each sequence of tokens during a first analysis phase, to determine, for each of a plurality of rule sets, whether the sequence of tokens includes one or more candidate data artifacts of a distinct type to which that rule set corresponds, each of the plurality of rule sets being adapted to discovery of the distinct type of data artifact to which that rule set corresponds, at least one rule set in the plurality of rule sets including a context-free grammar; compute, for each candidate data artifact of a distinct type, a probability ranking indicating a degree of likelihood that the candidate data artifact is a data artifact of that distinct type; and classify each candidate data artifact as a data artifact of the distinct type for which a most favorable probability ranking was computed for that candidate data artifact, the syntax analyzer being configured to associate with each classified data artifact a subject found within the on-line data object; and a storage subsystem including at least one data structure in which to store the classified data artifacts, the storage subsystem being configured to index and organize the classified data artifacts by subject for retrieval in response to a search query indicating a particular subject.
Owner:PROQUO

Method for discovering data artifacts in an on-line data object

InactiveUS20080147588A1Web data indexingKnowledge representationRankingSubject indexing
A method for discovering data artifacts in an on-line data object is described. One embodiment parses the on-line data object into at least one string; divides each string into a set of separate characters; for each set of separate characters, aggregates the separate characters in that set of separate characters into a sequence of tokens, each token in the sequence of tokens being one of a word, a punctuation symbol, a HyperText-Markup-Language tag, and a number; for each sequence of tokens during a first analysis phase, determines, for each of a plurality of rule sets, whether the sequence of tokens includes one or more candidate data artifacts of a distinct type to which that rule set corresponds, each of the plurality of rule sets being adapted to discovery of the distinct type of data artifact to which that rule set corresponds, at least one rule set in the plurality of rule sets including a context-free grammar; computes, for each candidate data artifact of a distinct type, a probability ranking indicating a degree of likelihood that the candidate data artifact is a data artifact of that distinct type; and classifies each candidate data artifact as a data artifact of the distinct type for which a most favorable probability ranking was computed for that candidate data artifact; associates with each classified data artifact a subject found within the on-line data object; and stores the classified data artifacts in a storage subsystem that includes at least one data structure, the classified data artifacts in the storage subsystem being indexed and organized by subject for retrieval in response to a search query indicating a particular subject.
Owner:PROQUO
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products