
427 results about "Text mining" patented technology

Text mining, also called text data mining and roughly equivalent to text analytics, is the process of deriving high-quality information from text. This information is typically obtained by identifying patterns and trends through means such as statistical pattern learning. Text mining usually involves structuring the input text (typically by parsing it, adding some derived linguistic features, removing others, and inserting the result into a database), deriving patterns within the structured data, and finally evaluating and interpreting the output. "High quality" in text mining usually refers to some combination of relevance, novelty, and interest. Typical text mining tasks include text categorization, text clustering, concept/entity extraction, production of granular taxonomies, sentiment analysis, document summarization, and entity relation modeling (i.e., learning relations between named entities).
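As a rough illustration of the structure, derive, interpret sequence described above, the following minimal sketch tokenizes a few toy documents, scores terms with TF-IDF, and picks the most distinctive term per document. The documents, the function names `tokenize` and `tf_idf`, and the choice of TF-IDF as the pattern-deriving step are assumptions made for illustration only; real pipelines usually add parsing, linguistic features, and a database.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Structure raw text as a list of lowercase word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tf_idf(docs):
    """Return one {term: score} dict per document (documents must be non-empty)."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))
    scores = []
    for tokens in tokenized:
        term_freq = Counter(tokens)
        scores.append({
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in term_freq.items()
        })
    return scores


# Toy corpus (hypothetical data).
docs = [
    "the cat sat on the mat",
    "the dog chased the cat",
    "the bird sang",
]
scores = tf_idf(docs)
# Interpretation step: the highest-scoring term is the most distinctive one.
top_terms = [max(s, key=s.get) for s in scores]
```

A term that occurs in every document, such as "the" here, receives a score of zero, which is one simple way of expressing the "relevance, novelty, and interest" criterion.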

Semantic meaning-based specific task text keyword extraction method

Active | CN107193803A | Achieve characterization | Realize the characterization of semantic similarity | Semantic analysis | Special data processing applications | Semantic vector | Text mining
The invention discloses a semantics-based keyword extraction method for task-specific text, in the field of natural language processing. The method comprises the following steps: first, extracting a subject word for a specific task from related text and converting it into a semantic vector using a semantic representation technique; second, performing word segmentation, part-of-speech tagging, and screening on the text from which keywords are to be extracted, using a word segmentation tool; third, converting the screened words into semantic vectors and calculating the similarity between each screened word and the subject word of the task; and finally, constructing a word graph with the words as nodes and calculating the importance of each word on the basis of word similarity, so as to extract the important words in the graph. The method considers both the semantic and the structural features of the words in the text and is suited to task-oriented keyword extraction; it thereby obtains important information from text and provides technical support for fields such as text mining, natural language processing, and knowledge engineering.
Owner:北京东方科诺科技发展有限公司
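The steps in the abstract above (subject-word vector, similarity to each screened word, word graph, importance ranking) can be sketched roughly as follows. This is not the patented algorithm: the abstract does not specify how importance is computed, so the sketch swaps in a TextRank-style PageRank biased toward the subject word. The toy 3-dimensional vectors, the function name `extract_keywords`, and the `window` and `damping` parameters are all hypothetical, and the word list is assumed to be already segmented and screened.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def extract_keywords(words, vectors, topic_vector, top_k=3,
                     window=2, damping=0.85, iterations=50):
    nodes = sorted(set(words))
    # Similarity of each screened word to the subject word (clipped at 0),
    # normalized so it can serve as the restart distribution.
    relevance = {w: max(cosine(vectors[w], topic_vector), 0.0) for w in nodes}
    total = sum(relevance.values()) or 1.0
    bias = {w: relevance[w] / total for w in nodes}

    # Word graph: words are nodes; words co-occurring within `window`
    # positions are linked, weighted by their semantic similarity.
    weights = {w: {} for w in nodes}
    for i, w in enumerate(words):
        for other in words[i + 1:i + 1 + window]:
            if other != w:
                sim = max(cosine(vectors[w], vectors[other]), 0.0)
                if sim > 0:
                    weights[w][other] = sim
                    weights[other][w] = sim

    # Biased PageRank iteration over the weighted graph.
    score = {w: 1.0 / len(nodes) for w in nodes}
    for _ in range(iterations):
        updated = {}
        for w in nodes:
            incoming = sum(
                score[n] * wt / sum(weights[n].values())
                for n, wt in weights[w].items()
            )
            updated[w] = (1 - damping) * bias[w] + damping * incoming
        score = updated
    return sorted(nodes, key=lambda w: score[w], reverse=True)[:top_k]


# Hypothetical embeddings; a real system would use trained word vectors.
vectors = {
    "bank": [0.9, 0.1, 0.2],
    "loan": [1.0, 0.0, 0.1],
    "interest": [0.8, 0.2, 0.0],
    "river": [0.1, 0.0, 1.0],
    "water": [0.0, 0.1, 0.9],
}
topic_vector = [1.0, 0.0, 0.0]  # stands in for the subject word's vector
words = ["bank", "loan", "river", "interest", "bank", "water"]
keywords = extract_keywords(words, vectors, topic_vector)
```

Using the subject-word similarity as the restart distribution is one way to combine the semantic signal with the structural signal from the graph; other weightings are equally plausible readings of the abstract.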