Unnormalized language processing method base on web mining
A technology of standard language and processing method, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as non-standard language, achieve the effect of easy operation and solve the problem of non-standard language
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0025] The main purpose is to invent a method for dealing with non-standard languages with minimal effort. Below is a further description of the present invention:
[0026] For the processing of typical non-standard language, the present invention adopts the pattern matching method based on the sequence covering algorithm. The specific implementation method is as follows: First, we need to deal with typical non-standard words. So in order to avoid being limited to a certain field, the data we collect cannot be concentrated in a certain field. For example, collect non-standard words related to this field in certain car forums or mobile phone forums. For the sake of fairness, the extracted data are all domain-independent. The following algorithm is employed to extract the rules identifying this non-canonical NIL.
[0027] 1) Training data set S, sen is an instance in S. The rule set R is initially empty. If the keyword contained in s is a non-standard word, it is marked ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com