Hybrid application method of two word segmentation machines based on SOLR search engine
A search engine and hybrid application technology, applied in the field of search engines, can solve problems such as inability to meet accurate word segmentation, and achieve the effect of improving usability
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Examples
Embodiment 1
[0061] The hybrid application method of the two word segmenters based on the SOLR search engine is to use the features of solr to integrate multiple word segmentation plug-ins when applying search engine searches, and use the fine-grained word segmentation method of IK Analyzer to create an index. When searching the index, the word segmentation method of mmseg4j is used to search.
[0062] The method steps are as follows:
[0063] Step 1. First, configure the IK Analyzer tokenizer in solr;
[0064] 1) In the configuration file schema.xml (location {SOLR_HOME} / config / ), the configuration information is as follows:
[0065]
[0066]
[0067]
[0068]
[0069]
[0070]
[0071] mode="complex" dicPath=" / hadoop / kbscloud / hhh / solr / credit / conf" / >
[0072]
[0073]
[0074]
[0075]
[0076]
[0077]
[0078] 2) IKAnalyzer.cfg.xml configuration file
[0079] Copy stopword.dic and IKAnalyzer.cfg.xml to the class root directory to en...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More