Substitution dictionary generating method and device
A technology of dictionaries and words, applied in the field of data search, can solve the problem of low accuracy and recall rate of replacing dictionaries
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0100] See figure 1 The method for generating a replacement dictionary provided in this embodiment specifically includes: operations 101 to 104.
[0101] In operation 101, a sentence pair resource is obtained.
[0102] Specifically, the sentence pair resource is composed of the query question sentence input by the user and the user clicked title part (here in bold font) words corresponding to the query question. These sentence pair resources can be obtained on the Internet. For example, using the Baidu search tool and the user enters teen movie, Baidu showed the following results:
[0103] Top 10teenage moviesfor girls of all time 2014–Squidoo
[0104] www.Squidoo.com> …> Movies> Blockbuster Movies ▼ translate this page
[0105] These are my favorite high school movies.It’s probably a bit juvenileof me.but I always love a good teenagemovie.And since I’m a girl.Iguess……
[0106] Ranking the 10Best Teen Films of 2013Thus Far|BlackBook
[0107] www.bbook.com / ranking-the-10-best-teen-fil...
Embodiment 2
[0159] Based on the foregoing embodiment, this embodiment provides another alternative dictionary generation method.
[0160] See image 3 The replacement dictionary generation method provided in this embodiment specifically includes: operation 201 to operation 208.
[0161] In operation 201, the sentence pair resource is obtained. For details, please refer to the description in the foregoing embodiment 1, which will not be repeated here.
[0162] In operation 202, the sentence is preprocessed to the resource.
[0163] This operation performs error correction processing, word segmentation processing, part-of-speech tagging, proper name recognition, word segmentation correction processing and data normalization processing on sentence resources. The above-mentioned preprocessing can filter out more wrong data in sentence pair resources, and avoid alignment errors caused by partial word segmentation errors. For example, prior to word segmentation processing, first perform error correcti...
Embodiment 3
[0198] See Figure 5 The replacement dictionary generation device provided in this embodiment specifically includes: an acquisition module 11, a rule alignment module 12, a statistical alignment module 13, and a generation module 14.
[0199] The obtaining module 11 is used to obtain sentence pair resources;
[0200] The rule alignment module 12 is configured to use prior knowledge of the language to perform regular alignment on the sentence pair resources to generate a first replacement dictionary;
[0201] The statistical alignment module 13 is used to perform statistical alignment on the remaining corpus in the sentence pair resource using the IBM model fused with prior knowledge of language to generate a second replacement dictionary; wherein, the remaining corpus is the sentence pair resource The remaining words after the regular alignment is performed by the regular alignment module;
[0202] The generating module 14 is configured to generate a third replacement dictionary avail...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



