Method for Cambodian named entity recognition based on cross-language resources
A named entity recognition and cross-language technology, applied in natural language data processing, special data processing applications, instruments, etc. It addresses the low accuracy of Cambodian named entity recognition and achieves effective recognition.
Embodiment 1
[0044] Embodiment 1: As shown in Figure 1, a method for Cambodian named entity recognition based on cross-language resources comprises the following steps:
[0045] Step1. Obtain an English-Cambodian bilingual parallel text corpus and a Cambodian monolingual text corpus;
[0046] Step2. Use the Word2vec tool to process the obtained Cambodian monolingual text corpus, producing a word vector for each Cambodian word in the corpus;
[0048] Step3. Compute the similarity between Cambodian monolingual words using the cosine similarity of their word vectors. Let the vectors of any two words in the Cambodian document be $w_i$ and $w_j$, where $w_i = (w_{i1}, w_{i2}, \ldots, w_{in})$ and $w_j = (w_{j1}, w_{j2}, \ldots, w_{jn})$; then the similarity between the two words is expressed as:
[0049] $$\mathrm{sim}(w_i, w_j) = \frac{\sum_{k=1}^{n} w_{ik}\, w_{jk}}{\sqrt{\sum_{k=1}^{n} w_{ik}^{2}}\;\sqrt{\sum_{k=1}^{n} w_{jk}^{2}}}$$
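The cosine similarity used in Step3 can be sketched in a few lines of Python. This is a minimal illustration only: the function name `cosine_similarity` and the choice to return 0.0 for a zero vector are assumptions, not part of the patent text.

```python
import math
from typing import Sequence


def cosine_similarity(w_i: Sequence[float], w_j: Sequence[float]) -> float:
    """Cosine of the angle between two word vectors of equal length n."""
    if len(w_i) != len(w_j):
        raise ValueError("word vectors must have the same dimension")
    # Numerator: sum over k of w_ik * w_jk
    dot = sum(a * b for a, b in zip(w_i, w_j))
    # Denominator: product of the two Euclidean norms
    norm_i = math.sqrt(sum(a * a for a in w_i))
    norm_j = math.sqrt(sum(b * b for b in w_j))
    if norm_i == 0.0 or norm_j == 0.0:
        # Assumption: treat a zero vector as having no similarity to anything
        return 0.0
    return dot / (norm_i * norm_j)
```

Vectors pointing in the same direction score 1, orthogonal vectors score 0, so words whose Word2vec vectors are close in direction are treated as similar.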
[0050] Step4. Align Cambodian words with English words using the standard word alignment technology, the IBM model ...
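The text of Step4 is truncated here and does not say which IBM model is used. Purely as an illustration of IBM-style word alignment, the sketch below trains IBM Model 1 with expectation-maximization on a toy parallel corpus. The function names `train_ibm_model1` and `align`, the omission of the NULL word, and the placeholder tokens `K1` to `K4` standing in for Cambodian words are all assumptions made for this example.

```python
from typing import Dict, List, Tuple

Pair = Tuple[List[str], List[str]]  # (Cambodian tokens, English tokens)


def train_ibm_model1(pairs: List[Pair], iterations: int = 20) -> Dict[Tuple[str, str], float]:
    """Estimate t[(f, e)] = P(f | e) with EM (IBM Model 1, no NULL word)."""
    src_vocab = {f for fs, _ in pairs for f in fs}
    uniform = 1.0 / len(src_vocab)
    # Initialise every co-occurring (f, e) pair uniformly
    t = {(f, e): uniform for fs, es in pairs for f in fs for e in es}
    for _ in range(iterations):
        count: Dict[Tuple[str, str], float] = {}
        total: Dict[str, float] = {}
        # E-step: collect expected alignment counts
        for fs, es in pairs:
            for f in fs:
                z = sum(t[(f, e)] for e in es)
                for e in es:
                    c = t[(f, e)] / z
                    count[(f, e)] = count.get((f, e), 0.0) + c
                    total[e] = total.get(e, 0.0) + c
        # M-step: renormalise the counts into probabilities
        for (f, e), c in count.items():
            t[(f, e)] = c / total[e]
    return t


def align(fs: List[str], es: List[str], t: Dict[Tuple[str, str], float]) -> List[Tuple[str, str]]:
    """Link each source word to the English word with the highest P(f | e)."""
    return [(f, max(es, key=lambda e: t.get((f, e), 0.0))) for f in fs]


# Toy corpus: K1..K4 are hypothetical placeholders, not real Cambodian words
toy_pairs = [
    (["K1", "K2"], ["the", "house"]),
    (["K1", "K3"], ["the", "book"]),
    (["K4", "K3"], ["a", "book"]),
]
table = train_ibm_model1(toy_pairs)
links = align(["K1", "K2"], ["the", "house"], table)
```

On this toy corpus the co-occurrence pattern is enough for EM to pair `K1` with "the" and `K2` with "house"; a real system would train on the full English-Cambodian parallel corpus from Step1.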