49 results about "Seed entity" patented technology

Recognition method and system of named entities in microblog messages

The invention provides a method for recognizing named entities in microblog messages. A few named entities are specified as seeds; a certain number of microblog messages from the original set of messages to be processed are automatically labeled to form a training data set; the training data set is then used to train a named entity recognizer, and the trained recognizer is used to recognize the named entities in the microblog messages. Only a few existing seed entities need to be specified for a high-quality training set to be labeled automatically, which significantly reduces labor costs for microblog messages, a kind of text that is updated rapidly. High-quality labeled data is generated step by step in an iterative manner: in each round, the first N new named entities that best reflect how named entities appear in real microblog data are added to the seed bank, so that the final labeled data covers the whole microblog message set well.
Owner:INST OF COMPUTING TECH CHINESE ACAD OF SCI
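
The automatic labeling step in the abstract above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented procedure: the BIO tag scheme, the single `ENT` type, whitespace-separated tokens and the function name `label_with_seeds` are all hypothetical, and the iterative selection of the first N new entities is not shown.

```python
from typing import Iterable, List


def label_with_seeds(tokens: Iterable[str], seeds: Iterable[str]) -> List[str]:
    """Tag tokens with B-ENT / I-ENT / O by exact matching against seed entities.

    Assumption: seeds are whitespace-separated phrases and longer seeds win.
    """
    tokens = list(tokens)
    tags = ["O"] * len(tokens)
    # Try longer seeds first so "New York" is preferred over a shorter overlap.
    seed_token_lists = sorted((s.split() for s in seeds), key=len, reverse=True)
    i = 0
    while i < len(tokens):
        for seed in seed_token_lists:
            if seed and tokens[i:i + len(seed)] == seed:
                tags[i] = "B-ENT"
                for k in range(i + 1, i + len(seed)):
                    tags[k] = "I-ENT"
                i += len(seed)
                break
        else:
            # No seed starts at this position; move on by one token.
            i += 1
    return tags


example_tags = label_with_seeds("I love New York and Paris".split(), {"New York", "Paris"})
```

Messages labeled this way would then serve as training data for a recognizer; which recognizer is used is left open here, as it is in the abstract.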

Method for cultivating lucid ganoderma

The invention provides a method for cultivating lucid ganoderma, relating to methods for cultivating edible fungi and in particular to a method for cultivating lucid ganoderma with a new cultivating material. The cultivating material contains eucalyptus, preferably eucalyptus bark, eucalyptus branches, or a mixture of the two. The method selects eucalyptus, which is not usually regarded as suitable for cultivating lucid ganoderma, as the cultivating material and obtains lucid ganoderma seed entities (fruiting bodies) that grow faster and have almost the same effective constituents as lucid ganoderma. Furthermore, eucalyptus grows fast, has a short rotation period and a high yield, so it is cheaper than other broad-leaved trees. Because other constituents such as cotton seed hulls and bagasse do not need to be added, the method is comparatively inexpensive.
Owner:GUANGDONG YUEWEI EDIBLE FUNGI TECH +1

Iterative entity alignment model

The invention provides an entity alignment method and device, aiming to solve the problem that traditional knowledge alignment methods usually require cumbersome manual labor or elaborate manual construction. The method comprises the following steps: acquiring vector representations of the entities in a first knowledge graph and in a second knowledge graph separately; binding the vector representations of entities with the same meaning in the two knowledge graphs according to an input set of alignment seed entity pairs, so as to obtain a third knowledge graph; and iteratively calculating the similarity among the vector representations of the entities according to the third knowledge graph and the vector representations of the entities in the first and second knowledge graphs. During the iterative calculation, if the distance between a pair of entity vector representations is less than a threshold, the entities corresponding to that pair are added to the third knowledge graph, until the number of entities in the third knowledge graph no longer increases. The entity alignment method and device are highly practical.
Owner:TSINGHUA UNIV

Synthetic method of truffle and bacteriorhiza

The invention provides a method for synthesizing Chinese truffles and mycorrhiza of China pinus montana. The steps are as follows: screening of China pinus montana seeds and truffle seed entities (fruiting bodies); culture of aseptic seedlings; preparation of microbial inoculum; arrangement, optimal selection and inoculation of culture substrates; culture of mycorrhizal seedlings; anatomical examination of mycorrhizal morphology; molecular detection and confirmation; and transplanting culture, through which the culture of truffles is finally realized.
Owner:KUNMING INST OF BOTANY - CHINESE ACAD OF SCI

A method for dynamically updating knowledge map

The invention discloses a method for dynamically updating knowledge map, which is used for solving the synchronization problem between an encyclopedia knowledge map and a data source thereof. The invention takes the hot content on the World Wide Web as a starting point. Named entities are extracted as seed entities from which updates are likely to occur. Other entities associated with the seed entity are then captured on the encyclopedia site as extension entities. Then, a certain number of entities are obtained from the encyclopedia website to carry out feature engineering, and the update information of entity feature representation is mined by machine learning algorithm, and the predictor is constructed. Entities with high update probability are selected from the extended entities by using the predictor. Finally, the dynamic updating of knowledge map is realized by using seed entities and extended entities with high updating probability as updating objects under the circumstance of limited access to data sources.
Owner:SOUTHEAST UNIV

Method for culturing artificial cordyceps sinensis by using silkworm pupas as carriers

A method for culturing artificial cordyceps sinensis by using silkworm pupas as carriers inoculates live silkworm pupas with a bacterial suspension prepared from wild natural cordyceps sinensis strains native to the Tibet plateau and artificially cultures the pupas into silkworm pupa cordyceps sinensis. The method comprises the following steps: (1) screening of cordyceps sinensis bacterial strains, (2) preparation of the silkworm pupas, (3) preparation of bacterial suspension, (4) preparation of silkworm pupas before inoculation, (5) inoculation, (6) culturing of seed seats, and (7) collection and storage of silkworm pupa cordyceps sinensis. In this method, pupas of domesticated silkworms serve as hosts, the cordyceps sinensis strain suspension is artificially inoculated into the pupas, and the pupas are then cultured into silkworm pupa cordyceps sinensis. The inoculation rate and the petrification rate after inoculation are above 96%, and the yield of seed entities (fruiting bodies) is above 92%. The period from inoculation to maturity of the seed entities is 36-44 days. The chemical composition of the silkworm pupa cordyceps sinensis is basically the same as that of cordyceps sinensis, and its content of active ingredients such as cordycepin and cordycepic acid is 3.2-3.9 times that of cordyceps sinensis.
Owner:凌中鑫

Method and device for extracting open category named entity by means of random walking on map

The invention discloses a method for extracting an open category named entity by means of random walking on a map. The method comprises the steps that 1, a context, on a corpus, of a seed is analyzed to obtain a template; 2, the template is used for extracting a candidate entity from the corpus; 3, a map is structured according to the relation among a seed entity, the template and the candidate entity; 4, the confidence coefficient of the candidate entity is computed through the random walking algorithm on the map. The method can overcome the adverse effects on the computation of confidence coefficient of the candidate entity caused by different qualities of the template, and effectively improve the accuracy of extraction of the open category named entity. Experiments prove that the average accuracy of an extraction result is improved by 4.36%.
Owner:INST OF AUTOMATION CHINESE ACAD OF SCI
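
The scoring idea in step 4 of the abstract above can be illustrated with a small sketch. The abstract does not say which random walk variant is used, so the random walk with restart below, the restart probability, the iteration count and all node names are assumptions chosen only for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def random_walk_confidence(
    edges: Iterable[Tuple[str, str]],
    seeds: Iterable[str],
    restart: float = 0.15,
    iterations: int = 50,
) -> Dict[str, float]:
    """Score every node of an undirected seed/template/candidate graph.

    The walker restarts at a seed with probability `restart`; otherwise it
    moves to a uniformly chosen neighbor. Scores approximate visit frequency.
    """
    neighbors = defaultdict(set)
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)
    nodes = sorted(neighbors)
    seed_nodes = [s for s in seeds if s in neighbors]
    restart_mass = {
        n: (1.0 / len(seed_nodes) if n in seed_nodes else 0.0) for n in nodes
    }
    score = dict(restart_mass)
    for _ in range(iterations):
        nxt = {n: restart * restart_mass[n] for n in nodes}
        for n in nodes:
            share = (1.0 - restart) * score[n] / len(neighbors[n])
            for m in neighbors[n]:
                nxt[m] += share
        score = nxt
    return score


# Hypothetical graph: one template shared by both seeds, one matched by a single seed.
toy_edges = [
    ("Paris", "tpl:capital_of"), ("London", "tpl:capital_of"), ("Berlin", "tpl:capital_of"),
    ("Paris", "tpl:visited"), ("Monday", "tpl:visited"),
]
toy_scores = random_walk_confidence(toy_edges, ["Paris", "London"])
```

In this toy graph the candidate reached through the template supported by both seeds ends up with a higher score than the candidate reached through the template supported by only one seed, which is the kind of template-quality effect the abstract describes.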

Entity set extension method

Active | CN104794163A | Guaranteeing the efficiency of collection expansion | Easy to handle | Special data processing applications | Seed entity | User input
The invention provides an entity set extension method. The method comprises the following steps: a seed entity set input by a user is acquired, and the attribute information corresponding to each seed entity is determined in an RDF knowledge base according to the entity name of each seed entity in the set; according to that attribute information, the attributive characters shared by the seed entity set are determined, and the other entities in the RDF knowledge base that have the same attributive characters are collected to form an extension entity set; the entities of the extension entity set are then added to the seed entity set to obtain an extended entity set. Because the RDF knowledge base uses structured XML data, a server can mine the semantic information among the seed entities, so the extension result is more intelligent and accurate and the efficiency of entity set extension can be guaranteed.
Owner:RENMIN UNIVERSITY OF CHINA
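
As a rough illustration of the extension step described above, the sketch below replaces the RDF knowledge base with a plain in-memory dictionary. The data layout, the function name `expand_entity_set` and the rule "keep entities that carry every attribute shared by all seeds" are assumptions made for the example, not details taken from the patent.

```python
from typing import Dict, Iterable, Set, Tuple

# Hypothetical stand-in for an RDF knowledge base: entity -> {(attribute, value)}.
KnowledgeBase = Dict[str, Set[Tuple[str, str]]]


def expand_entity_set(kb: KnowledgeBase, seeds: Iterable[str]) -> Set[str]:
    """Add every entity that has all attribute/value pairs shared by the seeds."""
    seed_set = set(seeds)
    known = [s for s in seed_set if s in kb]
    if not known:
        return seed_set
    # Attributive characters common to all seeds found in the knowledge base.
    shared = set.intersection(*(kb[s] for s in known))
    if not shared:
        return seed_set
    extension = {
        entity for entity, attrs in kb.items()
        if shared <= attrs and entity not in seed_set
    }
    return seed_set | extension


toy_kb: KnowledgeBase = {
    "Beijing": {("type", "city"), ("country", "China")},
    "Shanghai": {("type", "city"), ("country", "China")},
    "Guangzhou": {("type", "city"), ("country", "China"), ("province", "Guangdong")},
    "Paris": {("type", "city"), ("country", "France")},
}
expanded = expand_entity_set(toy_kb, ["Beijing", "Shanghai"])
```

Here the two seeds share the pairs `("type", "city")` and `("country", "China")`, so only "Guangzhou" is added and "Paris" is left out.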

Grifola frondosa culture material

The invention discloses a Grifola frondosa culture material. Forest land surface soil with content in weight percent being 10-30% is added in regular culture material. Due to adding forest land surface soil, the culture material in the invention can prevent Grifola frondosa from yield reduction owing to yellow water exudation, can keep moisture in the culture material well, thus increasing yield of Grifola frondosa seed entity, enhancing biological efficiency and having good popularization value.
Owner:SHANGHAI ACAD OF AGRI SCI +2

Nested entity data identification method and device, and electronic equipment

The invention discloses a nested entity data identification method and device, and electronic equipment, and relates to the technical field of data identification. The method comprises the steps of permuting and combining seed entity vocabularies of different entity categories to generate a short text data set; defining, for each short text in the data set, at least one entity category label and the start and end index information of the sub-text corresponding to each entity category label in the short text; training a deep learning recognition model by using the labeled short text data set as a training set; and recognizing nested entity data by using the recognition model once it has been trained to the required standard. Because the entity labeling information of a statement is defined by start and end indexes plus entity category labels, multi-nested entity content labeling is simpler to implement, the labeling process and workload of nested entity recognition are optimized, and time and labor costs are saved. As a result, the identification efficiency and accuracy of nested entity data can be improved.
Owner:BEIJING PERFECT WORLD SOFTWARE TECH DEV CO LTD
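
The data-generation step above might look something like the following sketch. It is a simplification under stated assumptions: only one ordering of the categories is combined (the abstract speaks of permuting and combining), the space separator and the optional `whole_label` used to show a nested span are hypothetical, and no model training is shown.

```python
from itertools import product
from typing import Dict, List, Optional


def build_labeled_short_texts(
    seed_vocab: Dict[str, List[str]],
    whole_label: Optional[str] = None,
    sep: str = " ",
) -> List[dict]:
    """Combine one seed word per category and record [start, end) spans."""
    categories = list(seed_vocab)
    samples = []
    for words in product(*(seed_vocab[c] for c in categories)):
        text = sep.join(words)
        labels, cursor = [], 0
        for category, word in zip(categories, words):
            labels.append({"label": category, "start": cursor, "end": cursor + len(word)})
            cursor += len(word) + len(sep)
        if whole_label is not None:
            # Outer span enclosing the inner ones, which makes the sample nested.
            labels.append({"label": whole_label, "start": 0, "end": len(text)})
        samples.append({"text": text, "labels": labels})
    return samples


toy_vocab = {"LOC": ["Beijing", "Shanghai"], "ORG_TYPE": ["University"]}
toy_samples = build_labeled_short_texts(toy_vocab, whole_label="ORG")
```

Each sample keeps the text together with its category labels and character offsets, so the inner span "Beijing" and the outer span "Beijing University" can both be labeled in one record.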

Bootstrap Chinese entity extracting method based on modes

The invention discloses a bootstrap Chinese entity extracting method based on modes. Starting from a small number of seed entities, inner modes of the entities and outer modes of the entities, more entities and modes are learned iteratively from a corpus. The method combines statistics and modes and does not depend on a large amount of manually annotated corpus data or on a domain mode base. Compared with current mode bootstrap methods, it observes the entity type modes of a specific field and uses the inner modes and characteristics of the entities to score candidate modes and entities that cannot be marked accurately, which improves the precision of the mode and entity scores; the method is thus applicable to entity extraction and knowledge base construction in specific fields.
Owner:THE 28TH RES INST OF CHINA ELECTRONICS TECH GROUP CORP

Entity set expansion method and apparatus

Active | CN106951526A | Improve effectiveness | Reflect specific common features | Special data processing applications | NODAL | Seed entity
Embodiments of the invention provide an entity set expansion method and apparatus. The method comprises the steps of extracting candidate entities from a target knowledge graph to form a candidate entity set according to a pre-determined seed entity set; determining meta-paths between seed entities from a heterogeneous information network corresponding to the target knowledge graph, wherein each meta-path is a connection path, consisting of an entity type and a relational type, between two node types in the heterogeneous information network, and the two node types are node types corresponding to different seed entities; determining a first importance degree of each meta-path according to a quantity of seed entity pairs connected by each meta-path; according to the first importance degree of each meta-path, determining a second importance degree of each candidate entity in the candidate entity set; and determining the candidate entity with the second importance degree meeting a first preset condition in the candidate entity set as a to-be-expanded entity, and adding the to-be-expanded entity to the seed entity set. By applying the method and the apparatus, effective entity set expansion can be carried out.
Owner:BEIJING UNIV OF POSTS & TELECOMM
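
A toy version of the two importance degrees in the abstract above is sketched here. The first degree follows the abstract (the number of seed entity pairs a meta-path connects). How the second degree is derived from the first is not spelled out, so the weighted count used below is an assumption, as are the precomputed `connections` structure and all names.

```python
from itertools import combinations
from typing import Dict, FrozenSet, Iterable, Set, Tuple


def rank_candidates(
    connections: Dict[str, Set[FrozenSet[str]]],
    seeds: Iterable[str],
    candidates: Iterable[str],
) -> Tuple[Dict[str, int], Dict[str, int]]:
    """connections[meta_path] holds the unordered entity pairs linked by that path."""
    seed_list = sorted(set(seeds))
    seed_pairs = [frozenset(p) for p in combinations(seed_list, 2)]
    # First importance degree: how many seed pairs each meta-path connects.
    path_weight = {
        path: sum(1 for pair in seed_pairs if pair in pairs)
        for path, pairs in connections.items()
    }
    # Second importance degree (assumed form): weighted number of seeds
    # that a candidate reaches through each meta-path.
    scores = {}
    for cand in candidates:
        scores[cand] = sum(
            weight * sum(1 for s in seed_list if frozenset((cand, s)) in connections[path])
            for path, weight in path_weight.items()
        )
    return path_weight, scores


toy_connections = {
    "co_star": {frozenset(("A", "B")), frozenset(("A", "C")), frozenset(("B", "C"))},
    "same_city": {frozenset(("A", "D"))},
}
toy_weights, toy_ranking = rank_candidates(toy_connections, {"A", "B"}, ["C", "D"])
```

Candidates whose score passes some preset condition would then be added to the seed entity set; the condition itself is left out of this sketch.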

Entity reference item identification method based on topic model and semantic analysis

The invention discloses an entity reference item recognition method based on a topic model and semantic analysis. The method comprises the following steps: step 1, carrying out sentence segmentation, word segmentation, part-of-speech tagging and dependency analysis on an input corpus; step 2, based on syntactic analysis, obtaining noun phrases with complete boundaries as a candidate set of entity reference items, and then using an LDA topic model together with a TF-IDF statistical algorithm to filter non-entity reference items from the candidate set; and step 3, measuring the semantic similarity between the entity reference items and the seed entities, selecting the seed categories with high similarity as entity categories, and then classifying the entity reference items of each entity category into the corresponding reference item categories by using rules built from shallow syntactic knowledge. The method can improve the effectiveness of entity boundary detection and classification.
Owner:INST OF ELECTRONICS & INFORMATION ENG OF UESTC IN GUANGDONG

Named entity identification method and device, equipment and storage medium

The invention discloses a named entity identification method and device, equipment and a storage medium, and relates to the fields of natural language processing, semantic analysis and understanding, artificial intelligence and the like. The method comprises the steps of performing entity identification on new domain text data to obtain new domain seed entity words; labeling the new domain text data according to the new domain seed entity words to obtain labeled new domain text data; training a named entity recognition model by using the labeled new domain text data to obtain a named entity recognition model suitable for the new domain; and identifying entity words in other text data of the new domain by using that model. According to the embodiment of the invention, the data annotation workload can be reduced, the threshold for model migration training is lowered, and the domain universality of the algorithm is improved.
Owner:ZTE CORP

Entity alignment method based on weighted neighbor information coding

The invention discloses an entity alignment method based on weighted neighbor information coding. The method specifically comprises the following steps: 1) preprocessing the data in the two knowledge bases to be aligned, and extracting the triples of the two knowledge bases, the entities and their neighbor information, and the entities and their type information; 2) based on all currently discovered matching entity pairs, obtaining a vector representation for each entity through triple-based knowledge representation learning, weighted neighbor information coding and cross-knowledge-base entity-type graph embedding; 3) inferring matching entity pairs by combining the three different vector representations of each entity; and 4) forming new training data from the discovered matching entity pairs and the a priori aligned seed entity pairs, and repeating steps 1)-4) until the specified number of iterations is reached, then outputting the discovered matching entity pairs. With this method, entities that appear in few triples can be matched more accurately, and the method has wide application prospects in fields such as knowledge fusion and knowledge question answering.
Owner:ZHEJIANG UNIV

Automatic generation of domain models for virtual personal assistants

Technologies for automatic domain model generation include a computing device that accesses an n-gram index of a web corpus. The computing device generates a semantic graph of the web corpus for a relevant domain using the n-gram index. The semantic graph includes one or more related entities that are related to a seed entity. The computing device performs similarity discovery to identify and rank contextual synonyms within the domain. The computing device maintains a domain model including intents representing actions in the domain and slots representing parameters of actions or entities in the domain. The computing device performs intent discovery to discover intents and intent patterns by analyzing the web corpus using the semantic graph. The computing device performs slot discovery to discover slots, slot patterns, and slot values by analyzing the web corpus using the semantic graph. Other embodiments are described and claimed.
Owner:INTEL CORP

Word classification model training method based on artificial intelligence and word processing method and device

The invention provides a word classification model training method and device based on artificial intelligence, a word processing method and device, electronic equipment and a storage medium. The method comprises the steps of obtaining a seed entity word set composed of a plurality of seed entity words, wherein the plurality of seed entity words belong to a to-be-mined entity type; combining any two seed entity words in the seed entity word set to obtain a positive example sample pair; obtaining a historical text that includes the seed entity words, and constructing a negative example sample pair according to the seed entity words and the historical text excluding the seed entity words; and updating a word classification model through the positive example sample pair and the negative example sample pair, wherein the updated word classification model is used for determining the probability that entity words to be recognized belong to the entity type to be mined. Through the method and the device, the richness of model training samples can be improved, the corpus annotation cost required by entity mining is reduced, and the training effect of the word classification model can also be improved.
Owner:TENCENT TECH (SHENZHEN) CO LTD
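
The sample construction described above could be sketched as below. The positive pairs follow the abstract directly (any two seed entity words). For the negative pairs, the reading used here is one plausible interpretation only: each seed word is paired with a non-seed token of the historical text. Whitespace tokenization and the 1/0 labels are likewise assumptions.

```python
from itertools import combinations
from typing import Iterable, List, Tuple

Pair = Tuple[str, str, int]


def build_sample_pairs(seed_words: Iterable[str], historical_text: str) -> Tuple[List[Pair], List[Pair]]:
    """Return (positive_pairs, negative_pairs) with labels 1 and 0."""
    seeds = sorted(set(seed_words))
    # Positive examples: every unordered pair of seed entity words.
    positives = [(a, b, 1) for a, b in combinations(seeds, 2)]
    # Negative examples: a seed word paired with a non-seed token of the text.
    other_words = sorted({w for w in historical_text.split() if w not in seeds})
    negatives = [(s, w, 0) for s in seeds for w in other_words]
    return positives, negatives


toy_pos, toy_neg = build_sample_pairs(
    ["apple", "banana", "cherry"],
    "apple trees grow near banana farms",
)
```

Both lists could then be fed to whatever pairwise classifier is being updated; the model itself is outside the scope of this sketch.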

Knowledge graph fusion method based on entity sequence coding

The invention discloses a knowledge graph fusion method based on entity sequence coding. The method comprises the steps of 1, knowledge graph entity representation learning; 2, selecting a path code and an alignment model; 3, performing cross-language entity alignment on the model, wherein in the source language knowledge graph space, a two-hop sequence corresponding to other seed entities is constructed for one entity, a sequence possibly corresponding to the entity is constructed in the target language knowledge graph space, an alignment sequence with the highest probability is found out, and then a node at the same position is found out from the alignment sequence to serve as an alignment node of the node; 4, adding a new candidate seed node. Aiming at the problem of insufficient training corpus of a deep learning model in the prior art, the method based on entity path representation learning is put forward.
Owner:BEIHANG UNIV

Pumpkin-glossy ganoderma intercropping method for repairing organic polluted soil

The invention discloses a pumpkin-glossy ganoderma intercropping method for repairing organic polluted soil, which comprises the following steps of: making a furrow on the organic polluted soil, arranging drainage ditches on four sides of the furrow, digging a glossy ganoderma bed in the furrow, opening a culture bag, vertically arranging the culture bag in the furrow bed in a mode that the inoculation surface is upward, covering the bag with soil, filling gaps by using soil, and keeping the soil moist; continuously culturing the culture bag in the open air and promoting the culture bag to grow the glossy ganoderma naturally; and transplanting vigorous pumpkin seedlings on two sides of the glossy ganoderma bed, constructing a pergola by using a plurality of pumpkin seedlings, shading the glossy ganoderma seed entities by the grown pumpkin stems and leaves, and providing CO2 by the glossy ganoderma seed entities to promote the photosynthesis of the pumpkin plants. The intercropping system promotes degradation or removes the organic polluted soil by using the pumpkin roots, the glossy ganoderma culture bag and the spore powder till the glossy ganoderma seed entities are mature and the growth period of the pumpkin is finished. The method can effectively realize efficient repair of the organic polluted soil, realizes safe production of the pumpkin and the glossy ganoderma at the same time of repairing, and is particularly suitable for control of repair of the polycyclic aromatic hydrocarbon, polychlorinated biphenyl and phthalate ester polluted soil.
Owner:INST OF SOIL SCI CHINESE ACAD OF SCI

Schizophyllum anoxic-resistant fermented product and its fermentation method and culture medium

The invention discloses an anoxic-resistant fermented product produced by using schizophyllum as a starting strain, as well as a fermentation production method and a culture medium. The main raw materials of the culture medium provided by the present invention include the seed body of any one of highland barley, barley, wheat and corn cereal crops, the powder body of the fruit body, the powder juice of the fruit body, the bud body of the fruit body, and the fruit body of the fruit body. Bud juice of bud powder or fruit body; the fermentation method can be liquid aerated fermentation, liquid static fermentation or solid state static fermentation. The anoxic-resistant fermented product provided by the present invention can improve the hypoxia tolerance of the body and helps to reduce and alleviate a series of hypoxic symptoms such as altitude sickness. It is especially suitable for troops advancing rapidly to or stationed on the plateau during training, patrols, exercises and operations in anoxic environments, and is also suitable for special fields such as aviation, spaceflight and diving. It can also be used in sports, plateau tourism, plateau operations and other fields or groups of people.
Owner:THE QUARTERMASTER EQUIPMENT RESEARCH INSTITUTE OF THE GENERAL LOGISITIC DEPARTME

Gradually-increased knowledge graph entity extraction method and system for electric power customer service questions and answers

The invention discloses a gradually-increased knowledge graph entity extraction method and system for electric power customer service questions and answers. The method comprises the steps of taking historical question and answer record data of electric power customer service as a basic corpus to form a basic data set; extracting named entities and event entities from the basic data set and setting a coexistence relationship between the entities to form an initial seed entity set; extracting named entities and event entities from the corpus data of the electric power customer service field and constructing a coexistence relationship between the entities to form an entity set corresponding to each data source; and screening entities from the entity set corresponding to each power customer service corpus data source to expand the seed entity set, forming an entity set oriented to the power customer service question and answer knowledge graph. The invention is capable of autonomously selecting corpus data and entities.
Owner:STATE GRID JIANGSU ELECTRIC POWER CO LTD MARKETING SERVICE CENT +2

North cordyceps sinensis herba epimedii compound tea bag product and producing method thereof

The invention discloses a north cordyceps sinensis herba epimedii compound tea bag product and a producing method thereof. The producing method comprises the steps of: uniformly mixing 88% of rice, 9.6-10% of crushed corn particles, 0.4% of yeast powder, 0.4% of peptone, 0.15% of KH2PO4, 0.05% of MgSO4 and 1% of cane sugar or 0.4% of herba epimedii powder; adding stilled water according to a ratio by weight of 1: 1.3; after cooking and sterilizing at a high pressure, inoculating north cordyceps sinensis strains; freezing and drying seed entities after being cultured for 50 days; crushing the seed entities and sieving the seed entities with a 60-mesh sieve to be combined with the herba epimedii of 60 meshes according to a ratio of 1: (0.5-1); and filling the mixture in a filtering bag of 80 meshes, wherein each bag is 3-5 g; sealing, bagging and sterilizing by irradiation to obtain the north cordyceps sinensis herba epimedii compound tea bag finished product. The north cordyceps sinensis herba epimedii compound tea bag product, disclosed by the invention, has the advantages of enhancing immunity, preventing tumor, resisting to bacteria and diminishing inflammation, defying aging and the like and playing the function of berberidaceae perennial herb epimedii of tonifying kidney and strengthening yang as well as dispelling wind and eliminating dampness effectively.
Owner:福建农大科技开发有限公司

Text entity detection method and system and related components

The invention discloses a text entity detection method. The method comprises the steps of matching each statement instance in a target statement against a seed entity set to obtain a matching result, and generating annotation data corresponding to the target statement according to the matching result; querying the statement instances in the target statement that match an unlabeled corpus word frequency table, and modifying the annotation data according to the query result to obtain local annotation data; training a sequence annotation neural model by using the local annotation data; and performing sequence annotation on the unannotated corpus in the target statement by using the trained sequence annotation neural model so as to obtain an entity set of the target statement. With this method, high-quality entity mining can be achieved without being limited by the quality and scale of the unlabeled corpus. The invention further discloses a text entity detection system, a computer readable storage medium and electronic equipment, which have the above beneficial effects.
Owner:SUZHOU UNIV

Information mining method and apparatus

Embodiments of the present invention provide an information mining method and apparatus. The method comprises the following steps: mining the query statements of each specific category from a search log; specifying seed entities for the specific category; generating an expression template corresponding to each query statement of the specific category according to the seed entities of that category and each query statement; and, according to the query statements and the corresponding expression templates, extracting high-frequency query statements and high-frequency expression templates from the search log. Because the users' search log is used as the data source, the high-frequency statements and expression templates obtained are rich and can cover all kinds of user expression habits, including content such as colloquial expressions that manually written templates cannot cover.
Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD

Unsupervised knowledge graph entity alignment method and equipment

The invention discloses an unsupervised knowledge graph entity alignment method and equipment. The method comprises the following steps: acquiring data of two knowledge graphs; generating a text distance matrix by using the auxiliary information of the entities in the knowledge graph; generating an initial alignment result as a seed entity pair set by using threshold bidirectional nearest neighbor search; on the basis that the seed entity pair set is marked data, learning a structure distance matrix of the entity by using a graph convolutional network; fusing the text distance matrix and the structure distance matrix of the entity to obtain a fused distance matrix; performing progressive learning to obtain a newly generated alignment entity pair, merging the newly generated alignment entity pair into a seed entity pair set, and using the merged seed entity pair set to iteratively update structure embedding; and repeating the first three steps until the number of the newly generated alignment entity pairs is lower than a preset value, and obtaining a final entity alignment result.
Owner:NAT UNIV OF DEFENSE TECH
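
The seed-generation step named above, a thresholded bidirectional nearest neighbor search, can be sketched in a few lines. The distance matrix is taken as given, the threshold value is arbitrary, and the function name is hypothetical; how the text distances are computed is not covered here.

```python
from typing import List, Sequence, Tuple


def mutual_nearest_pairs(distance: Sequence[Sequence[float]], threshold: float) -> List[Tuple[int, int]]:
    """distance[i][j] is the distance between entity i (graph 1) and entity j (graph 2).

    A pair (i, j) is kept only if j is the nearest neighbor of i, i is the
    nearest neighbor of j, and their distance is below the threshold.
    """
    if not distance or not distance[0]:
        return []
    n_rows, n_cols = len(distance), len(distance[0])
    best_col = [min(range(n_cols), key=lambda j: distance[i][j]) for i in range(n_rows)]
    best_row = [min(range(n_rows), key=lambda i: distance[i][j]) for j in range(n_cols)]
    return [
        (i, j) for i, j in enumerate(best_col)
        if best_row[j] == i and distance[i][j] < threshold
    ]


toy_distance = [
    [0.10, 0.90, 0.80],
    [0.70, 0.20, 0.90],
    [0.60, 0.65, 0.70],
]
toy_seed_pairs = mutual_nearest_pairs(toy_distance, threshold=0.5)
```

Row 2 prefers column 0, but column 0 prefers row 0, so that pair is rejected; only the mutually nearest pairs under the threshold become seed pairs.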

Methods and systems for electronic transactions

An example system can comprise a closed group of member financial institution servers. The system can comprise a data feed manager configured to communicate amongst the closed group of member financial institution servers over a network. The system can comprise a data aggregator configured to aggregate data related to transactions conducted by the closed group of member financial institution servers. The aggregated data can comprise an aggregated interest from the transactions. The system can comprise a distribution manager configured to determine one or more distribution levels for the member financial institutions and one or more seeding entities based on the aggregated data.
Owner:GLOBEONE

Preparation method of leucocytopenia adjuvant

The invention discloses a Chinese medicine preparation, in particular a method for preparing an auxiliary medicine for the treatment of leucopenia. The main process comprises: taking the tremella seed entity (fruiting body); making a dried tremella polysaccharide paste by submerged fermentation, concentration, ethanol elution, separation and drying; crushing the dried paste; and performing maturation and sterilization to obtain the powder of the invention, which can be filled into capsules for convenient administration. The medicine has an immunostimulating function, can improve specific or non-specific immune responses, and is mainly used clinically for treating leucopenia caused by radiotherapy, chemotherapy or other causes in tumor patients.
Owner:JINLIN ZHENGTAI ZHONGHUI PHARMA GROUP

Recognition method, device and electronic equipment for nested entity data

The application discloses a nested entity data identification method, device and electronic equipment, and relates to the technical field of data identification. The method includes: permuting and combining the seed entity vocabularies of different entity categories to generate a short text data set; defining, for each short text in the data set, at least one entity category label and the start and end index information of the sub-text corresponding to each entity category label in the short text; using the labeled short text data set as the training set to train a deep learning recognition model; and using the trained recognition model to recognize nested entity data. By defining entity labeling information for sentences through start and end indexes plus entity category labels, the application makes multi-nested entity content labeling easier to implement, optimizes the labeling process and workload of nested entity recognition, and saves time and labor costs, which in turn can improve the recognition efficiency and accuracy of nested entity data.
Owner:BEIJING PERFECT WORLD SOFTWARE TECH DEV CO LTD

Neural machine translation method and device based on knowledge graph, equipment and medium

Pending | CN114118104A | Improve entity translation accuracy | Natural language translation | Knowledge representation | Seed entity | Sentence pair
The invention provides a neural machine translation method and device based on a knowledge graph, equipment and a medium, and the method comprises the steps: obtaining an original bilingual parallel statement pair, extracting a word and phrase translation pair according to the original bilingual parallel statement pair, and obtaining a corresponding seed entity translation pair; obtaining a source language knowledge graph and a target language knowledge graph, and constructing a corresponding vector space according to the seed entity translation pair, the source language knowledge graph and the target language knowledge graph; when a to-be-translated entity set is obtained, deducing the to-be-translated entity set according to the vector space to obtain a corresponding to-be-translated entity translation pair; and calculating the distance between the seed entity translation pair and the to-be-translated entity translation pair, and obtaining a pseudo bilingual parallel sentence pair containing the to-be-translated entity translation pair according to the distance. According to the method, the knowledge graph is fused into the neural machine translation, and the entity translation accuracy of the neural machine translation is improved by utilizing rich entity knowledge in the knowledge graph.
Owner:INST OF AUTOMATION CHINESE ACAD OF SCI