Natural language processing method and system for clinical phenotype information of infertility
A technology of natural language processing and infertility, applied in the field of natural language processing methods and systems for infertility clinical phenotype information, which can solve the inconvenience of rapid matching between infertility clinical phenotype information and phenotype ontology , complex and diverse formats, irregular grammar, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0020] Such as figure 1 As shown, the present invention provides a Chinese segmentation and matching method for infertility clinical phenotype information.
[0021] Step 101, perform natural language preprocessing on the Chinese clinical phenotype character string to obtain the preprocessed Chinese clinical phenotype initial character string.
[0022] Since most of the infertility clinical phenotype information input by medical practitioners is presented in non-standardized language, it contains complex formats (for example: "OR 35, 18 MII"), multilingual mixed (for example: " Full external detection + Microdeletion / Microduplication", "day3 a 7C2 transplantation failed to conceive"), irregular grammar (for example: "prostate ca"), abbreviations or common names instead of standard terms (for example: "RSA", "IVF", "PCOS"), error messages (for example: "CVAVD", "CUAVD"), symbols in the text (for example: "No sperm?.", "FSH 55↑", "<1mL"), etc., increase the Difficulty of matchi...
Embodiment 2
[0060] Such as figure 2 As shown, the present invention provides an English segmentation and matching method for infertility clinical phenotype information.
[0061] Step 201, performing natural language preprocessing on the Chinese clinical phenotype strings to obtain the preprocessed English clinical phenotype initial strings.
[0062] Perform natural language preprocessing on the original strings of Chinese clinical phenotypes, and generate the preprocessed initial strings of English clinical phenotypes can be implemented in the following specific ways: uniformly modify the encoding of the original strings of Chinese clinical phenotypes to UTF-8 encoding format ;Convert all full-width symbols to half-width symbols; convert Arabic numerals to English numerals; eliminate meaningless character strings, such as focus on, none, unchecked, unchecked, normal, past medical history, specific manifestations, require inspection, auspicious see Attachments, etc.; replace irregular cl...
Embodiment 3
[0086] Such as image 3 As shown, the embodiment of the present invention provides the overall flow and weighting rules of the natural language processing method for the clinical phenotype of infertility.
[0087] Such as image 3 The overall process shown, through the natural language processing, splitting, exact matching, and fuzzy matching of the original string of Chinese clinical phenotype, the following string is output: a Chinese independent string that is exactly matched with the Chinese ontology dictionary (step 304) , the English independent character string (step 304) that exactly matches the English ontology dictionary, the Chinese split character string (step 306) that exactly matches the Chinese ontology dictionary, and the English split character string (step 306) that exactly matches the English ontology dictionary , one or more ontologies of the Chinese ontology dictionary that match the Chinese independent character string maximum (step 307), and one or more...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com