Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

40 results about "Word language" patented technology

Voice recognition text error correction method in specific field

The invention relates to a voice recognition text error correction method in a specific field, wherein the method comprises the following steps: firstly, performing statistics by using correct field corpora to obtain a character and word level language model and a pinyin language model; then, receiving a text sequence to be subjected to error correction, and performing clause processing on more than one sentence; determining suspected wrong words by using a word, word and pinyin language model; determining a candidate word list of the suspected wrong words according to a language model vocabulary and a pronunciation-prone dictionary; and finally, substituting candidate words into the original text sequence, and selecting and outputting the most reasonable sentence in combination with macroscopic and microcosmic scores. Basic units with different granularities and dimensions such as characters, words, pinyin and initial and final consonants are selected to construct a language model, and word segmentation error interference caused by wrong characters is reduced; isolated character disorder is processed by adopting a word language model, and continuous recognition errors caused by pronunciation deviation is distinguished by adopting the pinyin language model; and candidate sentences after the wrong words are replaced are comprehensively evaluated by macroscopic and microcosmic scores, and the smoothness degree of the replaced sentences are measured.
Owner:网经科技(苏州)有限公司

Text positioning method and system based on visual structure attribute

The invention belongs to the technical field of image recognition, and particularly relates to a text positioning method and system based on the visual structure attribute. Based on the visual attribute of a text, by means of color polarity difference transformation and edge neighborhood tail end bonding, abundant closed edges are detected so that abundant candidate connection elements can be obtained, then character stroke attributive character and text colony attributive character screening is conducted, the connection elements belonging to characters are extracted from the candidate connection elements, and then the final text is positioned through multi-channel blending and repeated connection element removal. The method is high in robustness and can be adapted to the situation that multiple word language categories are mixed, or various font styles exist, or arrangement directions are random, or background interference exists and other situations, the positioned text can be directly provided for OCR software for recognition, and OCR software recognition rate can be increased. The text positioning method and system based on the visual structure attribute can be applied to image video retrieval, junk information blocking, vision assisted navigation, street view positioning, industrial equipment automation and other fields.
Owner:SHENZHEN UNIV

Multi-language instant translation system based on big data processing and manual intervention

The invention discloses a multi-language instant translation system based on big data processing and manual intervention. The system comprises a parallel corpus and a third-party translation engine. A translation method of the system comprises the steps that a user inputs a to-be-translated language into the parallel corpus, wherein the to-be-translated language is a word language; the parallel corpus retrieves the to-be-translated language, and whether the parallel corpus can directly translate the to-be-translated language is judged; if the parallel corpus can directly translate the to-be-translated language, internally recorded information is retrieved, and then a retrieved translation result is output; if the parallel corpus cannot directly translate the to-be-translated language, the to-be-translated language is input into the third-party translation engine to be translated, and a translation result is output; and the translation result is modified manually, the information obtained after modification is fed back to the parallel corpus, and the information in the parallel corpus is updated continuously. According to the system, through combination of the parallel corpus and the third-party translation engine in combination with manual modification, the translation effect is more intelligent, and meanwhile the translation engine has a learning function.
Owner:成都星阵地科技有限公司

Text detection and recognition method and system combined with text classification

The invention discloses a text detection and recognition method and system combined with text classification. The method comprises the following steps: acquiring all target text line boxes in a target picture; cutting and extracting all the target text line frames to obtain a text graph; sending the text graph into a text direction classification model, and carrying out correction identification to correct the text graph in any direction to the same horizontal direction so as to obtain a text correction graph; sending the text correction graph to a character language classification model, and carrying out character language category recognition to obtain a character language category image; and sending the character language category image into a language text recognition model corresponding to the language category, and performing recognition to obtain a final text content. According to the invention, the problem that text detection in the prior art cannot detect texts with any shapes and complex scenes is solved; the conditions that the text is reversed and the direction is not positive cannot be detected; and the problems of high time cost and low efficiency caused by the fact that multi-language text regions need to be sent into a plurality of models for recognition are solved.
Owner:成都人人互娱科技有限公司

Sign language recognition and conversion system and method based on deep learning and big data

The invention discloses a sign language recognition and conversion system and method based on deep learning and big data. The system comprises an image acquisition module, an image recognition module,an information matching module, a content arrangement module, a text output module and a voice output module. The method includes: collecting a human body image sequence; extracting face key point coordinates and hand key point coordinates in each frame of image of the human body image sequence; searching natural language morphemes most matched with the face key point coordinates and the hand keypoint coordinates in a sign language action database, and calculating matching values; filtering the natural language morphemes according to the repetition condition and the matching value between the adjacent morphemes; converting the reserved natural language morphemes into characters and displaying the characters on a screen; and searching voices corresponding to the characters according to the character language database, and playing the voices. According to the system, the sign language image sequence can be conveniently and quickly converted into characters and voice of other languagesto be output, the meaning of the sign language can be understood more easily, and the communication efficiency is improved.
Owner:TSINGHUA UNIV

Certificate picture generation method, device and equipment, and storage medium

The invention relates to the field of artificial intelligence, discloses a certificate picture generation method, device and equipment and a storage medium, which are used for improving the accuracy of generating a certificate picture conforming to a real scene. The certificate picture generation method comprises the steps of obtaining a sample certificate picture, wherein the sample certificate picture comprises sample text data and sample background data; using a picture similarity comparison algorithm for forming certificate background data and certificate character data based on the sample certificate picture, and the certificate character data comprising character language data and font style data; writing the certificate text data into the random position of the certificate background data to generate an initial certificate picture; preprocessing the initial certificate picture to generate a plurality of preprocessed certificate pictures; and adopting a preset random scaling function to perform multiple times of random scaling on the plurality of preprocessed certificate pictures, so that a plurality of target certificate picture groups can be generated. In addition, the invention also relates to a blockchain technology, and the sample certificate picture can be stored in the blockchain.
Owner:ONE CONNECT SMART TECH CO LTD SHENZHEN

ID mapping method and system based on regulation and control cloud platform

PendingCN111695351AFully consider the differences in text and languageReduce semantic differencesData processing applicationsNatural language data processingDatabaseWord language
The invention provides an ID mapping method based on a regulation and control cloud platform, and the method comprises the steps: obtaining the equipment information of regulation and control cloud and D5000 of a power system, and carrying out the classification of the equipment information of the regulation and control cloud and the D5000 of the power system according to the types; on the basis of classification, sequentially utilizing a pre-constructed Chinese equipment word bank to carry out word segmentation processing on equipment in the power system regulation and control cloud and equipment in D5000, and constructing text information corresponding to the equipment in the power system regulation and control cloud and the equipment in D5000 by considering the occurrence frequency of word segmentation; based on the text information corresponding to the equipment in the power system regulation and control cloud and the text information corresponding to the equipment in the D5000, determining a relationship between the equipment in the power system regulation and control cloud and the equipment in the D5000; through the cloud platform and the D5000 which are matched in a mappingmode, efficient collection and storage can be well conducted, and information exchange sharing and analysis synchronization are achieved; according to the method, the difference of character languagesis fully considered, word segmentation is performed on equipment names by establishing a Chinese word bank, and the semantic difference is reduced.
Owner:CHINA ELECTRIC POWER RES INST

Search text processing method and device, equipment, storage medium and program product

The invention relates to a search text processing method and device, computer equipment, a storage medium and a computer program product. The method comprises the following steps: acquiring a search text for searching commodities; the method comprises the following steps: performing error correction on commodity words extracted from a commodity corpus to obtain a commodity word bank, and performing word segmentation processing on a search text to obtain a word sequence; phrases formed by the independent words in the word sequence and the adjacent words of the independent words are used as potential wrongly-identified words in the search text; on the basis of the pinyin editing distance, candidate words used for correcting the potential wrongly-identified words are searched; using the commodity corpus after error correction to train the language model to obtain a commodity word language model, and determining statement smoothness of potential wrong words and candidate words; and when the statement smoothness of the potential wrong words and the target candidate words meets a replacement condition, replacing the potential wrong words in the search text with the target candidate words to obtain an error correction text. The method is suitable for a commodity search scene.
Owner:TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products