Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

146 results about "Data balancing" patented technology

Image automatic marking method based on Monte Carlo data balance

The present invention relates to an image automatic marking method based on Monte Carlo data balance. The method comprises the steps of carrying out the region segmentation on the training sample images in a public image library, enabling the segmented regions possessing different characteristic description to correspond to one marking word, then carrying out the Monte Carlo data balance on the different types of image sets, extracting the multiscale characteristics of the balanced images, and finally inputting the extracted characteristic vectors in a robustness least squares increment limit learning machine to carry out the classification training to obtain a classification model in the image automatic marking; for the to-be-marked images, carrying out the region segmentation on the to-be-marked images, adopting the same multiscale characteristic fusion extraction method and inputting the extracted characteristic vectors in the least squares increment limit learning machine to obtain a final image marking result. Compared with a conventional image automatic marking method, the method of the present invention enables the images to be marked more effectively, is strong in timeliness, can be used for the automatic marking of the large-scale images, and possesses the actual application meaning.
Owner:FUZHOU UNIV

Abnormal operation early warning technology for gateway electric energy metering device

The invention provides an abnormal operation early warning method for a gateway electric energy metering device. The method is characterized by carrying out 7*24-hour uninterruptible overall real-time tracking on the data acquired by the electric energy metering system to realize overall analysis and abnormity early warning of gateway electric energy metering operation, wherein the tracked contents comprise such information as operation maintenance, data processing, acquired events, system working conditions, data balance and computational formula of the electric energy metering device. The device realizes that different managers subscribe to various types and grades of alarm information according to the authorities of the users and can provide various alarm methods including webpage alarm, mail alarm and SMS (short message service) alarm according to the subscription information of the users. The system has the following beneficial effects: the low-cost fault early alarm system solution based on the existing gateway metering charge system is provided, has the characteristics of maintainability, expandability and reusability and can be applicable to other gateway metering charge systems and power utilization information acquisition systems.
Owner:STATE GRID JIANGXI ELECTRIC POWER CO LTD RES INST +1

Text topic classification model based on multi-source-domain integrated migration learning and classification method

The invention discloses a text topic classification model based on multi-source-domain integrated migration learning. The model is composed of a target domain data module, a tagging module, an integrated learning module for multi-source-domain tag determination and a correct data module. According to a classification method for the text topic classification model based on multi-source-domain integrated migration learning, first, data without class tags is classified through the tagging module; and next, data with tags is determined, the data correctly classified through three classifiers is selected and added into the target domain data module, classification is performed through the three classifiers to obtain data with dummy tags and different types of text topics, one type of text topics is selected to serve as target domain data, other types of text topics are used as source domain data and added into the target domain data, and a Softmax classifier is used to test the correct rate. In this way, the negative migration phenomenon brought by single-source-domain migration is effectively avoided, data composition comes from all aspects of a target domain, and data balance can be better met.
Owner:YUNNAN UNIV

Credit evaluation method of online borrowers based on multidimensional data

The invention discloses a P2P borrower credit evaluation method based on big data. The invention comprises a data acquisition module, a data processing module and a model building module. In the era of big data, credit data sources are expanding, mainly including the following four aspects: credit data generated by financial institutions, credit data generated by relevant government departments, credit data generated by other public utilities, Internet credit data generated by the network. The data module is mainly divided into two parts, the credit data generated by financial institutions, relevant government departments and public utilities are qualitatively defined as structured data collection; Social media data, such as WeChat friends and Sina Weibo, are collected as unstructured datain Internet credit data. Data processing module is mainly aimed at structured data, including data balance processing and feature selection. As that imbalance phenomenon exist in the structured dataof personal credit, the invention uses CART-SMOTE algorithm for data balance processing; Under the background of big data, the characteristics of personal credit evaluation data are complicated, and irrelevant and redundant variables will have adverse influence on the accuracy of model prediction. The invention uses random forest and gradient descent decision tree to select evaluation characteristics. The structured data model uses an improved lightGBM for preliminary credit ratings; Feature extraction from unstructured social text data, credit evaluation and affective tendency analysis usingin-depth learning. Then the emotional tendencies in personal social media text data are fed back to the credit evaluation of P2P borrowers to study the correlation between them. Provide a reference for the final credit evaluation structure.
Owner:NANJING UNIV OF TECH

Magnetic tension gradient data balance boundary identification method based on analysis signals

The present invention discloses a magnetic tension gradient data balance boundary identification method based on analysis signals. The method comprises the following steps of: S1, analyzing signal features based on a magnetic tension data direction, and establishing a reasonable analysis signal ratio function to achieve boundary identification work; S2, employing the ratio function to balance effects of geologic bodies with different depths, employing a balance boundary identification technology of different order derivative ratios of direction analysis signals, and obtaining a convergent boundary result; S3, employing a balance boundary identification filter based on the analysis signal ratio to obtain the range of the geologic bodies; and S4, enhancing the resolution of the boundary identification result based on a step balance boundary identification filter based on the analysis signal derivative ratios. The magnetic tension gradient data balance boundary identification method basedon analysis signals can accurately and clearly give the regional mineral distribution, improves the resolution of geologic bodies with large depths while reducing the inclination magnetization interference, and has a large application prospect for the deep mineral resource exploration.
Owner:JILIN UNIV

Data increment method and device, computer equipment and storage medium

The invention discloses a data increment method and device, computer equipment and a storage medium, and the method comprises the steps: obtaining a scene classification sample corresponding to a specific scene and a specified sample proportion, carrying out the text preprocessing of the scene classification sample through employing a regular expression, and obtaining a to-be-trained text; carrying out incremental training on the to-be-trained text by adopting the original word vector model to obtain a target word vector model; based on the actual sample number corresponding to each classification label and the total sample number corresponding to the scene classification samples, determining an actual sample proportion corresponding to the classification labels; if the actual sample proportion is smaller than the specified sample proportion, taking the scene classification sample corresponding to the classification label as a sample to be incremented; and inputting the to-be-incremented sample into the target word vector model for processing, obtaining candidate phrases corresponding to the to-be-incremented sample, randomly selecting one target synonym from each candidate phrasefor replacing the to-be-incremented sample, and obtaining a first newly added sample. The method can effectively guarantee data balance.
Owner:PING AN TECH (SHENZHEN) CO LTD

Vector contour line data partitioning method with space proximity relation considered

The invention discloses a vector contour line data partitioning method with the space proximity relation considered. The method comprises the steps of (1) reading contour line data and conducting quantitative statistics on the characteristics of the contour line data, (2) calculating the coordinates of the central point of the minimum enclosing rectangle of each contour line and expressing vector contour line data with a three-dimensional point provided with elevation information, (3) setting the number K of parallel computational nodes, (4) calculating the load threshold of each computational node in an ideal load balanced state and calculating the lower limit and the upper limit of the load thresholds, (5) selecting M (M=20K) points to serve as initial clustering central points, (6) clustering point features into M class clusters, (7) recalculating the coordinates of the central point of the M class clusters, (8) expressing the M class clusters with tetrads, (9) taking the tetrads as minimum data partitioning units and clustering the M tetrads into K class clusters, and (10) the end. According to the method, the data balancing principle is met, load balancing is guaranteed, and a high spatial clustering degree of partitioned data is guaranteed.
Owner:NANJING NORMAL UNIVERSITY

Orthogonal frequency division multiplexing spread spectrum underwater acoustic communication pilot-free decision feedback channel estimation method under sparse channel condition

The invention discloses an orthogonal frequency division multiplexing spread spectrum underwater acoustic communication pilot-free decision feedback channel estimation method under the sparse channel condition. The method comprises the steps that 1) OFDM frequency domain information extraction is performed; 2) the extracted frequency domain information in the step 1) is transmitted to a spread spectrum identification module to perform spread spectrum identification; 3) the spread spectrum identification result of the step 2) is transmitted to a decision feedback loop module, iterative learning of an underwater acoustic communication is performed and the channel is reconstructed; and 4) despreading outputting of the data balanced by using the reconstructed channel is performed. The spread spectrum signal is effectively identified by using the sparse characteristic of the underwater acoustic communication channel, and decision feedback estimation is performed on the underwater acoustic communication channel response by using the identification result so that reliable reconstruction of the underwater acoustic communication channel can be realized, balancing of the underwater acoustic communication signal can be reliably realized without loss of the communication quality and the underwater acoustic communication rate can be effectively enhanced.
Owner:SUZHOU SOUNDTECH OCEANIC INSTR

Positive and negative sample data balancing method in factory PCB defect detection

The invention discloses a positive and negative sample data balancing method in factory PCB defect detection, which is a data balancing method in PCB positive and negative sample classification basedon an adversarial generative network, and mainly comprises the following steps: collecting, sorting and classifying a data set; designing an encoder which is composed of five convolution layers, and extracting features from the input image by the encoder; designing a converter which is composed of eight residual blocks and converts the feature vector from a source domain to a target domain; designing a decoder, wherein the decoder is composed of five deconvolution layers; designing a discriminator, wherein the discriminator is composed of seven convolution layers; designing a loss function, wherein the loss function comprises four parts; preparing a training set for model training; the obtained weight file is used for a test set, and a negative sample needing to be amplified is synthesized. The method is high in robustness, wide in application range and excellent in synthesis effect. And by means of cyclic consistency conditions, the effect of standardizing the model is achieved, and the generation effect of the shape and texture of the synthesized image is flexibly controlled to a certain extent.
Owner:FOSHAN NANHAI GUANGDONG TECH UNIV CNC EQUIP COOP INNOVATION INST +1

Continuous time balance circuit applied to high-speed serial interface

The invention discloses a continuous time balance circuit applied to a high-speed serial interface. The continuous time balance circuit comprises a programmable matching resistor module which is coupled to the ground, a continuous time balance amplifier circuit and an imbalance calibration module, wherein an external data signal is connected with the programmable matching resistor module through direct-current coupling or alternating-current coupling to generate locally received signals INN and INP; the signals are subjected to data balance through the continuous time balance amplifier, and meanwhile, direct-current level conversion is finished; the unbalanced data signals INN and INP which are referenced to the ground are converted into balance data signals OUTN and OUTP which are referenced to a power supply; and meanwhile, the system imbalance is measured by the imbalance calibration module; and the output Ioffsetn and the output Ioffsetp of the imbalance calibration module are regulated, so that the imbalance and removal are finished. According to the continuous time balance circuit, the three functions of level conversion, imbalance calibration and balance amplification of the data are realized by utilizing the continuous time balance amplifier at the same time; the error code rate of the data transmission is reduced; and the power consumption and the area of an integrated circuit are reduced.
Owner:SHENZHEN GRADUATE SCHOOL TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products