[go: up one dir, main page]

CN104102813A - White noise compensation method and device - Google Patents

White noise compensation method and device Download PDF

Info

Publication number
CN104102813A
CN104102813A CN201410245026.2A CN201410245026A CN104102813A CN 104102813 A CN104102813 A CN 104102813A CN 201410245026 A CN201410245026 A CN 201410245026A CN 104102813 A CN104102813 A CN 104102813A
Authority
CN
China
Prior art keywords
sample
attribute vector
value
described attribute
measured value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410245026.2A
Other languages
Chinese (zh)
Inventor
朱宝
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Mobile Network Technology (beijing) Co Ltd
Original Assignee
Baidu Mobile Network Technology (beijing) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Baidu Mobile Network Technology (beijing) Co Ltd filed Critical Baidu Mobile Network Technology (beijing) Co Ltd
Priority to CN201410245026.2A priority Critical patent/CN104102813A/en
Publication of CN104102813A publication Critical patent/CN104102813A/en
Pending legal-status Critical Current

Links

Landscapes

  • Complex Calculations (AREA)

Abstract

The invention relates to a white noise compensation method and a white noise compensation device. The white noise compensation method is characterized by comprising the following steps: calculating the average value of measurement values of an attribute vector of each sample in sample space, and taking the average value as an estimation value of the attribute vector; calculating the average value of the estimation values of the attribute vectors of all samples; complementing the number of the measurement values of the attribute vector of the sample with the number of the measurement values of the attribute vector smaller than a preset number to the preset number by taking the average value of the estimation values of the attribute vectors of all samples as the measurement value of the attribute vector of the sample; recalculating the average value of the measurement values of the attribute vector of the sample of which the number of the measurement values of the attribute vectors is complemented, and taking the recalculated average value as the estimation value.

Description

A kind of white noise compensation method and device
Technical field
The present invention relates to field of information processing, particularly the method for the statistical treatment in field of information processing and system.
Background technology
Current, in various fields, all relate to the statistics of sample attribute value, such as the statistics of the sample attribute value all relating in fields such as internet industrys.
But when the measured value of sample attribute value is less, statistics cannot reflect actual conditions sometimes truly, even there will be statistics allow the elusive situation of people.Particularly in recommendation of personalized information field, behavioral data to user arranges, when generating the interactive relation matrix between user and article according to user's operation behavior, conventionally exist certain user's operation behavior very few, the unsettled situation of statistics.
Summary of the invention
The present invention completes in view of the above problems, its object of the present invention is to provide in the situation that work as the less method of utilizing white noise to compensate of measured value of sample attribute value, realization, in the situation that the measured value of sample attribute value is less, is obtained stable sample attribute Data-Statistics result.
According to a kind of white noise compensation method of the present invention, it is characterized in that, comprise the following steps: the mean value of the measured value of the attribute vector of each sample in calculating sample space is as the estimated value of described attribute vector; Calculate the mean value of estimated value of the described attribute vector of all samples; For the number of measurements of described attribute vector, be less than the sample of predetermined number, utilize the mean value of estimated value of described attribute vector of above-mentioned all samples as the measured value of the described attribute vector of sample, the number of measurements of the described attribute vector of sample is complemented to predetermined number; Recalculate the mean value of described attribute vector measured value of sample of the number of measurements of being supplied described attribute vector as estimated value.
According to a kind of white noise compensation method of the present invention, it is characterized in that, comprise the following steps: the mean value of the measured value of the attribute vector of each sample in calculating sample space is as the estimated value of described attribute vector; Calculate the mean value of measured value of all described attribute vectors of all samples; For the number of measurements of described attribute vector, be less than the sample of predetermined number, utilize the mean value of measured value of all described attribute vectors of above-mentioned all samples as the measured value of the described attribute vector of sample, the number of measurements of the described attribute vector of sample is complemented to predetermined number; Recalculate the mean value of described attribute vector measured value of sample of the number of measurements of being supplied described attribute vector as estimated value.
According to a kind of white noise compensation system of the present invention, it is characterized in that, comprising: the estimated value computing unit of each sample attribute vector, the mean value of the measured value of the attribute vector of each sample in calculating sample space is as the estimated value of described attribute vector; The average calculation unit of sample attribute vector, calculates the mean value of estimated value of the described attribute vector of all samples; White noise is supplied unit, for the number of measurements of described attribute vector, be less than the sample of predetermined number, utilize the mean value of estimated value of described attribute vector of above-mentioned all samples as the measured value of the described attribute vector of sample, the number of measurements of the described attribute vector of sample is complemented to predetermined number; Supply the estimated value computing unit of white noise sample attribute vector, recalculate the mean value of described attribute vector measured value of sample of the number of measurements of being supplied described attribute vector as estimated value.
According to a kind of white noise compensation system of the present invention, it is characterized in that, comprising: the estimated value computing unit of each sample attribute vector, the mean value of the measured value of the attribute vector of each sample in calculating sample space is as the estimated value of described attribute vector; The average calculation unit of all sample attribute vectors, calculates the mean value of measured value of all described attribute vectors of all samples; White noise is supplied unit, for the number of measurements of described attribute vector, be less than the sample of predetermined number, utilize the mean value of measured value of all described attribute vectors of above-mentioned all samples as the measured value of the described attribute vector of sample, the number of measurements of the described attribute vector of sample is complemented to predetermined number; Supply white noise sample attribute vector estimated value computing unit, recalculate the mean value of described attribute vector measured value of sample of the number of measurements of being supplied described attribute vector as estimated value.
According to white noise compensation method of the present invention and device, even if also can access good statistics effect in the situation that of sample attribute vector measurement value negligible amounts, can be applicable to any field of information processing that need to carry out sample attribute Data-Statistics, efficiently solve the problem of stablizing statistics in statistics.
Accompanying drawing explanation
Fig. 1 illustrates the process flow diagram of the white noise compensation method of embodiment 1;
Fig. 2 illustrates the process flow diagram of another white noise compensation method of embodiment 1;
Fig. 3 illustrates the process flow diagram of the white noise compensation method of embodiment 2;
Fig. 4 illustrates the process flow diagram of another white noise compensation method of embodiment 2;
Fig. 5 illustrates the process flow diagram of the white noise compensation method of embodiment 3;
Fig. 6 illustrates the process flow diagram of another white noise compensation method of embodiment 3;
Fig. 7 is the block diagram that white noise compensation system is shown;
Fig. 8 is the block diagram that another white noise compensation system is shown.
Embodiment
The sample group that the present invention be directed to the sample attribute vector measurement value negligible amounts of sample space is obtained the attribute vector estimated value of sample comparatively exactly.
First, calculate the estimated value of each sample attribute vector.Utilize the attribute vector measured value of known sample, ask for the average of attribute vector measured value as the estimated value of the attribute vector of each sample.
Then, according to the estimated value of each sample attribute vector, obtain the average of the estimated value of sample attribute vector.
Then, for attribute vector number of measurement values in sample, be less than the sample of some, utilize the average of the estimated value of resulting attribute vector to supply sample attribute vector measurement value number as sample attribute vector measurement value.
Finally, for the sample of supplying attribute vector number of measurements, recalculate the estimated value of new sample attribute vector.
Be specifically described for example below.
Embodiment 1
Factory adds up produce three batches of part lifes, but because sampling sample size is different, first batch more with second batch sub-sampling sample, the 3rd batch of sampling sample is less, in order further to understand the situation of the 3rd batch of sampling sample, carry out white noise compensation, the 3rd batch of sampling sample complemented to the quantity M that can substantially reflect accurate estimated value.Concrete operations illustrate as follows with reference to figure 1.
First, as shown in the step S11 of Fig. 1, calculate respectively the mean lifetime t1 of the part of first production, the mean lifetime t2 of part of second batch time production and the mean lifetime t3 of the part of the 3rd batch of production.
Then, as shown in step S12, calculate the mean lifetime t of these three batches all parts of producing.
Then,, as shown in step S13, with the sample number that the mean lifetime t of all parts removes to supply the 3rd series-produced part, making the 3rd batch of number of samples of producing is M.
Finally, as shown in step S14, recalculate the mean lifetime t1 ' of the 3rd series-produced part, as the mean lifetime of the 3rd series-produced part.
Here, as shown in Figure 2, replace step S12, also can not calculate the mean lifetime t of part used, but as shown in step S22, calculate the mean value of mean lifetime of the part of three batches, i.e. the value of (t1+t2+t3)/3 gained, and with this mean value, as part life, go the sample number of supplying the 3rd series-produced part to M in step S23.
Embodiment 2
With reference to figure 3, take the quality of evaluating and testing a film by the scoring of the film on web film describes as example.As shown in table 1, for example there are three films, be respectively film A, film B, film C, the scoring of film A is had to 2 people, the scoring of film B is had to 5 people, the scoring of film C is had to 9 people, mark is 0 minute to 10 minutes, 10 are divided into full marks.First row represents film, and spectators' scoring is shown in 2-10 list.
Table 1
? 1 2 3 4 5 6 7 8 9
Film A 5 7 ? ? ? ? ? ? ?
Film B 6 8 7 6 5 ? ? ? ?
Film C 8 7 8 6 5 7 8 6 8
First, as shown in the step S31 of Fig. 3, calculate the average mark of every film, the average mark of film A divides for (5+7)/2=6, the average mark of film B divides for (6+8+7+6+5)/5=6.4, and the average mark of film C divides for (8+7+8+6+5+7+8+6+8)/9=6.
Then,, as shown in step S32, calculate the average mark (5+7+6+8+7+6+5+8+7+8+6+5+7+8+6+8)/(2+5+9)=6.6875 of all film marks.
As shown in step S33, at film A and the B lower than 6 people to film pricer number, utilize the average mark 6.6875 of all films to supply sample to 6, obtain following table 2.
Table 2
? 1 2 3 4 5 6 7 8 9
Film A 5 7 6.6875 6.6875 6.6875 6.6875 ? ? ?
Film B 6 8 7 6 5 6.6875 ? ? ?
Film C 8 7 8 6 5 7 8 6 8
As shown in step S34, reappraise the new score value of every film, the average mark of film A divides for (5+7+6.6875+6.6875+6.6875+6.6875)/6=6.458, the average mark of film B divides for (6+8+7+6+5+6.6875)/6=6.45, and the average mark of film C divides for (8+7+8+6+5+7+8+6+8)/9=6.Therefore to the fractional value of the scoring of film A, being 6.458, is 6.45 to the fractional value of the scoring of film B, to the fractional value of the scoring of film C, is 6.
In above-mentioned, as shown in Figure 4, the step S32 that also can replace Fig. 3 calculates the step of all film average marks, but as shown in step S42, calculate the average mark to all film average marks, i.e. (6+6.4+6)/3=6.13, and in step S43 by the 6.13 evaluation numbers of removing to supply film A and film B as fractional value.
By the above-mentioned sample for the discontented sufficient predetermined number of numerical value number in each sample, by supplying sample size to a certain predetermined value, thereby obtain stablizing the sample estimated value of statistics.
Embodiment described above has carried out sample to supply in the situation that there is an attribute.Also can be that attribute number does not meet predetermined number when carrying out sample statistics with a plurality of independent attributes sample carries out sample and supplies.
Embodiment 3
With reference to 5 pairs of situations that exist a plurality of independent attributes to carry out sample statistics of figure, describe.In fictitia, on some websites, have a lot of films, marking with watching duration is the attribute vector of film.Want now that the scoring and the user that determine certain film watch duration.Known have three users that this film is watched and marked now.Measured value about scoring on this website is respectively 7,5,8, about watching the measured value of duration to be respectively 1.4,1.6,1.5.Concrete condition is as shown in table 3.
Table 3
? Scoring (full marks 10 minutes) Watch duration (hour)
User 1 7 1.4
User 2 5 1.6
User 3 8 1.5
First as shown in step S51, calculate about the mean value of the scoring measured value of this film and the mean value of watching duration measured value, the mean value of measured value of marking is (7+5+8)/3=6.67, and the mean value of watching duration measured value is (1.4+1.6+1.5)/3=1.5.
The number of users that a known film is watched, more than 30, could reflect film in the actual scoring of this website and watch duration.But because above-mentioned this website only has 3 for the evaluation quantity of this film, evaluate quantity very few, therefore two of this film of this website attribute vectors are carried out to white noise compensation.
As shown in step S52, in order to predict that accurately scoring and the user of this film watches duration, this website is found on other similar websites, obtain each website to the scoring of this film and watch the mean value of duration, and calculate and comprise this website in the scoring of each interior website and watch the mean value of the mean value of duration measured value, as shown in table 4 below.
Table 4
? Scoring (full marks 10 minutes) Watch duration (hour)
Film 6 1.2
As shown in step S53, utilize the mean value of above-mentioned mean value to carry out white noise compensation to two of this film of this website attributes, the quantity of attribute vector is complemented to 30.As shown in step S54, calculate the mean value of attribute vector measured value of this film of supplying after attribute vector measured value as estimated value, this website is predicted the scoring of this film and is watched duration to be:
([7,1.4]+[5,1.6]+[8,1.5]+27*[6,1.2])/30=[6.0667,1.230]
So this film is predicted in this website, the scoring on this website is 6.0667 minutes, and watching duration is 1.230 hours.Thereby obtained prediction more accurately.
Also can replace the step S52 in Fig. 5, and shown in step S62 as shown in Figure 6, calculate the mean value of the measured value about attribute scoring of all websites, and calculate all websites about attribute, watch the mean value of the measured value of duration, and as shown in step S63, utilize the mean value of the above-mentioned measured value calculating to go to supply the number of attribute vector measured value of this film of this website, be 30, and utilize the attribute vector measured value of supplying after white noise to come the mean value of computation attribute vector measurement value as estimated value.
Fig. 7 illustrates a kind of white noise compensation system 700, comprising: the estimated value computing unit 701 of each sample attribute vector, and the mean value of the measured value of the attribute vector of each sample in calculating sample space is as the estimated value of described attribute vector; The average calculation unit 702 of sample attribute vector, calculates the mean value of estimated value of the described attribute vector of all samples; White noise is supplied unit 703, for the number of measurements of described attribute vector, be less than the sample of predetermined number, utilize the mean value of estimated value of described attribute vector of above-mentioned all samples as the measured value of the described attribute vector of sample, the number of measurements of the described attribute vector of sample is complemented to predetermined number; Supply the estimated value computing unit 704 of white noise sample attribute vector, recalculate the mean value of described attribute vector measured value of sample of the number of measurements of being supplied described attribute vector as estimated value.
Fig. 8 illustrates another kind of white noise compensation system 800, comprising: the estimated value computing unit 801 of each sample attribute vector, and the mean value of the measured value of the attribute vector of each sample in calculating sample space is as the estimated value of described attribute vector; The average calculation unit 802 of all sample attribute vectors, calculates the mean value of measured value of all described attribute vectors of all samples; White noise is supplied unit 803, for the number of measurements of described attribute vector, be less than the sample of predetermined number, utilize the mean value of measured value of all described attribute vectors of above-mentioned all samples as the measured value of the described attribute vector of sample, the number of measurements of the described attribute vector of sample is complemented to predetermined number; Supply white noise sample attribute vector estimated value computing unit 804, recalculate the mean value of described attribute vector measured value of sample of the number of measurements of being supplied described attribute vector as estimated value.
According to above-mentioned, method and the device of white noise compensation has been described, it is conducive to stablize statistics, realizes and in the situation that number of samples is few, obtains good statistics.
The invention is not restricted to above embodiment, in the scope of its technical conceive, all belong to scope of the present invention.

Claims (4)

1. a white noise compensation method, is characterized in that, comprises the following steps:
The mean value of the measured value of the attribute vector of each sample in calculating sample space is as the estimated value of described attribute vector;
Calculate the mean value of estimated value of the described attribute vector of all samples;
For the number of measurements of described attribute vector, be less than the sample of predetermined number, utilize the mean value of estimated value of described attribute vector of above-mentioned all samples as the measured value of the described attribute vector of sample, the number of measurements of the described attribute vector of sample is complemented to predetermined number;
Recalculate the mean value of described attribute vector measured value of sample of the number of measurements of being supplied described attribute vector as estimated value.
2. a white noise compensation method, is characterized in that, comprises the following steps:
The mean value of the measured value of the attribute vector of each sample in calculating sample space is as the estimated value of described attribute vector;
Calculate the mean value of measured value of all described attribute vectors of all samples;
For the number of measurements of described attribute vector, be less than the sample of predetermined number, utilize the mean value of measured value of all described attribute vectors of above-mentioned all samples as the measured value of the described attribute vector of sample, the number of measurements of the described attribute vector of sample is complemented to predetermined number;
Recalculate the mean value of described attribute vector measured value of sample of the number of measurements of being supplied described attribute vector as estimated value.
3. a white noise compensation system, is characterized in that, comprising:
The estimated value computing unit of each sample attribute vector, the mean value of the measured value of the attribute vector of each sample in calculating sample space is as the estimated value of described attribute vector;
The average calculation unit of sample attribute vector, calculates the mean value of estimated value of the described attribute vector of all samples;
White noise is supplied unit, for the number of measurements of described attribute vector, be less than the sample of predetermined number, utilize the mean value of estimated value of described attribute vector of above-mentioned all samples as the measured value of the described attribute vector of sample, the number of measurements of the described attribute vector of sample is complemented to predetermined number;
Supply the estimated value computing unit of white noise sample attribute vector, recalculate the mean value of described attribute vector measured value of sample of the number of measurements of being supplied described attribute vector as estimated value.
4. a white noise compensation system, is characterized in that, comprising:
The estimated value computing unit of each sample attribute vector, the mean value of the measured value of the attribute vector of each sample in calculating sample space is as the estimated value of described attribute vector;
The average calculation unit of all sample attribute vectors, calculates the mean value of measured value of all described attribute vectors of all samples;
White noise is supplied unit, for the number of measurements of described attribute vector, be less than the sample of predetermined number, utilize the mean value of measured value of all described attribute vectors of above-mentioned all samples as the measured value of the described attribute vector of sample, the number of measurements of the described attribute vector of sample is complemented to predetermined number;
Supply white noise sample attribute vector estimated value computing unit, recalculate the mean value of described attribute vector measured value of sample of the number of measurements of being supplied described attribute vector as estimated value.
CN201410245026.2A 2014-06-04 2014-06-04 White noise compensation method and device Pending CN104102813A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410245026.2A CN104102813A (en) 2014-06-04 2014-06-04 White noise compensation method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410245026.2A CN104102813A (en) 2014-06-04 2014-06-04 White noise compensation method and device

Publications (1)

Publication Number Publication Date
CN104102813A true CN104102813A (en) 2014-10-15

Family

ID=51670958

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410245026.2A Pending CN104102813A (en) 2014-06-04 2014-06-04 White noise compensation method and device

Country Status (1)

Country Link
CN (1) CN104102813A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107358946A (en) * 2017-06-08 2017-11-17 南京邮电大学 Speech-emotion recognition method based on section convolution

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080109428A1 (en) * 2006-11-07 2008-05-08 University Of Washington Efficient top-k query evaluation on probabilistic data
CN101251850A (en) * 2008-01-04 2008-08-27 杨虡 Internet topics ranking system and method based on user prestige
US20090019009A1 (en) * 2007-07-12 2009-01-15 At&T Corp. SYSTEMS, METHODS AND COMPUTER PROGRAM PRODUCTS FOR SEARCHING WITHIN MOVIES (SWiM)
CN103514304A (en) * 2013-10-29 2014-01-15 海南大学 Project recommendation method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080109428A1 (en) * 2006-11-07 2008-05-08 University Of Washington Efficient top-k query evaluation on probabilistic data
US20090019009A1 (en) * 2007-07-12 2009-01-15 At&T Corp. SYSTEMS, METHODS AND COMPUTER PROGRAM PRODUCTS FOR SEARCHING WITHIN MOVIES (SWiM)
CN101251850A (en) * 2008-01-04 2008-08-27 杨虡 Internet topics ranking system and method based on user prestige
CN103514304A (en) * 2013-10-29 2014-01-15 海南大学 Project recommendation method and device

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
"基于用户投票的排名算法(一):Delicious和Hacker News";hzdjwl2010;《http://wenku.baidu.com/view/ffbe761452d380eb62946ddf.html?from=search》;20120424;第23-26页 *
HZDJWL2010: ""基于用户投票的排名算法(一):Delicious和Hacker News"", 《HTTP://WENKU.BAIDU.COM/VIEW/FFBE761452D380EB62946DDF.HTML?FROM=SEARCH》 *
张敏: ""对简单算术平均数与加权算术平均数的关系的质疑"", 《统计与决策》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107358946A (en) * 2017-06-08 2017-11-17 南京邮电大学 Speech-emotion recognition method based on section convolution
CN107358946B (en) * 2017-06-08 2020-11-13 南京邮电大学 Voice emotion recognition method based on slice convolution

Similar Documents

Publication Publication Date Title
US11887133B2 (en) Methods and apparatus to generate electronic mobile measurement census data
US11494636B2 (en) Machine learning-based semiconductor manufacturing yield prediction system and method
US9852440B2 (en) Methods and systems for selecting content for display based on conversion probabilities of paths
SG11201811395YA (en) Method and device for controlling data risk
KR102148248B1 (en) Flagging of ad frames for automatic content recognition
KR20220164621A (en) Methods and apparatus to compensate impression data for misattribution and/or non-coverage by a database proprietor
KR20150067224A (en) Methods and apparatus to estimate demographics of users employing social media
US20140279736A1 (en) Method and system for mapping short term ranking optimization objective to long term engagement
US10825030B2 (en) Methods and apparatus to determine weights for panelists in large scale problems
CN108322827B (en) Method, system, and computer-readable storage medium for measuring a user's video preferences
JP7085486B2 (en) User behavior data processing method and equipment
US20220012661A1 (en) Methods, systems, articles of manufacture and apparatus to calibrate payload information
US11711419B2 (en) System and a method for multisession analysis
US20240414075A1 (en) System and a method for identifying client devices used by the same user
CN104102813A (en) White noise compensation method and device
JP2011227720A (en) Recommendation system, recommendation method and recommendation program
CN110009161A (en) Water supply forecast method and device
CN111400512B (en) Method and device for screening multimedia resources
CN106375850B (en) A kind of judgment method and device matching video
US20170115332A1 (en) Electricity consumption predicting system and electricity consumption predicting method applied for processing machine
US11880643B2 (en) Device and method for integrated circuit assistance design, and method for constructing electrical performance gradient model
JPWO2016157927A1 (en) Data processing apparatus, data processing method, and program
JP2017117023A (en) Data analysis device, data computing device, and program
TW201620057A (en) Yield rate determination method for wafer and method for multiple variable detection of wafer acceptance test
CN113888199B (en) Message ordering and publishing method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20141015