Background technology
Increasingly mature along with this emerging medium of building TV, the clear location of its lively form of expression, Focus, compulsory view reception effect enjoy middle and high end advertiser's favor, become China's successful story in new media market for over ten years.The LCD TV radio hookup net of tens thousand of business premises of whole nation dozens of cities has been present among the life of city white collar truly, affects their Brang Awareness subtlely, is provoking their desire to purchase for extensive stock.If before 2004 when the building TV exposes as brand-new media vehicles, also have many people may hold the suspection attitude to its market outlook and profit, advancing by leaps and bounds of international funds that the today after 3 years, the building TV was raised and advertising income brought unprecedented impact all for whole medium market.
Building advertisement is a kind of medium that Focus Media establishes the earliest, just is mounted in the advertisement that a kind of building LCD TV in places such as high-grade office building, hotel, hotel, office building, megastore is play.Its success can reduce the following aspects.At first, compare traditional outdoor advertising, it is a kind of outdoor advertising of televised, it is the advertisement of audio ﹠ video combination, it has more expressive force and moves power than conventional outdoor advertising, its value is to improve brand recognition, more may change audient crowd's Brang Awareness with the powerful expressive force of its video display, provokes the desire for consumer goods of people to promotion item.On the other hand, compare with conventional advertisement, it is again a kind of advertisement of Focusization, and the business premises network advertising that Focus is made can accurately lock the stratum that corporate boss, manager and white collar etc. have more consumptive power, accurately hit audient crowd.In addition, the business premises network advertising of Focus also has high mandatory view reception effect.Compare outdoor advertising, the LCD of Focus is placed on mandatory rating district, and only is unique channel in business premises, and the advertisement of LCD bottom news rolling in real time simultaneously and excellent fashion interts mutually merges, and very high ad attention is arranged.From another angle, the building television hookup advertisement of Focus not only has the mandatory of vision, have more the mandatory of psychology, similarly be that audient crowd can unconsciously remove to see the flight magazine aboard, because audient crowd is in the area of an information vacuum, they can feel bored, barren and uncomfortable, so the audient crowd who is detained at the business premises lift port also has same situation, helpless seeking freed, boring sensation seeking, barren seek interesting, as long as a dot information is arranged, just can activate and note and interest, even advertisement.Because they are in a time and a space than advertisement more bored, this is so-called " waiting power economy " maximum feature.
And in the present building advertisement, though the lively form of expression and compulsory view reception effect that it has can form good communication effect, but the clear polarization of its Focus but still is worth further improving, show more intellectuality and hommization, the reception and registration effect of reinforcement advertising.Here show mainly that correspondent advertisement is observed and made in audient crowd's active plays the adjustment aspect, need utilize new technology that audient crowd is carried out the more layering of human nature and also play respective advertisement targetedly, thereby can more effectively publicize widely, strengthen the automatic station-keeping ability of Focus of building advertisement.
Summary of the invention
The invention provides a kind of speciality that can real-time judge commercial audience crowd, and choose the intelligence controlling device and the method towards building television advertising system of corresponding ad playing content according to this speciality.
In order to reach above-mentioned purpose, a kind of intelligence control method towards building television advertising system of the present invention mainly comprises training and discerns two stages, wherein:
This training stage comprises the branch of no sequencing and fully independently estimation of Age training and sex recognition training, the weighting parameter of each training is kept in the register of corresponding neural network chip automatically, and this neural network chip obtains weighting parameter automatically when by the time discerning from register;
The training of described estimation of Age is divided into men and women's two parts by sex to be carried out separately, two groups of files of the age label of facial image correspondence in sex face database of being made up of facial image by the data-interface input earlier by sex before the training and the face database; In the DSP graphics processing unit, sample is finished geometrical normalization and the standardized image pre-service of gray balance light that comprises facial image rotation, convergent-divergent then; Then pretreated imagery exploitation NNSC algorithm is calculated image feature value in the DPS graphics processing unit, and this image feature value imported into corresponding estimation of Age neural network chip together with class label information through data bus, estimate that through the age neural network chip training draws hidden layer and also is stored to automatically in the corresponding register with the weighting parameter of output layer;
Described sex recognition training, first two groups of files of the sex label of facial image correspondence in face database of being made up of facial image by the data-interface input and the face database are finished the geometrical normalization that comprises facial image rotation, convergent-divergent and the standardized image pre-service of light of use gray balance to sample then in the DSP graphics processing unit; Then pretreated imagery exploitation PCA algorithm computation is published picture as eigenwert, and image feature value and classification information imported in the corresponding sex identification neural network chip through data bus, obtain hidden layer through the training of sex identification neural network chip and also be stored to automatically in the corresponding register with the weighting parameter of output layer;
This cognitive phase specifically may further comprise the steps:
Step 1, earlier obtain by image collecting device and be subjected to behind building TV everybody group image and be sent to the DSP graphics processing unit to carry out the image pre-service;
Step 2, end user's face detection tech obtains the facial image of all audient crowd's individualities in audient crowd's image in the DSP graphics processing unit, detected every facial image is carried out eyes coordinate setting, and utilize the binocular information that obtains to carry out the image geometry normalized, use the gray balance technology to carry out the standardized image pre-service of light simultaneously, and it is pulled into capable vector form, then this sample vector is done projection to prior off-line training good PCA and two basis matrixs of NNSC respectively, obtain the two group face characteristic values relevant with sex and age;
Step 3, lineup's face eigenwert relevant with sex in the step 2 is exported to sex to be estimated in the neural network chip, neural network chip reads the weights data that trained and carries out sex identification from its register, can obtain the sex label information of current facial image, according to the sex label information, exporting lineup's face eigenwert relevant with the age in the step 2 to men age respectively according to sex estimates in neural network chip or the female age estimation neural network chip, this neural network chip reads the weighting parameter that has trained and carries out age identification from its corresponding register, and the estimation of Age value of this facial image is temporarily stored in its counter tank; This step of circular flow is sent to control module with the pairing sex of each facial image, estimation of Age value in the counter tank after face images is finished identification;
Step 4, control module are judged the sex age level that current audient crowd occupies the majority according to these data, and the advertisement of selecting thus to have had at this audient crowd's sex age level is delivered to the building TV and play.
Described NNSC arthmetic statement is as follows:
Step 1, iterations initial value k=1 is set, PRP conjugate gradient algorithm precision precision ζ>0.01 and image reconstruction error ζ≤0.02; Select base vector a
i(k) and coefficient vector s
i(k) non-negative initial value, and to base vector a
i(k) and coefficient vector s
i(k) carry out the normalization operation;
Step 2, iterative process:
Fixing current base vector a
i(k), adopt the PRP algorithm to realize coefficient vector s
i(k) iteration; Make coefficient component s
i(k) all negative elements in are zero, and normalization
Order then
The currency s that fixedly obtains
i(k+1), adopt the PRP algorithm to realize base vector a
i(k) iteration; Make base vector a
i(k) all negative elements in are zero, make a
i(k+1) :=a
i(k); Make k:=k+1; If algorithm convergence then finish the conjugate gradient iterative process, otherwise restart the conjugate gradient iterative process;
Step 3, to basis function A and matrix of coefficients S after upgrading, judge
Whether set up, if set up, learning process, the basis function A that obtains after the algorithm convergence is the NNSC basis matrix of being asked if finishing; Otherwise the conjugate gradient iterative process of repeating step 2.
A kind of intelligence controlling device towards building television advertising system of the present invention mainly comprises image collecting device, DSP graphics processing unit, sex estimation neural network chip, men age estimation neural network chip, female age estimation neural network chip and control module;
Image collecting device wherein: be used to gather building TV audient crowd image, and be sent to the DSP graphics processing unit and carry out the image pre-service;
The DSP graphics processing unit: end user's face detection tech obtains the facial image of all audient crowd's individualities from audient crowd's image that image collecting device transmits, detected every facial image is carried out eyes coordinate setting, and utilize the binocular information that obtains to carry out the image geometry normalized, use the gray balance technology to carry out the standardized image pre-service of light simultaneously, and it is pulled into capable vector form, then this row vector is done projection to prior off-line training good PCA and two basis matrixs of NNSC respectively, obtain two groups of relevant with sex and age respectively face characteristic values;
Sex estimation neural network chip: the face characteristic value relevant with sex that pre-service obtains to process DSP graphics processing unit image carried out sex identification, and exports in men age estimation neural network chip or the female age estimation neural network chip according to sex recognition result lineup's face eigenwert that the DSP Flame Image Process Dan Zhongyu age is relevant;
Men age is estimated neural network chip or female age estimation neural network chip: lineup's face eigenwert relevant with the age in the DSP graphics processing unit is carried out age identification, and sex, the estimation of Age value of each facial image correspondence is sent to control module;
Control module: judge the sex age level that current audient crowd occupies the majority according to these data, the advertisement of selecting thus to have had at this audient crowd's sex age level is delivered to the building TV and is play.
The present invention utilizes biometrics identification technology to carry out layering to the commercial audience crowd, and on existing recognition technology, estimation of Age and sex identification can form maximum commercial value to commercial audience crowd layering.Can distinguish audient crowd's sex by sex identification, obtain current audient crowd's most of same sex character, utilize simultaneously estimation of Age to obtain overall ages again, thereby can carry out effectively obtaining best demonstration effect at sex, at ages audient crowd's advertisement.
Embodiment
A kind of intelligence control method towards building television advertising system of the present invention mainly comprises training and discerns two stages, wherein:
This training stage comprises the branch of no sequencing and fully independently estimation of Age training and sex recognition training, the present invention is provided with three radial base neural net chips that are respectively applied for sex identification, male sex's facial image estimation of Age and women's facial image estimation of Age, the weighting parameter of each training is kept in the register of neural network chip automatically, and this neural network chip obtains weighting parameter automatically when by the time discerning from register;
The present invention is divided into men and women's two parts by sex and carries out the estimation of Age training separately, the age label of facial image correspondence is (as 1 in sex face database of being made up of facial image by the data-interface input earlier by sex before the training and the face database, 2,30 etc.) two groups of files; In the DSP graphics processing unit sample being finished geometrical normalizations such as comprising facial image rotation, convergent-divergent then handles, uses simultaneously gray balance etc. to carry out the image pre-service of light standardization; Then pretreated imagery exploitation NNSC algorithm is calculated image feature value in the DPS graphics processing unit, and this image feature value imported into corresponding estimation of Age neural network chip (determining to select which kind of neural network chip by class label information) together with class label information through data bus, estimate that through the age neural network chip training draws hidden layer and also is stored to (weights of input layer and hidden layer do not need to calculate, and are fixed as 1) in the corresponding register automatically with the weighting parameter of output layer.
During the sex recognition training, earlier the sex label of facial image correspondence (is represented the male sex as 0 in face database of being made up of facial image by the data-interface input and the face database, 1 expression women) two groups of files are finished geometrical normalizations such as comprising facial image rotation, convergent-divergent processing, are used gray balance etc. to carry out the image pre-service of light standardization simultaneously sample in the DSP graphics processing unit then; Then pretreated imagery exploitation PCA algorithm computation is published picture as eigenwert, and image feature value and classification information imported in the corresponding sex identification neural network chip through data bus, obtain hidden layer through the training of sex identification neural network chip and also be stored to automatically in the corresponding register with the weighting parameter of output layer.
As shown in Figure 1, 2, this cognitive phase specifically may further comprise the steps:
Step 1, earlier obtain by image collecting device 1 and be subjected to behind building TV everybody group image and be sent to DSP graphics processing unit 2 to carry out the image pre-service;
Step 2, end user's face detection tech obtains the facial image of all audient crowd's individualities in audient crowd's image in DSP graphics processing unit 2, detected every facial image is carried out eyes coordinate setting, and utilize the binocular information that obtains to carry out the image geometry normalized, use the gray balance technology to carry out the standardized image pre-service of light simultaneously, and it is pulled into capable vector form, then this sample vector is done projection (being that the capable vector of sample multiplies each other with basis matrix) to prior off-line training good PCA and two basis matrixs of NNSC respectively, obtain the two group face characteristic values relevant with sex and age;
Step 3, lineup's face eigenwert relevant with sex in the step 2 is exported to sex to be estimated in the neural network chip 3, sex estimates that neural network chip 3 reads the weights data that trained and carries out sex identification from its register, can obtain the sex label information of current facial image, according to the sex label information, exporting lineup's face eigenwert relevant with the age in the step 2 to men age respectively according to sex estimates in neural network chip 41 or the female age estimation neural network chip 42, this neural network chip 41 or 42 reads the weighting parameter that has trained and carries out age identification from its corresponding register, and the estimation of Age value of this facial image is temporarily stored in its counter tank; This step of circular flow is sent to control module 5 with the pairing sex of each facial image, estimation of Age value in the counter tank after face images is finished identification;
Step 4, control module 5 are judged the sex age level that current audient crowd occupies the majority according to these data, and the advertisement of selecting thus to have had at this audient crowd's sex age level is delivered to the building TV and play.
As shown in Figure 2, a kind of intelligence controlling device towards building television advertising system of the present invention mainly comprises image collecting device 1, DSP graphics processing unit 2, sex estimation neural network chip 3, men age estimation neural network chip 41, female age estimation neural network chip 42 and control module 5:
Wherein image collecting device 1: be used to gather building TV audient crowd image, and be sent to DSP graphics processing unit 2 and carry out the image pre-service;
DSP graphics processing unit 2: end user's face detection tech obtains the facial image of all audient crowd's individualities from audient crowd's image that image collecting device 1 transmits, detected every facial image is carried out eyes coordinate setting, and utilize the binocular information that obtains to carry out the image geometry normalized, use the gray balance technology to carry out the standardized image pre-service of light simultaneously, and it is pulled into capable vector form, then this row vector is done projection to prior off-line training good PCA and two basis matrixs of NNSC respectively, obtain two groups of relevant with sex and age respectively face characteristic values;
Sex estimation neural network chip 3: the face characteristic value relevant with sex that the 2 image pre-service through the DSP graphics processing unit obtain carried out sex identification, and lineup's face eigenwert relevant with the age in the DSP graphics processing unit 2 is exported in men age estimation neural network chip 41 or the female age estimation neural network chip 42 according to the sex recognition result;
Men age is estimated neural network chip 41 or female age estimation neural network chip 42: lineup's face eigenwert relevant with the age in the DSP graphics processing unit 2 is carried out age identification, and sex, the estimation of Age value of each facial image correspondence is sent to control module 5;
Control module 5: judge the sex age level that current audient crowd occupies the majority according to these data, the advertisement of selecting thus to have had at this audient crowd's sex age level is delivered to the building TV and is play.
The present invention in normal operation, being used for sex identification neural network chip 3 in three neural network chips at first works, its output result is 0 (male sex) or 1 (women), use simple gate circuit output port as a result and men age are estimated that neural network chip 41 and female age estimate that neural network chip 42 is connected respectively, reach the purpose of automatic selection chip.
Because calculated amount is bigger, the calculating of NNSC basis matrix and PCA basis matrix should not realize in DSP graphics processing unit 2, so adopts the mode of calculated off-line, and will calculate the matrix value of getting well and be stored in the register of each neural network chip.
For given estimation of Age training sample, on PC, utilize the NNSC algorithm that training sample is carried out off-line training in advance and obtain basis matrix, then this basis matrix is passed through in the register of data-interface afferent nerve network chip.In DSP graphics processing unit 2, the row vector that will pull into through the facial image after pretreated multiplies each other with this basis matrix, can obtain the proper vector relevant with the age, these characteristic vector datas import corresponding neural network chip again into to carry out the training of neural network weight parameter or is directly used in identification.
For sex recognition training sample, on PC, training sample is carried out calculated off-line equally in advance and obtain basis matrix by the PCA algorithm,, then this basis matrix is passed through in the register of data-interface afferent nerve network chip.In DSP graphics processing unit 2, the row vector that facial image after pretreated is pulled into multiplies each other with this basis matrix, can obtain the proper vector relevant with sex, these characteristic vector datas import corresponding neural network chip into to carry out the training of neural network weight parameter or is directly used in identification.
If the user has the face database of oneself, want device training again, the present invention will provide the algorithm bag to go out above-mentioned two groups of basis matrixs for user's calculated off-line so, and the user can be imported the result into device by data-interface, and recognition training step afterwards can be operated according to device peripheral hardware button.Device peripheral hardware sequence of operations button of the present invention, as " training parameter setting ", " begin training ", " start working " etc., wherein " training parameter setting " button is down with " sex identification " and " estimation of Age ", " estimation of Age " has the sex selection button of " male sex " and " women ", can carry out sample training to the neural network chip of this device when needing.
Non-negative sparse coding algorithm of the present invention (NNSC) is a kind of new subspace method, and it is a kind of combination of NMF algorithm and SC algorithm.It has all applied non-negativity constraint to the pixel and the reconstructed coefficients of basic image, make that reconstructed image is to be formed by the non-stack combinations that subtracts of basic image, the notion that more meets " local formation is whole " among the human thinking, simultaneously and the change of age locality that influence shows to people's face match, have intuitively feasibility at the application power of estimation of Age.Its feature extraction ability to facial image has just in time met the requirement of the required feature extraction of estimation of Age, can be used as the feature extracting method of good estimation of Age.
Described non-negative sparse coding (NNSC) algorithm based on expansion is on the basis of non-negative sparse coding (NNSC) algorithm that Hoyer proposes, consider the feedback input of the main visual cortex V1 district CK body LGN floor of primary vision system, the hierarchical structure of analog vision system, having set up one has the NNSC neural network model that connects feedback, as shown in Figure 3.Wherein on behalf of extraneous natural perception information, the input layer unit be projected in the input signal of LGN layer behind retina image-forming; Feedforward connects the receptive field of representing V1 district simple cell; Output layer is represented the simple cell in V1 district, and the non-negative sparse coding of the corresponding natural perception information of its active state, output layer have self feed back and exist feedback to connect to input layer.Parameter symbol implication is described below:

Expression LGN is to the feedforward connecting path in V1 district; a
iExpression V1 district is to the feedback connecting path of LGN; f
i(s
i) expression neuron response from inhibition function; X represents to be mapped to by retina the data acquisition of LGN; X (k) remarked pixel coordinate; a
iThe i column vector of expression basis function A; s
iThe capable vector of i of expression matrix of coefficients S, e
iThe capable vector of i of expression error matrix E=X-AS.
The NNSC objective function of described expansion is defined as follows:
This objective function is formed by three, and first is image minimal reconstruction error term; Second is sparse penalty term; The 3rd is the feedback connection item of V1 district to LGN;
Being constrained to of this formula: X (x, y) 〉=0,
And || s
i||=1, wherein,
The pretreated natural image data of the actual expression of X, parameter lambda and γ represent positive constant, the sparse penalty f (s in second of the objective function
i) selection depend on coefficient component s
iThe sparse density p (s of priori
i).
In the training stage of expansion NNSC network, in order to embody of the influence of main visual cortex V1 district neuron to the neuronic feedback input of LGN floor, the descend optimized Algorithm of (Conjugate Gradient) of PRP (Polak-Ribiere-Polyak) conjugate gradient is adopted in the renewal of sparse coefficient component, the study of feature basis function is also adopted use the same method.
The NNSC arthmetic statement of described expansion is as follows:
Step 1, iterations initial value k=1 is set, PRP conjugate gradient algorithm precision precision ζ>0.01 and image reconstruction error ζ≤0.02; Select base vector a
i(k) and coefficient vector s
i(k) non-negative initial value, and to base vector a
i(k) and coefficient vector s
i(k) carry out the normalization operation;
Step 2, iterative process:
Fixing current base vector a
i(k), adopt the PRP algorithm to realize coefficient vector s
i(k) iteration; Make coefficient component s
i(k) all negative elements in are zero, and normalization
Order then
The currency s that fixedly obtains
i(k+1), adopt the PRP algorithm to realize base vector a
i(k) iteration; Make base vector a
i(k) all negative elements in are zero, make a
i(k+1) :=a
i(k); Make k:=k+1; If algorithm convergence then finish the conjugate gradient iterative process, otherwise restart the conjugate gradient iterative process;
Step 3, to basis function A and matrix of coefficients S after upgrading, judge
Whether set up, if set up, learning process, the basis function A that obtains after the algorithm convergence is the NNSC basis matrix of being asked if finishing; Otherwise the conjugate gradient iterative process of repeating step 2.
Training sample (or test sample book) is pulled into back the multiplying each other with basis matrix of capable vector can obtain new facial image eigenwert, can be applicable to next step age identification step.
Calculating for the PCA basis matrix, the present invention adopts Artificial Neural Network to realize, when artificial neural network passes through iterative PCA, each input new samples only causes the slight adjustment of basis matrix, neural network only need be upgraded on original basis and get final product when new samples occurring, weights needn't all recomputate, and therefore have adaptivity.Training sample (or test sample book) is pulled into back the multiplying each other with this basis matrix of capable vector can obtain new facial image eigenwert, can be applicable to next step sex identification step.