DE19641066A1

DE19641066A1 - Direct adaptive control of non linear system

Info

Publication number: DE19641066A1
Application number: DE1996141066
Authority: DE
Inventors: Dierk Schroeder; Kurt Fischle
Original assignee: Schroeder Dierk Prof Dr-Ing Dr-Ing Hc 82049 Pullach De
Current assignee: Schroeder Dierk Prof Dr-Ing Dr-Ing Hc 82049 Pullach De
Priority date: 1996-10-04
Filing date: 1996-10-04
Publication date: 1998-04-23

Abstract

The adaptive control has the process output y compared with that of a process model. The difference is used in a learning algorithm and this provides an access to a fuzzy logic controller coupled to the process. This allows the parameters to be varied to suit the process condition The learning process is subdivided into a number of part stages that allow the learning time to be reduced.

Description

1 Description and criticism of the state of the art

Bei industriellen Anwendungen der Regelungstechnik sind häufig Strecken zu regeln, welche durch ausge prägte Nichtlinearitäten gekennzeichnet sind und bei denen weiterhin Parameter und Struktur nicht exakt bekannt sind. Zur Regelung solcher Strecken dienen adaptive nichtlineare Regelungsverfahren. Diese lassen sich zunächst einteilen in Verfahren, die auf eine bestimmte Streckenstruktur zugeschnitten sind, und weit gehend universell einsetzbare Verfahren. Zur zweiten Klasse gehören Regelungsverfahren mit neuronalen Netzen sowie adaptive Fuzzy-Regler. Bei den meisten dieser Verfahren ist über die Stabilität und Konver genz des Gesamtsystems aus Strecke, Regler und Adaptionsgesetz keine Aussage möglich; daher können diese Verfahren nur zur Regelung von Strecken eingesetzt werden, die nicht stabilitätskritisch sind, und es sind zusätzliche Überwachungsmaßnahmen erforderlich. Bei einer Reihe von Verfahren ist dagegen ein Sta bilitätsbeweis möglich. Hierbei ist zu unterscheiden zwischen indirekten Verfahren, bei denen zunächst eine Identifikation der Strecke und darauf abgestimmt die Regelung aufgrund des identifizierten Streckenmodells erfolgt; bei direkten Verfahren, wird dagegen der Regler direkt adaptiert, um ein vorgegebenes Verhalten des Regelkreises zu erreichen. Während bei indirekten Verfahren ein Stabilitätsbeweis i.a. nur für das Teil system Strecke - Identifikationsmodell - Adaptionsgesetz möglich ist, kann bei direkten Verfahren Stabilität für das Gesamtsystem aus Strecke, Regler und Adaptionsgesetz bewiesen werden. Es wurde bislang bereits eine Reihe solcher Verfahren veröffentlicht: Tzirkel, E., and Fallside, F., A Direct Control Method for a Class of Nonlinear Systems Using Neutral Networks, Technical Report, Cambridge University Engineering Department, Cambridge, 1993; E. Tzirkel-Hancock and F. Fallside, Stable Control of Nonlinear Systems Using Neural Networks, Int. J. Robust and Nonlinear Control, Vol. 2, no. 1, 1992; Robert M. Sanner and Jean-Jacques E. Slotine, Gaussian Networks for Direct Adaptive Control, IEEE Transactions on Neural Networks, Vol. 3, no. 6, 1992; Clemens Schäffner and Dierk Schröder, An Application of General Regres sion Neural Network to Nonlinear Adaptive Control, 5th European Conference on Power Electronics and Applications (EPE), Vol. 4, Brighton, 1993; Clemens Schäffner, Analyse und Synthese neuronaler Regelungs verfahren, Dissertation, Technische Universität München, Lehrstuhl für elektrische Antriebstechnik, 1996; Li-Xin Wang, Slable Adaptive Fuzzy Control of Nonlinear Systems, IEEE Transactions on Fuzzy Systems, Vol. 1, 1993.In industrial applications of control technology, routes are often to be controlled, which are characterized by Embossed nonlinearities are marked and where parameters and structure continue to be not exact are known. Adaptive non-linear control methods are used to control such lines. Leave this first of all divide into procedures that are tailored to a specific route structure, and widely all-round processes. The second class includes control procedures with neuronal Networks as well as adaptive fuzzy controllers. Most of these methods are about stability and convergence Given the overall system consisting of the system, controller and adaptation law, no statement is possible; therefore can these methods are only used to control lines that are not critical to stability, and it additional monitoring measures are required. In contrast, in a number of procedures a sta proof of balance possible. A distinction must be made between indirect methods, in which one is used first Identification of the route and the regulation based on the identified route model he follows; with direct methods, on the other hand, the controller is directly adapted to a given behavior of the control loop. While indirect methods generally prove stability. only for the part system route - identification model - adaptation law is possible, with direct procedures stability for the entire system consisting of the system, controller and adaptation law. It has been done so far published a number of such procedures: Tzirkel, E., and Fallside, F., A Direct Control Method for a Class of Nonlinear Systems Using Neutral Networks, Technical Report, Cambridge University Engineering Department, Cambridge, 1993; E. Tzirkel-Hancock and F. Fallside, Stable Control of Nonlinear Systems Using Neural Networks, Int. J. Robust and Nonlinear Control, Vol. 2, no. 1, 1992; Robert M. Sanner and Jean-Jacques E. Slotine, Gaussian Networks for Direct Adaptive Control, IEEE Transactions on Neural Networks, Vol. 3, no. 6, 1992; Clemens Schäffner and Dierk Schröder, An Application of General Regres sion Neural Network to Nonlinear Adaptive Control, 5th European Conference on Power Electronics and Applications (EPE), Vol. 4, Brighton, 1993; Clemens Schäffner, analysis and synthesis of neuronal regulation proceedings, dissertation, Technical University of Munich, Chair of Electrical Drive Technology, 1996; Li-Xin Wang, Slable Adaptive Fuzzy Control of Nonlinear Systems, IEEE Transactions on Fuzzy Systems, Vol. 1, 1993.

Diese Regelungsverfahren basieren auf der Grundstruktur nach Abb. 1, wobei statt des Fuzzy-Reglers auch genausogut ein neuronales Netz als nichtlinearer Funktionsapproximator eingesetzt werden kann. Gege ben ist eine nichtlineare Strecke bekannter Ordnung, deren Struktur und Parameter jedoch völlig unbekannt oder nur zum Teil bekannt sind, sowie ein Referenzmodell, welches das gewünschte Führungsverhalten des geschlossenen Regelkreises vorgibt. Das Referenzmodell ist so zu gestalten, daß die physikalischen Gege benheiten der realen nichtlinearen Strecke und des Stellgliedes berücksichtigt werden. Die Strecke ist so zu regeln, daß ihre Ausgangsgröße y dem Referenzsignal y_m folgt. u ist die Stellgröße, x der Zustandsvektor der Strecke, n die Ordnung der Strecke und e = y-y_m der Ausgangsfehler. Das Regelgesetz ist von der Form
These control methods are based on the basic structure according to Fig. 1, whereby instead of the fuzzy controller, a neural network can be used as well as a non-linear function approximator. Given is a non-linear system of known order, the structure and parameters of which are completely unknown or only partially known, and a reference model that specifies the desired control behavior of the closed control loop. The reference model must be designed so that the physical conditions of the real non-linear path and the actuator are taken into account. The distance must be controlled so that its output variable y follows the reference signal y _m . u is the manipulated variable, x the state vector of the system, n the order of the system and e = yy _m the output error. The rule law is of the form

hierbei stellen die Ausdrücke θ T|1ξ(x) und θ T|2ξ(x) universelle nichtlineare Funktionsapproximatoren dar, also z. B. neuronale Netze mit radialen Basisfunktionen oder adaptive Fuzzy-Regler (s. u.). Die Parametervektoren dieser Funktionsapproximatoren werden durch ein Adaptionsgesetz der Form
the expressions θ T | 1 ξ ( x ) and θ T | 2 ξ ( x ) represent universal nonlinear function approximators, e.g. B. neural networks with radial basic functions or adaptive fuzzy controllers (see below). The parameter vectors of these function approximators are given the form by an adaptation law

eingestellt. Dabei ist γ die Adaptionsverstärkung. Mit Hilfe der Stabilitätstheorie von Ljapunow kann nun gezeigt werden, daß das Gesamtsystem stabil ist und der Fehler e gegen Null konvergiert, wenn die Strecke und die Eingangssignale gewissen Bedingungen genügen und die Koeffizienten p₁ . . . p_n die sog. Ljapunow- Gleichung erfüllen.set. Here γ is the adaptation gain. With the help of the stability theory of Ljapunow it can now be shown that the overall system is stable and the error e converges to zero if the path and the input signals meet certain conditions and the coefficients p ₁ . . . p _n satisfy the so-called Lyapunov equation.

Die bislang veröffentlichten Verfahren weisen jedoch eine Reihe von Nachteilen auf, die einer breiten praktischen Anwendung entgegenstehen bzw. diese praktisch verhindern:
However, the processes published to date have a number of disadvantages which prevent or practically prevent their widespread practical use:

- For both the control law and the adaptation law, n-1 derivatives of the control variable required (n = order of the route); in practice, however, this is due to the always existing Measurement disturbances in general cannot be realized.
- The procedures do not work if there is a manipulated variable limit; the presence Such a limitation generally causes a divergence of the adapted parameters, i.e. H. Instability. In in practice, however, unlimited manipulated variables are not possible on any route; in most cases the manipulated variable is limited so far that the limitation also occurs in real operating conditions is achieved.
- Computing time and storage space requirements grow exponentially with the order of the route. Also the Time to learn the desired control law increases strongly with the order of the route and is generally relatively large. This time results from the dynamics of the overall system from the route, Regulator and adaptation law and depends on numerous influencing factors; already for relatively simple ones Third order routes can be several days, which is not tolerable in practice.

Im folgenden Abschnitt wird ein verbessertes adaptives nichtlineares direktes Regelungsverfahren be schrieben, welches diese Nachteile behebt.The following section describes an improved adaptive nonlinear direct control method wrote, which overcomes these disadvantages.

2 Presentation of the invention 2.1 Procedure

Dem Verfahren liegt wieder die Systemstruktur nach Abb. 1 zugrunde. Die Strecke ist so zu regeln, daß ihre Ausgangsgröße y dem Referenzsignal y_m folgt. Für eine bestimmte Klasse von Strecken und Refe renzmodellen kann mittels des Ansatzes der Input-Output-Linearisierung (s. J. Slotine und W. Li, Applied Nonlinear Control) gezeigt werden, daß ein Regelgesetz existiert, welches diese Aufgabe erfüllt. Dieses "idea le" Regelgesetz kann aber nicht explizit berechnet werden, da die Strecke unbekannt ist; daher werden zu seiner Ermittlung Methoden der künstlichen Intelligenz eingesetzt. Hierfür kommen alle Verfahren in Frage, die zum Erlernen statischer nichtlinearer Funktionen geeignet sind, also Radial-Basis-Function-Netzwerke (s. Tzirkel, E., and Fallside, F., A Direct Control Method for a Class of Nonlinear Systems Using Neural Networks, und Robert M. Sanner and Jean-Jacques E. Slotine, Gaussian Networks for Dired Adaptive Con trol), General-Regression-Netzwerke (s. Clemens Schäffner and Dierk Schröder, An Application of General Regression Neural Network to Nonlinear Adaptive Control) oder adaptive Fuzzy-Regler (s. Li-Xin Wang, Stable Adaptive Fuzzy Control of Nonlinear Systems). Das Grundprinzip bleibt bei allen genannten Verfah ren gleich; der Einsatz eines Fuzzy-Reglers hat dabei den Vorteil, daß das Regelgesetz nicht als "black box", sondern in Form anschaulicher WENN-DANN-Regeln vorliegt. Dies ist besonders nützlich, wenn verbal formuliertes Vorwissen zur Initialisierung des Reglers genutzt werden soll. In den folgenden Ausführungen soll deshalb der Ansatz unter Verwendung von Fuzzy-Reglern dargestellt werden; dies soll aber nur als Beispiel dienen und bedeutet keine Beschränkung der vorgestellten Verfahren auf die Fuzzy-Regelung, d. h. es können auch die anderen Methoden verwendet werden.The process is again based on the system structure according to Fig. 1. The distance must be controlled so that its output variable y follows the reference signal y _m . For a certain class of lines and reference models, the approach of input-output linearization (see J. Slotine and W. Li, Applied Nonlinear Control) can be used to show that a rule exists that fulfills this task. However, this "ideal le" rule law cannot be calculated explicitly because the route is unknown; therefore, artificial intelligence methods are used to determine it. All methods that are suitable for learning static nonlinear functions, i.e. radial basis function networks (see Tzirkel, E., and Fallside, F., A Direct Control Method for a Class of Nonlinear Systems Using Neural Networks, and Robert M. Sanner and Jean-Jacques E. Slotine, Gaussian Networks for Dired Adaptive Control, General Regression Networks (see Clemens Schäffner and Dierk Schröder, An Application of General Regression Neural Network to Nonlinear Adaptive Control) or adaptive fuzzy controllers (see Li-Xin Wang, Stable Adaptive Fuzzy Control of Nonlinear Systems). The basic principle remains the same in all of the methods mentioned; The use of a fuzzy controller has the advantage that the control law is not available as a "black box", but in the form of clear IF-THEN rules. This is particularly useful when verbally formulated prior knowledge is to be used to initialize the controller. The approach using fuzzy controllers will therefore be presented in the following explanations; however, this is only intended to serve as an example and does not limit the methods presented to fuzzy control, ie the other methods can also be used.

2.1.1 Control law without differentiation of the controlled variable

Das Regelgesetz zur Berechnung der Stellgröße u aus dem Sollwert w und dem Zustandsvektor x kann unterschiedliche Formen besitzen. Die einfachste Form ist
The control law for calculating the manipulated variable u from the setpoint w and the state vector x can have different forms. The simplest form is

hierbei ist θ ^T ξ(w, x) ein universeller nichtlinearer Funktionsapproximator, z. B. ein neuronales Netzwerk mit radialen Basisfunktionen, ein allgemeines neuronales Regressionsnetzwerk oder ein bestimmter Typ von Fuzzy-Regler (s. Li-Xin Wang, Stable Adaptive Fuzzy Control of Nonlinear Systems). Der Vektor θ enthält die einstellbaren Parameter des Funktionsapproximators, der Vektor ξ enthält lokalisierte Basisfunktionen, von denen jede in einem begrenzten Bereich des Eingangsgrößenraumes zur Approximation beiträgt, im übrigen Eingangsgrößenraum dagegen Null oder näherungsweise Null ist.here θ ^T ξ (w, x ) is a universal nonlinear function approximator, e.g. B. a neural network with radial basic functions, a general neural regression network or a certain type of fuzzy controller (see Li-Xin Wang, Stable Adaptive Fuzzy Control of Nonlinear Systems). The vector θ contains the adjustable parameters of the function approximator, the vector ξ contains localized basic functions, each of which contributes to the approximation in a limited range of the input size space, whereas in the remaining input size space it is zero or approximately zero.

Dieses Regelgesetz ist nur für Strecken einsetzbar, bei denen die Lie-Ableitung L_gL n-1|fh(x) (s. u.) kon stant ist. Ist L_gL n-1|fh(x) nicht konstant, findet das Regelgesetz
This rule of law can only be used for lines where the Lie derivative L _g L n-1 | fh ( x ) (see below) is constant. If L _g L n-1 | fh ( x ) is not constant, the control law finds

Anwendung. Im Gegensatz zu Regelgesetzen der Form (1) benötigen diese Regelgesetze keine Differentiation des Fehlers e und damit der Regelgröße y. Ihre Einsetzbarkeit beruht auf dem Hilfssatz 1. (Die hier ange gebenen mathematischen Gleichungen sollen nur das grundsätzliche Prinzip des Verfahrens verdeutlichen. Die praktische Realisierung kann nach diesen Gleichungen erfolgen oder auch durch andere mathematische Darstellungsweisen, die dasselbe Grundprinzip realisieren.) Application. In contrast to rule laws of the form (1), these rule laws do not require differentiation the error e and thus the controlled variable y. Their applicability is based on auxiliary sentence 1. (The here The given mathematical equations are only intended to clarify the basic principle of the method. The practical implementation can be done according to these equations or by other mathematical Representations that implement the same basic principle.)

2.1.2 Control law with a reduced number of parameters

Bei den beiden soeben genannten Regelgesetzen ist die Stellgröße eine allgemeine nichtlineare Funktion aller Reglereingangsgrößen. Daher werden universelle Funktionsapproximatoren benötigt, welche alle Reglerein gangsgrößen als Eingangsgrößen besitzen. Dies führt bei Strecken höherer Ordnung zu Problemen mit dem Rechenzeit- und Speicherplatzbedarf sowie der Lernzeit. Eine mögliche Lösung besteht darin, das Regelge setz in lineare und nichtlineare Teilfunktionen aufzuteilen, die dann getrennt erlernt werden können. Eine solche Aufteilung kann aufgrund von Vorwissen über die Streckenstruktur vorgenommen werden. In vielen Fällen ist es möglich, die Differentialgleichungen der Strecke in bekannte und unbekannte Teilfunktionen aufzuteilen. Für dieses teilweise bekannte Streckenmodell kann dann über die Lie-Ableitungen das ideale Regelgesetz berechnet werden. Es hat die allgemeine Form (vgl. Hilfssatz 1)
With the two control laws just mentioned, the manipulated variable is a general non-linear function of all controller input variables. Therefore, universal function approximators are required, which have all controller input variables as input variables. For higher-order routes, this leads to problems with the computing time and storage space requirements and the learning time. One possible solution is to divide the rule into linear and non-linear sub-functions, which can then be learned separately. Such a division can be made based on previous knowledge of the route structure. In many cases it is possible to divide the differential equations of the system into known and unknown sub-functions. The ideal rule law can then be calculated for this partially known route model using the Lie derivatives. It has the general form (see auxiliary sentence 1)

wobei die Teilfunktionen f*_ki,j(x) bekannt und die Teilfunktionen f*_ui,j(x) unbekannt sind. b_1m ist der erste Zählerkoeffizient des Referenzmodells. Der wesentliche Vorteil besteht nun darin, daß i.a. die unbekannten Teilfunktionen f*_ui,j(x) nicht von allen Reglereingangsgrößen abhängen, sondern nur von einigen bestimm ten Eingangsgrößen. Welche Teilfunktionen von welchen Eingangsgrößen abhängen, ist im voraus bekannt. Dieses Vorwissen wird zur Aufstellung des Regelgesetzes
where the subfunctions f * _{ki, j} ( x ) are known and the subfunctions f * _{ui, j} ( x ) are unknown. b _1m is the first counter coefficient of the reference model. The main advantage is that the unknown partial functions f * _{ui, j} ( x ) do not depend on all controller input variables, but only on some specific input variables. Which subfunctions depend on which input variables is known in advance. This prior knowledge becomes the establishment of the regular law

verwendet. Der Vorteil besteht nun darin, daß auch hier die unbekannten nichtlinearen Teilfunktionen θ T|i,jξ _i,j(x) nicht von allen Reglereingangsgrößen abhängen. Dadurch haben die Basisfunktionen eine niedri gere Dimension, und es werden insgesamt weniger Parameter benötigt (s. Beispiel in Abschnitt 2.2). (Die hier angegebenen mathematischen Gleichungen sollen nur das grundsätzliche Prinzip des Verfahrens ver deutlichen. Die praktische Realisierung kann nach diesen Gleichungen erfolgen oder auch durch andere mathematische Darstellungsweisen, die dasselbe Grundprinzip realisieren.)used. The advantage is that the unknown non-linear subfunctions θ T | i, j ξ _{i, j} ( x ) do not depend on all controller input variables. As a result, the basic functions have a lower dimension and fewer parameters are required overall (see example in section 2.2). (The mathematical equations given here are only intended to clarify the basic principle of the method. The practical implementation can be carried out according to these equations or also by other mathematical representations that implement the same basic principle.)

2.1.3 Regulatory law with the use of symmetry properties

Eine Reduktion der Parameteranzahl - und damit von Rechenzeit- und Speicherplatzbedarf sowie Lernzeit- kann durch Ausnutzung von Symmetrieeigenschaften erreicht werden. Die Dynamik vieler Strecken weist (näherungsweise) Symmetrieeigenschaften zum Punkt
A reduction of the number of parameters - and thus of computing time and storage space requirements as well as learning time - can be achieved by using symmetry properties. The dynamics of many lines show (approximately) symmetry properties to the point

aus denen folgt, daß
from which it follows that

ist. Folglich ist alle Information, die zur Beschreibung des Regelgesetzes erforderlich ist, in der Hälfte des Parametervektors enthalten. Um dies auszunutzen, werden auch die Basisfunktionen
is. Consequently, all of the information required to describe the control law is contained in half of the parameter vector. To take advantage of this, the basic functions are also

des nichtlinearen Funktionsapproximators symmetrisch zum Punkt
of the nonlinear function approximator symmetrical to the point

angeordnet. Der Basisfunktionsvektor ξ wird in zwei gleich große Untervektoren ξ ⁺ und ξ ⁻ unterteilt, so daß gilt:
arranged. The basic function vector ξ is divided into two subvectors ξ ⁺ and ξ ⁻ of the same size, so that:

(Liegt eine Basisfunktion genau symmetrisch zum Punkt
(If a basic function is exactly symmetrical to the point

so muß der zugehörige Parameter Null sein; daher kann diese Basisfunktion weggelassen werden.) Wird nun der Parametervektor θ ebenfalls in zwei entsprechende Untervektoren θ⁺ und θ⁻ aufgeteilt, kann ein neues Regelgesetz als u' = θ' ^T ξ' mit θ' = θ⁺ und ξ' = ξ⁺ - ξ⁻ geschrieben werden. Da θ⁺ = -θ⁻ ist (aufgrund der Symmetrie von f_c), ergibt sich u' = u. Das neue Regelgesetz liefert also die gleiche Stellgröße wie (3), kommt aber mit der Hälfte der Parameter aus. Die Adaption erfolgt genau wie für das ursprüngliche Regelgesetz, wobei ξ durch ξ' zu ersetzen ist.the associated parameter must be zero; Therefore, this basic function can be omitted.) Now, if the parameter vector also θ into two corresponding sub-vectors θ ⁺ and divided θ ⁻, a new control law as u '= θ' ^T ξ 'with θ' = θ ⁺ and ξ '= ξ ⁺ - ξ ⁻ can be written. Since θ ⁺ = - θ ⁻ (due to the symmetry of f _c ), u '= u. The new control law thus provides the same manipulated variable as (3), but manages with half the parameters. The adaptation is carried out exactly as for the original control law, whereby wobei is to be replaced by ξ ' .

Die Rechenzeiteffizienz kann erhöht werden, indem nicht alle Basisfunktionen ξ⁺ und ξ⁻ berechnet werden. Dies ist möglich, wenn die Aufteilung der Basisfunktionen entlang einer (Hyper-)ebene orthogonal zu einer Achse des Eingangsgrößenraumes erfolgt (d. h. beispielsweise x₁ = 0 als Trennungsebene). Dann sind nämlich die meisten Basisfunktionen auf einer Seite der Trennungsebene gleich Null (oder bei Funktionen, die nicht den Wert Null annehmen - wie z. B. radiale Gaußsche Basisfunktionen - vernachlässigbar klein). Lediglich Basisfunktionen, deren Zentren in der Nähe oder auf der Trennungsebene liegen, sind auf beiden Seiten der Trennungsebene von Null verschieden. Ersetzt man nun vor Berechnung der Basisfunktionen x₁ durch |x₁| (es wird davon ausgegangen, daß x₁ = 0 die Trennungsebene ist und die Basisfunktionen ξ +|k auf der Seite x₁ < 0 liegen), so sind die meisten Basisfunktionen ξ -|k konstant Null oder sehr klein und können bei der Berechnung weggelassen werden; d. h. die Berechnung von ξ' vereinfacht sich erheblich. Um die Betragsbildung von x₁ wieder auszugleichen, muß die berechnete Stellgröße mit dem Vorzeichen von x₁ multipliziert werden. The computing time efficiency can be increased by not calculating all basic functions ξ ⁺ and ξ ⁻. This is possible if the division of the basic functions takes place along a (hyper) plane orthogonal to an axis of the input quantity space (ie for example x ₁ = 0 as the separation plane). Then most of the basic functions on one side of the separation level are zero (or for functions that do not take the value zero - such as radial Gaussian basic functions - negligibly small). Only basic functions whose centers are close to or on the separation level are different from zero on both sides of the separation level. Now replace x ₁ with | x ₁ | before calculating the basic functions (it is assumed that x ₁ = 0 is the separation plane and the basic functions ξ + | k are on the side x ₁ <0), so most basic functions ξ - | k are constantly zero or very small and can be used in the calculation be omitted; ie the calculation of ξ ' is considerably simplified. To compensate for the amount of x ₁ again, the calculated manipulated variable must be multiplied by the sign of x ₁ .

Für die Regelgesetze (4) und (6) ist die Ausnutzung der Symmetrie nach demselben Prinzip möglich.For rule laws (4) and (6), symmetry can be used according to the same principle.

Bei Regelgesetz (4) erfolgt die Aufteilung getrennt für die Basisfunktionsvektoren ξ ₁(w, x) und ξ ₂(x). Bei Regelgesetz (6) erfolgt die Aufteilung getrennt für die Basisfunktionsvektoren ξ ₁ = [ξ T|1,1(x) . f*_k1,1(x), ξ T|1,2(x) . f*_k1,2(x), . . .]^T und ξ ₂ = [ξ T|2,1(x) . f*_k2,1(x), ξ T|2,2(x) . f*_k2,2(x), . . .]^T. Dabei sind die unterschiedlichen Symmetrieeigenschaften der bekannten Teilfunktionen f*_ki,j(x) zu beachten: es kann entweder gelten f*_ki,j(x) = f*_ki,j(-x) oder f*_ki,j(x) = -f*_ki,j(-x). Für die kombinierten Basisfunktionsvektoren ξ ₁ und ξ ₂ gilt also:
With control law (4), the division is made separately for the basic function vectors ξ ₁ (w, x ) and ξ ₂ ( x ). In the control law (6), the division takes place separately for the basic function vectors ξ ₁ = [ ξ T | 1,1 ( x ). f * _k1.1 ( x ), ξ T | 1.2 ( x ). f * _k1.2 ( x ) _,. . .] ^T and ξ ₂ = [ ξ T | 2.1 ( x ). f * _k2.1 ( x ), ξ T | 2.2 ( x ). f * _k2.2 ( x ) _,. . .] ^T. The different symmetry properties of the known subfunctions f * _{ki, j} ( x ) must be noted: either f * _{ki, j} ( x ) = f * _{ki, j} (- x ) or f * _{ki, j} ( x ) = -f * _{ki, j} (- x ). The following therefore applies to the combined basic function vectors ξ ₁ and gilt ₂ :

Für die Parameter gilt entsprechend θ +|k = -v_kθ -|k; das modifizierte Regelgesetz wird folglich mit θ' = θ⁺ und ξ' = ξ⁺ - Vξ⁻, V = diag(v_k) gebildet. (Die hier angegebenen mathematischen Gleichungen sollen nur das grundsätzliche Prinzip des Verfahrens verdeutlichen. Die praktische Realisierung kann nach diesen Gleichungen erfolgen oder auch durch andere mathematische Dar stellungsweisen, die dasselbe Grundprinzip realisieren.)The following applies correspondingly to the parameters θ + | k = -v _k θ - | k; the modified rule law is consequently formed with θ ' = θ ⁺ and ξ' = ξ ⁺ - V ξ ⁻, V = diag (v _k ). (The mathematical equations given here are only intended to clarify the basic principle of the method. The practical implementation can be carried out according to these equations or by other mathematical representations which implement the same basic principle.)

2.1.4 Adaptation law without differentiation of the controlled variable

Um im weiteren die Formeln zu vereinfachen, soll folgende Schreibweise vereinbart werden: Ist z(t) ein zeitabhängiges Signal, so wird mit z_f(t) dasjenige Signal bezeichnet, welches aus z(t) durch die lineare Filterung
In order to simplify the formulas below, the following notation should be agreed: If z (t) is a time-dependent signal, then z _f (t) denotes the signal that consists of z (t) through linear filtering

entsteht. arises.

Unter Verwendung dieser Schreibweise lautet das Adaptionsgesetz mit erweitertem Fehler für das Re gelgesetz (3)
Using this notation, the adaptation law with extended error for the control law is (3)

Dabei sind γ < 0 und
Here are γ <0 and

die Adaptionsverstärkungen und e* der erweiterte Fehler (s. J. Slotine und W. Li, Applied Nonlinear Control); der erweiterte Fehler ist definiert als
the adaptation gains and e * the extended error (see J. Slotine and W. Li, Applied Nonlinear Control); the extended error is defined as

mit dem Hilfsfehler
with the auxiliary error

_p und u_aux sind Hilfssignale, die aus folgendem Grund benötigt werden: Zur Berechnung des erweiterten Fehlers (10) ist ein Schätzwert für die Streckenverstärkung L_gL n-1|fh(x) (im weiteren kurz als k_p bezeichnet) erforderlich. Dieser Wert ist jedoch bei dem verwendeten Regelgesetz nicht explizit verfügbar, sondern impli zit in f_c(w, x, θ) enthalten; daher muß k_p durch einen internen Hilfsparameter _p geschätzt werden. Für die Adaption von _p wird das Hilfssignal u_aux benötigt, welches gemäß folgenden Gleichungen in Referenzmodell und Regelgesetz eingespeist wird:
_p and u _aux are auxiliary signals that are required for the following reason: To calculate the extended error (10), an estimate for the path gain L _g L n-1 | fh ( x ) (hereinafter referred to as k _{p for} short) is required. However, this value is not explicitly available with the rule of law used, but is implicitly contained in f _c (w, x , θ ); therefore k _p must be estimated by an internal auxiliary parameter _p . The auxiliary signal u _{aux is} required for the adaptation of _p , which is fed into the reference model and control law according to the following equations:

Für den Stabilitätsbeweis wird zunächst folgender Hilfssatz benötigt:The following auxiliary set is required to prove stability:

Auxiliary sentence 1

Gegeben sei eine SISO-Strecke der Form
Given a SISO range of the form

deren relativer Grad r gleich ihrer Ordnung n ist (Der relative Grad ist definiert als die Ordnung der niedrigsten Ableitung von y, die direkt von der Eingangsgröße u abhängt (s. J. Slotine und W. Li, Applied Nonlinear Control). Für lineare Systeme ist dies gleichbedeutend mit der bekannten Definition "Anzahl der Pole minus Anzahl der Nullstellen"), sowie ein lineares Referenzmodell
whose relative degree r is equal to their order n (the relative degree is defined as the order of the lowest derivative of y, which depends directly on the input variable u (see J. Slotine and W. Li, Applied Nonlinear Control). For linear systems this is synonymous with the well-known definition "number of poles minus number of zeros"), as well as a linear reference model

der Ordnung n mit dem Zählergrad Null. Existieren in einem bestimmten Bereich des Zustandsraumes die Lie-Ableitungen L n|fh(x) und L_gL n-1|fh(x) und ist in dem betreffenden Bereich L_gL n-1|fh(x) = const. ≠ 0, so führt das Regelgesetz
of order n with the degree zero. If the Lie derivatives L n | fh ( x ) and L _g L n-1 | fh ( x ) exist in a certain area of the state space and L _g L n-1 | fh ( x ) = const in the relevant area. ≠ 0, the rule of law leads

für diesen Bereich des Zustandsraumes zu der Fehlerdifferentialgleichung
for this area of the state space to the error differential equation

Der Beweis dieses Hilfssatzes beruht auf der Theorie der Input-Output-Linearisierung (s. J. Slotine und W. Li, Applied Nonlinear Control) und soll hier nicht weiter ausgeführt werden.The proof of this auxiliary sentence is based on the theory of input-output linearization (see J. Slotine and W. Li, Applied Nonlinear Control) and will not be discussed further here.

Der folgende Satz stellt für das Adaptionsgesetz (8)-(9) die Stabilität sicher:The following sentence ensures stability for the adaptation law (8) - (9):

Sentence 1

Gegeben sei ein adaptives System aus einer Strecke vom Typ (14)-(15), einem Referenzmodell nach Gl. (12), einem adaptiven Fuzzy-Regler nach Gl. (13) und dem Adaptionsgesetz (8)-(9). Dann gilt:
Given an adaptive system from a line of type (14) - (15), a reference model according to Eq. (12), an adaptive fuzzy controller according to Eq. (13) and the Adaptation Act (8) - (9). Then:

1. If the inherent approximation error d is sufficiently small, the parameter vector θ and the auxiliary parameter p are limited.
2. If d ∼ is 0 and ξ and u aux are constrained, then lim t → ∞ e = 0.

Proof of sentence 1

Unter Verwendung von Hilfssatz 1 erhält man für das beschriebene System folgende Differentialgleichung für den Fehler e:
Using auxiliary sentence 1, the following differential equation for the error e is obtained for the system described:

d. h.
ie

Für den erweiterten Fehler e* ergibt sich
For the extended error e * results

hierbei ist der erweiterte Parameterfehler
here is the extended parameter error

und der erweiterte Aktivierungsvektor
and the extended activation vector

Aus dem Adaptionsgesetz (8)-(9) ergibt sich
From the adaptation law (8) - (9) follows

mit
With

Zum Nachweis der Beschränktheit von
To demonstrate the limitation of

wird die Ljapunow-Funktion
becomes the Lyapunov function

verwendet. Ihre Ableitung nach der Zeit ist
used. Your derivative is after time

Ist d hinreichend klein, so ist ≦ 0; damit ist die Beschränktheit von
If d is sufficiently small, then ≦ 0; with that the limitation of

und somit die Beschränktheit von θ und _p nachgewiesen.and thus the limitation of θ and _{p is} demonstrated.

Um zu zeigen, daß lim_t→∞ e = 0 ist, wird zunächst bewiesen, daß lim_t→∞ e* = 0 ist. (Im folgenden wird von d ∼ 0 ausgegangen.) Hierzu wird die zweite Ableitung von V berechnet:
To show that lim _{t → ∞} e = 0, we first prove that lim _{t → ∞} e * = 0. (In the following, d ∼ 0 is assumed.) For this, the second derivative of V is calculated:

Aus der Beschränktheit von ξ und u_aux (s. Voraussetzung) und der Beschränktheit von
From the restriction of ξ and u _aux (see prerequisite) and the restriction of

folgt die Be schränktheit von
follows the limitation of

und damit die Beschränktheit von . Aus dem Lemma von Barbalat (s. J. Slotine und W. Li, Applied Nonlinear Control) folgt nun aber, daß dann lim_t→∞ = lim_t→∞(-e^*2) = 0 sein muß. Damit ist auch lim_t→∞ e^* = 0.and thus the limitation of. From Barbalat's lemma (see J. Slotine and W. Li, Applied Nonlinear Control) it follows that lim _{t → ∞} = lim _{t → ∞} (-e ^{* 2} ) = 0. Hence lim _{t → ∞} e ^* = 0.

Als nächster Schritt wird lim_t→∞ e_aux = 0 bewiesen. In K. Narendra und A. Annaswamy, Siable Adaptive Systems, wird gezeigt, daß dies der Fall ist, wenn ξ beschränkt und
The next step is lim _{t → ∞} e _aux = 0. K. Narendra and A. Annaswamy, Siable Adaptive Systems, show that this is the case when ξ is restricted and

quadratisch integrierbar ist. Die quadratische Integrierbarkeit von
can be integrated into a square. The square integrability of

ergibt sich aus
results from

Somit ist e^* quadratisch integrierbar; da
Thus e ^{* can be} integrated into a square; there

und ξ _f beschränkt ist, ist auch
and ξ _{f is} limited, too

quadratisch inte grierbar. Damit ist nachgewiesen, daß lim_t→∞ e_aux = 0 ist.square integrable. This proves that lim _{t → ∞} e _aux = 0.

Schließlich kann mit derselben Argumentation gezeigt werden, daß auch lim_t→∞((_p u_aux)_f - _pu_auxf) = 0 ist. Mit (10) folgt aus diesen Teilergebnissen lim_t→∞ e = 0, womit Satz 1 bewiesen ist.Finally, the same reasoning can be used to show that lim _{t → ∞} (( _p u _aux ) _f - _p u _auxf ) = 0. With (10) follows from these partial results lim _{t → folgt} e = 0, which proves Theorem 1.

Für das Regelgesetz (4) lautet das entsprechende Adaptionsgesetz
For the regulation law (4) the corresponding adaptation law is

mit
With

Der Hilfsparameter _p und das Hilfssignal u_aux können hier entfallen. Die Adaption von θ ₂ ist bei Unter schreiten einer vorgegebenen unteren Schranke anzuhalten, um eine Division durch Null im Regelgesetz zu verhindern.The auxiliary parameter _p and the auxiliary signal u _aux can be omitted here. The adaptation of θ ₂ is to be stopped when falling below a predetermined lower limit in order to prevent division by zero in the control law.

Für das Regelgesetz (6) lautet das Adaptionsgesetz
For the regulation law (6) is the adaptation law

mit
With

(Die hier angegebenen mathematischen Gleichungen sollen nur das grundsätzliche Prinzip des Verfahrens verdeutlichen. Die praktische Realisierung kann nach diesen Gleichungen erfolgen oder auch durch andere mathematische Darstellungsweisen, die dasselbe Grundprinzip realisieren.) (The mathematical equations given here are only intended to illustrate the basic principle of the method clarify. The practical implementation can be done according to these equations or by others mathematical representations that implement the same basic principle.)

2.1.5 Consideration of the manipulated variable limitation

Ein weiteres Problem der stabilen adaptiven Fuzzy-Regelung bestand bislang darin, daß sie auf Strecken mit Stellgrößenbegrenzung nicht anwendbar war. Die Herleitung des Verfahrens geht davon aus, daß beliebig hohe Stellamplituden möglich sind; Simulationen zeigen, daß das Vorhandensein einer Stellgrößenbegrenzung zu Instabilität (d. h. unbeschränktem Anwachsen der zu adaptierenden Parameter) führt. Das resultieren de Regelungsverhalten ist durch ständiges Hin- und Herschalten der Stellgröße zwischen positiver und negativer Begrenzung gekennzeichnet. Auch dieses Problem konnte jedoch durch eine Weiterentwicklung des Adaptionsgesetzes gelöst werden. Die Modifikation beruht im wesentlichen auf einer Abschaltung der Adaption während der Begrenzungsphase sowie der Verwendung eines modifizierten Referenzsignals. Das Adaptionsgesetz (8) wird modifiziert zu
Another problem with stable adaptive fuzzy control has so far been that it was not applicable to lines with manipulated variable limitation. The derivation of the method assumes that actuation amplitudes of any desired height are possible; Simulations show that the existence of a manipulated variable limit leads to instability (ie unlimited increase in the parameters to be adapted). The resulting control behavior is characterized by constant switching of the manipulated variable between positive and negative limitation. However, this problem could also be solved by further developing the adaptation law. The modification is essentially based on switching off the adaptation during the limitation phase and using a modified reference signal. The adaptation law (8) is modified to

das Adaptionsgesetz (9) wird entsprechend modifiziert. Die Bedingung (ue*ξ _f < 0 ∧ |e| < ε_e) wird benötigt, um ein "Hängenbleiben" in der Stellgrößenbegrenzung zu verhindern, falls alle aktiven Parameter durch Überschwingen < u_max (oder < -u_max) werden sollten. In diesem Fall wird der Fehler irgendwann < ε_e, und die Parameter verlassen die Begrenzung wieder. Theoretisch könnte ε_e, auf Null gesetzt werden; Simulationsergebnisse haben jedoch gezeigt, daß ein Wert von ε_e = (0.1 . . . 1.0) . max(|y_m(t)|) zu einem besseren Verhalten der Parameter führt. Wird ε_e zu Null gesetzt, ist die Adaption für Parameter, die von der Begrenzung betroffen sind, "unsymmetrisch" (Adaption zu Null hin ist möglich, während Adaption von Null weg während der Begrenzung nicht möglich ist); dies führt in Verbindung mit dem inhärenten Approximationsfehler zu oszillatorischem Verhalten der Parameter.the adaptation law (9) is modified accordingly. The condition (ue * ξ _f <0 ∧ | e | <ε _e ) is required in order to prevent "getting stuck" in the manipulated variable limitation if all active parameters should become <u _max (or <-u _max ) due to overshoot. In this case the error becomes <ε _e at some point and the parameters leave the limit again. Theoretically, ε _e , could be set to zero; However, simulation results have shown that a value of ε _e = (0.1 ... 1.0). max (| y _m (t) |) leads to better behavior of the parameters. If ε _e is set to zero, the adaptation for parameters that are affected by the limitation is "asymmetrical" (adaptation to zero is possible, whereas adaptation away from zero is not possible during the limitation); in connection with the inherent approximation error, this leads to oscillatory behavior of the parameters.

Weiterhin muß zur Erreichung stabilen Verhaltens ein modifiziertes Referenzsignal y_mm eingeführt wer den. Das modifizierte Referenzmodell hat die gleiche Dynamik wie das ursprüngliche Modell, aber seine Zustände y_mm, _mm, _mm, . . . y (n-1)|mm werden auf y, , , . . . y^(n-1) zurückgesetzt, wenn u die Begrenzung verläßt (In der Praxis wird das Referenzmodell zeitdiskret realisiert. Wenn u die Begrenzung verläßt, werden seine Zustände y_mm(k) . . . y_mm(k - n + 1) auf y(k) . . . y(k - n + 1) zurückgesetzt. Es ist also keine Differentiation von y erforderlich). Gleichzeitig werden die Zustände der linearen Filter für ξ _f und (θ ^T ξ)_f auf Null zurückgesetzt. (Die hier angegebenen mathematischen Gleichungen sollen nur das grundsätzliche Prinzip des Verfahrens verdeutlichen. Die praktische Realisierung kann nach diesen Gleichungen erfolgen oder auch durch andere mathematische Darstellungsweisen, die dasselbe Grundprinzip realisieren.)Furthermore, a modified reference signal y _mm must be introduced to achieve stable behavior. The modified reference model has the same dynamics as the original model, but its states y _mm , _mm , _mm,. . . y (n-1) | mm are on y,,,. . . y ^{(n-1) is} reset when u leaves the limit (In practice, the reference model is implemented in a time-discrete manner. When u leaves the limit, its states y _mm (k) ... y _mm (k - n + 1) open y (k) ... y (k - n + 1), so no differentiation of y is required). At the same time, the states of the linear filters for ξ _f and ( θ ^T ξ ) _{f are} reset to zero. (The mathematical equations given here are only intended to clarify the basic principle of the method. The practical implementation can be carried out according to these equations or by other mathematical representations which implement the same basic principle.)

2.2 Application of the method to the control of electrical drive systems with vibratory mechanics and non-linear friction characteristic

Im folgenden soll nun die Funktionsweise des beschriebenen Verfahrens an einem Anwendungsbeispiel ver deutlicht werden. Als unbekannte nichtlineare Strecke wurde ein Antriebssystem mit nichtlinearer Reibung verwendet (Abb. 2). Das Antriebssystem stellt einen Zwei-Massen-Torsionsschwinger dar. Die zu regeln de Ausgangsgröße ist die Drehzahl der Lastmasse x₁; die weiteren Zustandsgrößen sind der Verdrehwinkel der Welle x₂ und die Motordrehzahl x₃. Auf die Motormasse wirkt als Stellgröße das Luftspaltmoment u (Annahme: ideale Momentenregelung); der Betrag von u ist auf den Maximalwert u_max = 5,0 begrenzt. Auf die Lastmasse wirkt ein Reibmoment f_f(x₁), welches nichtlinear von der Lastdrehzahl x₁ abhängt. Die Reibungskennlinie wurde durch die Arcustangens-Funktion modelliert. Die Torsionseigenfrequenz beträgt ca. 50 Hz; die Dämpfung der Welle wurde zu Null angenommen.In the following, the operation of the described method will now be illustrated using an application example. A drive system with non-linear friction was used as the unknown non-linear path ( Fig. 2). The drive system is a two-mass torsional vibrator. The output variable to be controlled is the speed of the load mass x ₁ ; the other state variables are the angle of rotation of the shaft x ₂ and the motor speed x ₃ . The air gap torque u acts on the engine mass as a manipulated variable (assumption: ideal torque control); the amount of u is limited to the maximum value u _max = 5.0. A friction torque f _f (x ₁ ) acts on the load mass, which depends non-linearly on the load speed x ₁ . The friction characteristic was modeled using the arctangent function. The natural torsional frequency is approx. 50 Hz; the wave damping was assumed to be zero.

Der Entwurf des Referenzmodells erfolgte nach der Methode der Doppelverhältnisse; die Ersatzzeitkon stante des Referenzmodells wurde zu T_m = 20 ms vorgegeben.The design of the reference model was based on the double ratio method; the equivalent time constant of the reference model was specified as T _m = 20 ms.

Unter Ausnutzung des Vorwissens, daß die nichtlineare Reibung nur von x₁, aber nicht von x₂ und x₃ abhängt, wurde ein Regelgesetz der Form
Taking advantage of the prior knowledge that the nonlinear friction depends only on x ₁ , but not on x ₂ and x ₃ , a rule law of the form became

aufgestellt, bei dem alle unbekannten nichtlinearen Funktionen nur noch von x₁ abhängen. Für jede der unbekannten nichtlinearen Teilfunktionen wurde ein Funktionsapproximator mit 10 dreieckförmigen Basis funktionen verwendet. in which all unknown nonlinear functions only depend on x ₁ . A function approximator with 10 triangular base functions was used for each of the unknown nonlinear subfunctions.

Simulationsergebnisse für diesen Regler sind in Abb. 3 zu sehen. Der Regler wird durch Aufschalten eines Sollwertsignals w(t) trainiert, das alle 40 ms einen neuen Zufallswert im Bereich [-0,5, 0,5] annimmt. Nach 300, 1800 und 7200 s wird ein dreieckförmiges Testsignal aufgeschaltet, um das erreichte Lernergebnis zu beurteilen.Simulation results for this controller can be seen in Fig. 3. The controller is trained by applying a setpoint signal w (t) which assumes a new random value in the range [-0.5, 0.5] every 40 ms. A triangular test signal is applied after 300, 1800 and 7200 s to assess the learning result achieved.

Um zu demonstrieren, daß Stabilität und Konvergenz unabhängig von den Anfangsbedingungen sind, werden die Parametervektoren θ ₂, θ ₃, θ ₄ und θ ₅ aus (38) auf Null initialisiert; θ₁ wird auf den Anfangswert 10⁶ gesetzt. Es wird also keinerlei Vorwissen über die Parameter der Strecke zugrundegelegt. Zu Beginn der Adaption ist das Regelgesetz also
In order to demonstrate that stability and convergence are independent of the initial conditions, the parameter vectors θ ₂ , θ ₃ , θ ₄ and θ ₅ from (38) are initialized to zero; θ ₁ is set to the initial value 10 ⁶ . So no previous knowledge of the parameters of the route is used. At the beginning of the adaptation there is the rule law

es erfolgt somit lediglich eine Steuerung mit dem Sollwert w. Wie bereits erwähnt, geht der Adaptions vorgang in der Anfangsphase relativ schnell vonstatten. Nach ca. 300 s folgt die Lastdrehzahl bereits grob dem Referenzsignal y_m. Danach wird die Adaption jedoch wesentlich langsamer. Auch nach 1800 s sind - besonders im Bereich kleinerer Drehzahlen - noch deutliche Abweichungen zwischen y und y_m zu erkennen. Nach ca. 7200 s (d. h. zwei Stunden) sind Lastdrehzahl und Referenzsignal für den dreieckförmigen Sollwert verlauf praktisch deckungsgleich. Die verbleibenden Abweichungen bei zufallsverteiltem Sollwertverlauf sind auf die Stellgrößenbegrenzung zurückzuführen. Die Adaption wird bei Erreichen der Stellgrößenbegrenzung angehalten (s. 2.1.5); wie man sieht, bleiben durch diese Maßnahme Stabilität und Konvergenz erhalten, obwohl die Stellgrößenbegrenzung (aufgrund der schnellen Einstellung des Referenzmodells) bei fast jedem Sollwertsprung erreicht wird.it is therefore only controlled with the setpoint w. As already mentioned, the adaptation process takes place relatively quickly in the initial phase. After approx. 300 s, the load speed already roughly follows the reference signal y _m . After that, however, the adaptation becomes much slower. Even after 1800 s, clear deviations between y and y _m can still be seen, especially in the range of lower speeds. After approx. 7200 s (ie two hours), the load speed and the reference signal for the triangular setpoint are practically congruent. The remaining deviations in the case of a randomly distributed setpoint curve are due to the manipulated variable limitation. The adaptation is stopped when the manipulated variable limit is reached (see 2.1.5); As you can see, this measure maintains stability and convergence, although the manipulated variable limitation (due to the quick setting of the reference model) is achieved with almost every setpoint step change.

Claims

1. A method for stable direct adaptive control of non-linear paths with a structure which is not exactly known and parameters which are not exactly known, characterized in that the control law contains one or more universal non-linear function approximators with localized basic functions and that the parameters of these function approximators are set by an adaptation law, in which the time profiles of the basic functions are processed according to the principle of the extended error, which means that there is no differentiation of the controlled variable. All methods can be used as universal nonlinear function approximators with localized basic functions, in which the approximation value is calculated by scalar multiplication of a vector of adjustable parameters with a vector of basic functions arranged in the input size range, each basic function only in a specific area of the input size range is significantly different from zero. These include neural networks with radial basic functions, general neural regression networks and fuzzy controllers, the defuzzification of which is carried out using the singleton method (method according to Section 2.1.4 or a comparable method).

2. The method according to claim 1, characterized in that the controller as the input variables the setpoint and receives all state variables of the system, but no derivatives of the controlled variable. The controllers structure consists of either a universal nonlinear function approximator, which the Reg input values and which directly calculates the manipulated variable, or from two universal nonlinear function approximators to which the controller input variables are fed and their Output variables for determining the manipulated variable are divided (procedure according to section 2.1.1 or comparable procedure).

3. The method according to claim 1, characterized in that a division based on previous knowledge of the regulatory law takes place in several sub-functions, some of which are known in advance and at the rest of them know in advance which controller input variables they depend on. The rule law consists of the known sub-functions as well as universal non-linear function approximators, which learn the unknown sub-functions. These function approximators are not all controller input quantities are supplied, but only those of which the corresponding unknown really depend on the sub-functions (procedure according to Section 2.1.2 or comparable procedure).

4. The method according to claim 1-3, characterized in that by taking advantage of symmetry properties the number of adapted parameters and thus also the required learning time as well the computing time and storage space requirement is reduced. This is done by a suitable team menu of two parameters or the two associated basic functions, so that two Parameters are grouped together due to the symmetry properties against the same value or converge the same value with the opposite sign (procedure according to Section 2.1.3 or comparable procedure).

5. The method according to claim 1-4, characterized in that no adaptation of the parameters instead takes place as long as the manipulated variable is at the limit and that when the manipulated variable is exited limitation the state of the reference model is reset to the track state (procedure according to section 2.1.5 or comparable procedure).