NetNews Usenet Archive 1992 #30

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1992 #30 / NN_1992_30.iso / spool / bit / listserv / csgl / 1912 < prev next >

Wrap

Text File | 1992-12-21 | 11.5 KB | 201 lines

Comments: Gated by NETNEWS@AUVM.AMERICAN.EDU Path: sparky!uunet!paladin.american.edu!auvm!BEN.DCIEM.DND.CA!MMT Message-ID: <9212190036.AA04344@chroma.dciem.dnd.ca> Newsgroups: bit.listserv.csg-l Date: Fri, 18 Dec 1992 19:36:07 EST Sender: "Control Systems Group Network (CSGnet)" <CSG-L@UIUCVMD.BITNET> From: mmt@BEN.DCIEM.DND.CA Subject: Re: Martin to Rick on Shannon Lines: 190 [Martin Taylor 921218 18:30] (Tom Bourbon 921218 14:38) >May I repeat and elaborate on an invitation I made about a year >ago? I will send you one of my programs in which two systems (two >people, the two hands of one person, two models, or a person and a >model) interact and produce controlled relationships. ... I guess I'd better try to describe, as I did a year or so ago, wherein information theory helps in the understanding of PCT. I didn't succeed in getting across then, and I'm not sure I'll do any better now. I should think that the prediction for your proposed system would be no better and no worse than you would get without it, because you are dealing with a transparent system of one control level. The understanding you get with information theory is not at the level of setting the parameters. If I were to try to develop a model to make predictions in your experiment, I expect it would look essentially identical to yours, because the key elements would be the gain and delays in the two interacting loops. Now consider the interchanges of a week or two ago about planning and prediction, continued in Bill's post of today to Allan Randall. In those, the situation is greatly different. The information required from the lower level for the upper level to maintain control through a hiatus in sensory acquisition depends greatly on the accuracy of control maintained at the lower level. Where does that come from, and where does it go? We come back to the fundamental basis of PCT. Why is it necessary, and is it sufficient? Let's take two limiting possibilities for how a world might be. Firstly, consider a predictable world. PCT is not necessary, because the desired effects can be achieved by executing a prespecified series of actions. No information need be acquired from the world. From the world's viewpoint, the organism is to some extent unpredictable, so the organism supplies information to the world. How much? That depends on the probabilities of the various plans as "perceived" by the world. At the other extreme, consider a random world, in which the state at t+delta is unpredictable from the state at t. PCT is not possible. There is no set of actions in the world that will change the information at the sensors. Now consider a realistic (i.e. chaotic) world. What does that mean? At time t one looks at the state of the world, and the probabilities of the various possible states at t+delta are thereby made different from what they would have been had you not looked at time t. If one makes an action A at time t, the probability distributions of states at time t+delta are different from what they would have been if action A had not occurred, and moreover, that difference is reflected in the probabilities of states of the sensor systems observing the state of the world. Action A can inform the sensors. PCT is possible. In a choatic world, delta matters. If delta is very small, the probability distribution of states at t+delta is tightly constrained by the state at t. If delta is very large, the probability distribution of states at t+delta is unaffected by the state at t (remember, we are dealing with observations and subjective probabilities, not frequency distributions--none of this works with frequentist probabilities; not much of anything works with frequentist probabilities!). Information is lost as time goes by, at a rate that can be described, depending on the kinds of observations and the aspect of the world that is observed. The central theme of PCT is that a perception in an ECS should be maintained as close as possible to a reference value. In other words, the information provided by the perception, given knowledge of the reference, should be as low as possible. But in the chaotic world, simple observation of the CEV provides a steady stream of information. The Actions must provide the same information to the world, so that the perception no longer provides any more information. Naturally that is impossible in detail, and the error does not stay uniformly zero. It conveys some of the information inherent in the chaotic nature of the world, though less than it would if the Actions did not occur. The Action bandwidth determines the rate at which information can be supplied by the world, the nature of the physical aspect of the world being affected, and the delta t between Action and sensing the affected CEV determines the information that will be given to the sensors (the unpredicted disturbances, in other words), and the bandwidth of the sensory systems determines how much information can be provided through the perceptual signal. Any one of these parts of the loop can limit the success of control, as measured by the information contained in the error signal. So far, the matter is straightforward and non-controversial, I think. Think of a set of orbits diverging in a phase space. The information given by an initial information is represented by a small region of phase space as compared to the whole space. After a little while, the set of orbits represented by the initial uncertainty has diverged, so the uncertainty has increased. Control is to maintain the small size, which means to supply information to the world. Things become more interesting when we go up a level in the hierarchy. Now we have to consider the source of information as being the error signals of the lower ECSs, given that the higher level has no direct sensory access to the world, and that all lower ECSs are actually controlling (both restrictions will be lifted later, especially the latter). Even though the higher ECSs may well take as sensory input the perceptual signals of the lower ECSs, nevertheless the information content (unpredictability) of those perceptual signals is that of the error, since the higher ECSs have information about their Actions (the references supplied to the lower ECSs) just as the lower ones have information about theis Actions in the world. The higher ECSs see a more stable world than do the lower ones, if the world allows control. (Unexpected events provide moments of high information content, but they can't happen often, or we are back in the uncontrollable world.) What does this mean? Firstly, the higher ECSs do not need one or both of high speed or high precision. The lower ECSs can take care of things at high information rates, leaving to the higher ECSs precisely those things that are not predicted by them--complexities of the world, and specifically things of a KIND that they do not incorporate in their predictions. In other words, the information argument does not specify what Bill's eleven levels are, but it does make it clear why there should BE level of the hierarchy that have quite different characteristics in their perceptual input functions. It is that kind of thing that I refer to as "understanding" PCT, not the making of predictions for simple linear phenomena. Linear models are fine when you have found the right ECS connections and have plugged in model parameters. I am talking about seeing why those models are as they are. Look, for example, at the attention and alerting discussions, which come absolutely straight from the Shannon theory. But the results of the (almost) a priori argument agree with the (despised) results of experiments in reading that I discussed in our 1983 Psychology of Reading. had I known about PCT then, I could have made a much stronger case than I did, but only because of Shannon. The whole notion of Layered Protocols in intelligent dialogue depends on Shannon, and demonstrates the impossibility of simple coding schemes (which some people have claimed as the basis of Shannon information). >I invite you to add to the PCT model any features of information >theory that you believe must be there. If necessary, use features >from information theory to replace those from PCT. Have I done that to your satisfaction? > If your changes >improve the predictions by the model, there will be no argument and >no complaint: You will have demonstrated that a person who does >not understand Shannon or information theory does not understand >PCT. An electricity meter reader does not need to understand the principles of electromagnetism to get an accurate meter reading. This challenge is misdirected. If there are places where I think the prediction would be improved, they are likely to be structural, such as in the division of attention, monitoring behaviour or some such. What should be improved, in general, is understanding, not meter reading. I said that if you don't understand Shannon, you won't understand PCT. I didn't say you won't be able to use PCT to make predictions. ================ Having said all that, I might soften a bit, and take the analogy of the use of the mathematically ideal observer in psychophysics. The ideal observer is assumed to take whatever information is in principle available in the signal, and to use it to determine whether or not some specified class of even has occurred. Trained psychoacoustic listeners often perform rather like an ideal observer who is presented with a signal some 3 or 4 dB weaker than the actual one. We say they are within 3 or 4 dB of ideal. Now we take the observer and add or eliminate possibilities for getting information about the event. For example, we may let the observer know what the waveform of the event would be if the event actually occurred, by presenting it to the other ear. They now approach the performance of an ideal observer who knows the waveform. Can they do this if the "cue" is delayed? It depends how much delay, and by looking at the performace over a variety of delays, we can tell something about what information the real observers are losing. Is it phase, frequency, or amplitude? Change the cue and do some more variation, and determine what the ideal observer might be capable of doing if it lacked this or that kind of information. By analogy, it might be possible to make ideal controller predictions, given different kinds of disturbance or sensory prediction aids. An example comes to mind (not of an ideal controller). A submarine reacts very slowly to changes in its control surfaces, but in stable water there are reasonably simple algorithms to determine where it will be if nothing changes in the control surfaces over the next few minutes (the chaotic world doesn't provide much information under these circumstances). So it is possible to make a display that shows a line indicating where the submarine will go if the steerer does nothing. If that's where it should go, fine. Otherwise the steerer moves the controls until the line goes where the submarine should go. But there are currents and so forth (the world provides information), so the submarine does not go where the line said it would. However, the line still predicts where it would NOW go if nothing happens, so the helm can still control that future position to some extent. How far should the line go? That depends on the information rate of the world. If the currents are swirling and unpredictable, probably it should not go very far, but if they are steady, they provide little information, and the line can compensate. Too long. I must go home. I hope that this has been helpful. Martin