OS/2 Shareware BBS: 10 Tools

home *** CD-ROM | disk | FTP | other *** search

/ OS/2 Shareware BBS: 10 Tools / 10-Tools.zip / perl501m.zip / pod / perlcall.pod < prev next >

Wrap

Text File | 1995-07-03 | 38KB | 1,365 lines

=head1 NAME perlcall - Perl calling conventions from C =head1 DESCRIPTION The purpose of this document is to show you how to write I<callbacks>, i.e. how to call Perl from C. The main focus is on how to interface back to Perl from a bit of C code that has itself been run by Perl, i.e. the 'main' program is a Perl script; you are using it to execute a section of code written in C; that bit of C code wants you to do something with a particular event, so you want a Perl sub to be executed whenever it happens. Examples where this is necessary include =over 5 =item * You have created an XSUB interface to an application's C API. A fairly common feature in applications is to allow you to define a C function that will get called whenever something nasty occurs. What we would like is for a Perl sub to be called instead. =item * The classic example of where callbacks are used is in an event driven program like for X-windows. In this case your register functions to be called whenever a specific events occur, e.g. a mouse button is pressed. =back Although the techniques described are applicable to embedding Perl in a C program, this is not the primary goal of this document. For details on embedding Perl in C refer to L<perlembed> (currently unwritten). Before you launch yourself head first into the rest of this document, it would be a good idea to have read the following two documents - L<perlapi> and L<perlguts>. This stuff is easier to explain using examples. But first here are a few definitions anyway. =head2 Definitions Perl has a number of C functions which allow you to call Perl subs. They are I32 perl_call_sv(SV* sv, I32 flags) ; I32 perl_call_pv(char *subname, I32 flags) ; I32 perl_call_method(char *methname, I32 flags) ; I32 perl_call_argv(char *subname, I32 flags, register char **argv) ; The key function is I<perl_call_sv>. All the other functions make use of I<perl_call_sv> to do what they do. I<perl_call_sv> takes two parameters, the first is an SV*. This allows you to specify the Perl sub to be called either as a C string (which has first been converted to an SV) or a reference to a sub. Example 7, shows you how you can make use of I<perl_call_sv>. The second parameter, C<flags>, is a general purpose option command. This parameter is common to all the I<perl_call_*> functions. It is discussed in the next section. The function, I<perl_call_pv>, is similar as I<perl_call_sv> except it expects it's first parameter to be a C char* which identifies the Perl sub you want to call, e.g. C<perl_call_pv("fred", 0)>. The function I<perl_call_method> expects its first argument to contain a blessed reference to a class. Using that reference it looks up and calls C<methname> from that class. See example 9. I<perl_call_argv> calls the Perl sub specified by the C<subname> parameter. It also takes the usual C<flags> parameter. The final parameter, C<argv>, consists of a list of C strings to be sent to the Perl sub. See example 8. All the functions return a number. This is a count of the number of items returned by the Perl sub on the stack. As a general rule you should I<always> check the return value from these functions. Even if you are only expecting a particular number of values to be returned from the Perl sub, there is nothing to stop someone from doing something unexpected - don't say you haven't been warned. =head2 Flag Values The C<flags> parameter in all the I<perl_call_*> functions consists of any combination of the symbols defined below, OR'ed together. =over 5 =item G_SCALAR Calls the Perl sub in a scalar context. Whatever the Perl sub actually returns, we only want a scalar. If the perl sub does return a scalar, the return value from the I<perl_call_*> function will be 1 or 0. If 1, then the value actually returned by the Perl sub will be contained on the top of the stack. If 0, then you have used the G_DISCARD flag. It is important to note that if the Perl sub returns a list, the I<perl_call_*> function will still only return 1 or 0. If 1, then the number of elements in the list will be stored on top of the stack. The actual values which were contained in the list will not be accessible. G_SCALAR is the default flag setting for all the functions. =item G_ARRAY Calls the Perl sub in a list context. The return code from the I<perl_call_*> functions will indicate how many elements of the stack are used to store the array. See example 4. =item G_DISCARD If you are not interested in the values returned by the Perl sub then setting this flag will make Perl get rid of them automatically for you. This will take precedence to either G_SCALAR or G_ARRAY. If you do not set this flag then you need to make sure that any temporaries (i.e. parameters passed to the Perl sub and values returned from the sub) are disposed of yourself. Example 3 gives details of how to explicitly dispose of these temporaries and the section I<Using Perl to dispose of temporaries> discusses the specific circumstances where you can ignore the problem and let Perl deal with it for you. =item G_NOARGS If you are not passing any parameters to the Perl sub, you can save a bit of time by setting this flag. It has the effect of not creating the C<@_> array for the Perl sub. A point worth noting is that if this flag is specified the Perl sub you have called can still access an C<@_> array from a previous Perl sub. This functionality can be illustrated with the perl code below sub fred { print "@_\n" } sub joe { &fred } &joe(1,2,3) ; This will print 1 2 3 What has happened is that C<fred> accesses the C<@_> array which belongs to C<joe>. =item G_EVAL It is possible for the Perl sub you are calling to terminate abnormally, e.g. by calling I<die> explicitly or by not actually existing. If you want to trap this type of event, specify the G_EVAL flag. It will put an I<eval { }> around the sub call. Whenever control returns from the I<perl_call_*> function you need to check the C<$@> variable as you would in a normal Perl script. The value returned from the I<perl_call_*> function is dependent on what other flags have been specified and whether an error has occurred. If the I<perl_call_*> function returns normally, then the value returned is as specified in the previous sections. If G_DISCARD is specified, the return value will always be 0 - this will override both G_SCALAR and G_ARRAY. If G_ARRAY is specified I<and> an error has occurred, the return value will always be 0. If G_SCALAR is specified I<and> an error has occurred, the return value will be 1 and the value on the top of the stack will be I<undef>. This means that if you have already detected the error by checking C<$@> and you want the program to continue, you must remember to pop the I<undef> from the stack. See example 6 for details of using G_EVAL. Please Note: If you are intending to make use of the G_EVAL flag in your code use a version of Perl greater than 5.000. There is a known bug in version 5.000 of Perl which means that G_EVAL will not work as described above. =back =head1 EXAMPLES Enough of the definition talk, let's have a few examples. Perl provides many macros to assist in accessing the Perl stack. Wherever possible, these macros should always be used when interfacing to Perl internals. Hopefully this should make the code less vulnerable to any changes made to Perl in the future. Another point worth noting is that in the first series of examples I have only made use of the I<perl_call_pv> function. This has only been done to ease you into the topic. Wherever possible, if the choice is between using I<perl_call_pv> and I<perl_call_sv>, you should always try to use I<perl_call_sv>. See example 7 for details. =head2 Example1: No Parameters, Nothing returned This first trivial example will call a Perl sub, I<PrintUID>, to print out the UID of the process. sub PrintUID { print "UID is $<\n" ; } and here is the C to call it void call_PrintUID() { dSP ; PUSHMARK(sp) ; perl_call_pv("PrintUID", G_DISCARD|G_NOARGS) ; } Simple, eh. A few points to note about this example. =over 5 =item 1. We aren't passing any parameters to I<PrintUID> so G_NOARGS can be specified. =item 2. Ignore C<dSP> and C<PUSHMARK(sp)> for now. They will be discussed in the next example. =item 3. We aren't interested in anything returned from I<PrintUID>, so G_DISCARD is specified. Even if I<PrintUID> was changed to actually return some value(s), having specified G_DISCARD will mean that they will be wiped by the time control returns from I<perl_call_pv>. =item 4. Because we specified G_DISCARD, it is not necessary to check the value returned from I<perl_call_pv>. It will always be 0. =item 5. As I<perl_call_pv> is being used, the Perl sub is specified as a C string. =back =head2 Example 2: Passing Parameters Now let's make a slightly more complex example. This time we want to call a Perl sub, C<LeftString>, which will take 2 parameters - a string (C<$s>) and an integer (C<$n>). The sub will simply print the first C<$n> characters of the string. So the Perl sub would look like this sub LeftString { my($s, $n) = @_ ; print substr($s, 0, $n), "\n" ; } The C function required to call I<LeftString> would look like this. static void call_LeftString(a, b) char * a ; int b ; { dSP ; PUSHMARK(sp) ; XPUSHs(sv_2mortal(newSVpv(a, 0))); XPUSHs(sv_2mortal(newSViv(b))); PUTBACK ; perl_call_pv("LeftString", G_DISCARD); } Here are a few notes on the C function I<call_LeftString>. =over 5 =item 1. The only flag specified this time is G_DISCARD. As we are passing 2 parameters to the Perl sub this time, we have not specified G_NOARGS. =item 2. Parameters are passed to the Perl sub using the Perl stack. This is the purpose of the code beginning with the line C<dSP> and ending with the line C<PUTBACK>. =item 3. If you are going to put something onto the Perl stack, you need to know where to put it. This is the purpose of the macro C<dSP> - it declares and initializes a I<local> copy of the Perl stack pointer. All the other macros which will be used in this example require you to have used this macro. If you are calling a Perl sub directly from an XSUB function, it is not necessary to explicitly use the C<dSP> macro - it will be declared for you. =item 4. Any parameters to be pushed onto the stack should be bracketed by the C<PUSHMARK> and C<PUTBACK> macros. The purpose of these two macros, in this context, is to automatically count the number of parameters you are pushing. Then whenever Perl is creating the C<@_> array for the sub, it knows how big to make it. The C<PUSHMARK> macro tells Perl to make a mental note of the current stack pointer. Even if you aren't passing any parameters (like in Example 1) you must still call the C<PUSHMARK> macro before you can call any of the I<perl_call_*> functions - Perl still needs to know that there are no parameters. The C<PUTBACK> macro sets the global copy of the stack pointer to be the same as our local copy. If we didn't do this I<perl_call_pv> wouldn't know where the two parameters we pushed were - remember that up to now all the stack pointer manipulation we have done is with our local copy, I<not> the global copy. =item 5. Next, we come to XPUSHs. This is where the parameters actually get pushed onto the stack. In this case we are pushing a string and an integer. See the section I<XSUB's AND THE ARGUMENT STACK> in L<perlguts> for details on how the XPUSH macros work. =item 6. Finally, I<LeftString> can now be called via the I<perl_call_pv> function. =back =head2 Example 3: Returning a Scalar Now for an example of dealing with the values returned from a Perl sub. Here is a Perl sub, I<Adder>, which takes 2 integer parameters and simply returns their sum. sub Adder { my($a, $b) = @_ ; $a + $b ; } Since we are now concerned with the return value from I<Adder>, the C function required to call it is now a bit more complex. static void call_Adder(a, b) int a ; int b ; { dSP ; int count ; ENTER ; SAVETMPS; PUSHMARK(sp) ; XPUSHs(sv_2mortal(newSViv(a))); XPUSHs(sv_2mortal(newSViv(b))); PUTBACK ; count = perl_call_pv("Adder", G_SCALAR); SPAGAIN ; if (count != 1) croak("Big trouble\n") ; printf ("The sum of %d and %d is %d\n", a, b, POPi) ; PUTBACK ; FREETMPS ; LEAVE ; } Points to note this time are =over 5 =item 1. The only flag specified this time was G_SCALAR. That means the C<@_> array will be created and that the value returned by I<Adder> will still exist after the call to I<perl_call_pv>. =item 2. Because we are interested in what is returned from I<Adder> we cannot specify G_DISCARD. This means that we will have to tidy up the Perl stack and dispose of any temporary values ourselves. This is the purpose of ENTER ; SAVETMPS ; at the start of the function, and FREETMPS ; LEAVE ; at the end. The C<ENTER>/C<SAVETMPS> pair creates a boundary for any temporaries we create. This means that the temporaries we get rid of will be limited to those which were created after these calls. The C<FREETMPS>/C<LEAVE> pair will get rid of any values returned by the Perl sub, plus it will also dump the mortal SV's we have created. Having C<ENTER>/C<SAVETMPS> at the beginning of the code makes sure that no other mortals are destroyed. =item 3. The purpose of the macro C<SPAGAIN> is to refresh the local copy of the stack pointer. This is necessary because it is possible that the memory allocated to the Perl stack has been re-allocated whilst in the I<perl_call_pv> call. If you are making use of the Perl stack pointer in your code you must always refresh the your local copy using SPAGAIN whenever you make use of of the I<perl_call_*> functions or any other Perl internal function. =item 4. Although only a single value was expected to be returned from I<Adder>, it is still good practice to check the return code from I<perl_call_pv> anyway. Expecting a single value is not quite the same as knowing that there will be one. If someone modified I<Adder> to return a list and we didn't check for that possibility and take appropriate action the Perl stack would end up in an inconsistent state. That is something you I<really> don't want to ever happen. =item 5. The C<POPi> macro is used here to pop the return value from the stack. In this case we wanted an integer, so C<POPi> was used. Here is the complete list of POP macros available, along with the types they return. POPs SV POPp pointer POPn double POPi integer POPl long =item 6. The final C<PUTBACK> is used to leave the Perl stack in a consistent state before exiting the function. This is necessary because when we popped the return value from the stack with C<POPi> it only updated our local copy of the stack pointer. Remember, C<PUTBACK> sets the global stack pointer to be the same as our local copy. =back =head2 Example 4: Returning a list of values Now, let's extend the previous example to return both the sum of the parameters and the difference. Here is the Perl sub sub AddSubtract { my($a, $b) = @_ ; ($a+$b, $a-$b) ; } and this is the C function static void call_AddSubtract(a, b) int a ; int b ; { dSP ; int count ; ENTER ; SAVETMPS; PUSHMARK(sp) ; XPUSHs(sv_2mortal(newSViv(a))); XPUSHs(sv_2mortal(newSViv(b))); PUTBACK ; count = perl_call_pv("AddSubtract", G_ARRAY); SPAGAIN ; if (count != 2) croak("Big trouble\n") ; printf ("%d - %d = %d\n", a, b, POPi) ; printf ("%d + %d = %d\n", a, b, POPi) ; PUTBACK ; FREETMPS ; LEAVE ; } If I<call_AddSubtract> is called like this call_AddSubtract(7, 4) ; then here is the output 7 - 4 = 3 7 + 4 = 11 Notes =over 5 =item 1. We wanted array context, so we used G_ARRAY. =item 2. Not surprisingly C<POPi> is used twice this time because we were retrieving 2 values from the stack. The main point to note is that they came off the stack in reverse order. =back =head2 Example 5: Returning Data from Perl via the parameter list It is also possible to return values directly via the parameter list - whether it is actually desirable to do it is another matter entirely. The Perl sub, I<Inc>, below takes 2 parameters and increments each directly. sub Inc { ++ $_[0] ; ++ $_[1] ; } and here is a C function to call it. static void call_Inc(a, b) int a ; int b ; { dSP ; int count ; SV * sva ; SV * svb ; ENTER ; SAVETMPS; sva = sv_2mortal(newSViv(a)) ; svb = sv_2mortal(newSViv(b)) ; PUSHMARK(sp) ; XPUSHs(sva); XPUSHs(svb); PUTBACK ; count = perl_call_pv("Inc", G_DISCARD); if (count != 0) croak ("call_Inc: expected 0 values from 'Inc', got %d\n", count) ; printf ("%d + 1 = %d\n", a, SvIV(sva)) ; printf ("%d + 1 = %d\n", b, SvIV(svb)) ; FREETMPS ; LEAVE ; } To be able to access the two parameters that were pushed onto the stack after they return from I<perl_call_pv> it is necessary to make a note of their addresses - thus the two variables C<sva> and C<svb>. The reason this is necessary is that the area of the Perl stack which held them will very likely have been overwritten by something else by the time control returns from I<perl_call_pv>. =head2 Using G_EVAL Now an example using G_EVAL. Below is a Perl subroutine which computes the difference of its 2 parameters. If this would result in a negative result, the subroutine calls I<die>. sub Subtract { my ($a, $b) = @_ ; die "death can be fatal\n" if $a < $b ; $a - $b ; } and some C to call it static void call_Subtract(a, b) int a ; int b ; { dSP ; int count ; SV * sv ; ENTER ; SAVETMPS; PUSHMARK(sp) ; XPUSHs(sv_2mortal(newSViv(a))); XPUSHs(sv_2mortal(newSViv(b))); PUTBACK ; count = perl_call_pv("Subtract", G_EVAL|G_SCALAR); SPAGAIN ; /* Check the eval first */ sv = GvSV(gv_fetchpv("@", TRUE, SVt_PV)); if (SvTRUE(sv)) { printf ("Uh oh - %s\n", SvPV(sv, na)) ; POPs ; } else { if (count != 1) croak ("call_Subtract : expected 1 value from 'Subtract', got %d\n", count) ; printf ("%d - %d = %d\n", a, b, POPi) ; } PUTBACK ; FREETMPS ; LEAVE ; } If I<call_Subtract> is called thus call_Subtract(4, 5) the following will be printed Uh oh - death can be fatal Notes =over 5 =item 1. We want to be able to catch the I<die> so we have used the G_EVAL flag. Not specifying this flag would mean that the program would terminate immediately at the I<die>. =item 2. The code sv = GvSV(gv_fetchpv("@", TRUE, SVt_PV)); if (SvTRUE(sv)) printf ("Uh oh - %s\n", SvPVx(sv, na)) ; is the equivalent of this bit of Perl print "Uh oh - $@\n" if $@ ; =item 3. Note that the stack is popped using C<POPs> in the block where C<SvTRUE(sv)> is true. This is necessary because whenever a I<perl_call_*> function invoked with G_EVAL|G_SCALAR returns an error, the top of the stack holds the value I<undef>. As we want the program to continue after detecting this error, it is essential that the stack is tidied up by removing the I<undef>. =back =head2 Example 7: Using perl_call_sv In all the previous examples I have 'hard-wired' the name of the Perl sub to be called from C. Most of the time though, it is more convenient to be able to specify the name of the Perl sub from within the Perl script. Consider the Perl code below sub fred { print "Hello there\n" ; } CallSub("fred") ; here is a snippet of XSUB which defines I<CallSub>. void CallSub(name) char * name CODE: PUSHMARK(sp) ; perl_call_pv(name, G_DISCARD|G_NOARGS) ; That is fine as far as it goes. The thing is, it only allows the Perl sub to be specified as a string. For perl 4 this was adequate, but Perl 5 allows references to subs and anonymous subs. This is where I<perl_call_sv> is useful. The code below for I<CallSub> is identical to the previous time except that the C<name> parameter is now defined as an SV* and we use I<perl_call_sv> instead of I<perl_call_pv>. void CallSub(name) SV* name CODE: PUSHMARK(sp) ; perl_call_sv(name, G_DISCARD|G_NOARGS) ; As we are using an SV to call I<fred> the following can all be used CallSub("fred") ; Callsub(\&fred) ; $ref = \&fred ; CallSub($ref) ; CallSub( sub { print "Hello there\n" } ) ; As you can see, I<perl_call_sv> gives you much greater flexibility in how you can specify the Perl sub. =head2 Example 8: Using perl_call_argv Here is a Perl sub which prints whatever parameters are passed to it. sub PrintList { my(@list) = @_ ; foreach (@list) { print "$_\n" } } and here is an example of I<perl_call_argv> which will call I<PrintList>. call_PrintList { dSP ; char * words[] = {"alpha", "beta", "gamma", "delta", NULL } ; perl_call_argv("PrintList", words, G_DISCARD) ; } Note that it is not necessary to call C<PUSHMARK> in this instance. This is because I<perl_call_argv> will do it for you. =head2 Example 9: Using perl_call_method Consider the following Perl code { package Mine ; sub new { my($type) = shift ; bless [@_] } sub Display { my ($self, $index) = @_ ; print "$index: $$self[$index]\n" ; } } It just implements a very simple class to manage an array. The only method defined is C<Display> which prints out a single element of the array. Here is an all Perl example of using it. $a = new Mine ('red', 'green', 'blue') ; $a->Display(1) ; will print 1: green Calling a Perl method from C is fairly straightforward - a reference to the object, the name of the method and any other parameters specific to the method are all that is required. Here is a simple XSUB which illustrates the mechanics of calling the C<Display> method from C. void call_Display(ref, method, index) SV * ref char * method int index CODE: PUSHMARK(sp); XPUSHs(ref); XPUSHs(sv_2mortal(newSViv(index))) ; PUTBACK; perl_call_method(method, G_DISCARD) ; C<call_Display> expects it's first parameter to be a reference to an object, the second to be the name of the method, and the third is the index. So the method C<Display> method can be invoked like this $a = new Mine ('red', 'green', 'blue') ; call_Method($a, 'Display', 2) ; The only thing to note is that the reference to the object (C<ref>) and the C<index> parameter are is pushed onto the stack before calling I<perl_call_method>. The method name C<method> does not get pushed onto the stack, it is used as the first parameter to I<perl_call_method>. =head2 Using Perl to dispose of temporaries In the examples given to date, any temporaries created in the callback ( i.e parameters passed on the stack to the I<perl_call_*> function or values returned via the stack) have been freed by =over 5 =item * specifying the G_DISCARD flag with I<perl_call_*>. =item * explicitly disposed of using the C<ENTER>/C<SAVETMPS> - C<FREETMPS>/C<LEAVE> pairing. =back There is another method which can be used, namely letting Perl do it for you automatically whenever it regains control after the callback has terminated. This is done by simply not using the ENTER ; SAVETMPS ; ... FREETMPS ; LEAVE ; sequence in the callback (and not, of course, specifying the G_DISCARD flag). If you are going to use this method you have to be aware of a possible memory leak which can arise under very specific circumstances. To explain these circumstances you need to know a bit about the flow of control between Perl and the callback routine. The examples given at the start of the document (an error handler and an event driven program) are typical of the two main sorts of flow control that you are likely to encounter with callbacks. There is a very important distinction between them, so pay attention. In the first example, an error handler, the flow of control could be as follows. You have created an interface to an external library and control has reached the library thus perl --> XSUB --> external library Whilst control is in the library, an error condition occurs. You have previously set up a Perl callback to handle this situation, so the flow of control will be external library --> XSUB --> perl_call --+ | perl <-- XSUB <-- external library <-- XSUB <---------------+ After processing of the error using I<perl_call_*> is completed, control reverts back to Perl more or less immediately. The important thing to note is that it is only now that control is back with Perl that any temporaries you have left hanging around will be freed. In the second example, an event driven program, the flow of control will be more like this event handler --> XSUB --> perl_call --+ | +--------<---------------+ | event handler +-> XSUB --> perl_call --+ | +----------<-------------+ | event handler +-> XSUB --> perl_call --+ | +----------<-------------+ | event handler +-> XSUB --> perl_call --+ In this case the flow of control can consist only of the repeated sequence event handler -> XSUB -> perl_call for the practically the complete duration of the program. This means that control may never drop back to Perl proper. So what is the big problem? Well, if you are expecting Perl to tidy up those temporaries for you, you might be in for a long wait. For Perl to actually dispose of your temporaries, control must drop back to it at some stage. In the event driven scenario that may never happen. This means that as time goes on, your program will create more and more temporaries, none of which will ever be freed. As each of these temporaries consumes some memory you program will eventually consume all the available memory in your system - kapow! So here is the bottom line - if you are sure that control will revert back to Perl fairly quickly after the end of your callback, then it isn't absolutely necessary to explicitly dispose of any temporaries you may have created. Mind you, if you are at all uncertain about what to do, it doesn't do any harm to tidy up anyway. =head2 Strategies for storing Callback Context Information Potentially one of the trickiest problems to overcome when designing a callback interface can be figuring out how to store the mapping between the C callback function and the Perl equivalent. To help understand why this can be a real problem first consider how a callback is set up in an all C environment. Typically a C API will provide a function to register a callback. This will expect a pointer to a function as one of it's parameters. Below is a call to a hypothetical function C<register_fatal> which registers the C function to get called when a fatal error occurs. register_fatal(cb1) ; The single parameter C<cb1> is a pointer to a function, so you must have defined C<cb1> in your code, say something like this void cb1() { printf ("Fatal Error\n") ; exit(1) ; } Now change that to call a Perl sub instead static SV * callback ; void register_fatal(fn) CODE: SV * fn { /* Remember the Perl sub */ callback = fn ; /* register the callback with the external library */ register_fatal(cb1) ; } void cb1() { ... /* Call the Perl sub to process the callback */ perl_call_sv(callback) ; ... } where the Perl equivalent of C<register_fatal> and the callback it registers ,C<pcb1>, might look like this # Register the sub pcb1 register_fatal(\&pcb1) ; sub pcb1 { die "I'm dying...\n" ; } The mapping between the C callback and the Perl equivalent is stored in the global variable C<callback>. This will be adequate if you only ever need to have 1 callback registered at any time. An example could be an error handler like the code sketched out above. Remember though, repeated calls to C<register_fatal> will replace the previously registered callback function with the new one. Say though you want to interface to a library which allows asynchronous file i/o. In this case you may be able to register a callback whenever a read operation has completed. To be of any use we want to be able to call separate Perl subs for each file that is opened. As it stands, the error handler example above would not be adequate as it only allows a single callback to be defined at any time. What we require is a means of storing the mapping between the opened file and the Perl sub we want to be called for that file. Say the i/o library has a function C<asynch_read> which associates a C function C<callback> with a file handle C<fh> - this assumes that it has also provided some routine to open file file and so obtain the file handle. asynch_read(fh, callback) This may expect the C callback function of this form void callback(fh, buffer) int fh ; char * buffer ; { ... } To provide a Perl interface to this library we need to be able to map between the C<fh> parameter and the Perl sub we want called. A hash is a convenient mechanism for storing this mapping. The code below shows a possible implementation HV * Mapping = (HV*)NULL ; void asynch_read(fh, callback) int fh SV * callback CODE: { /* If the hash doesn't already exist, create it */ if (Mapping == (HV*)NULL) Mapping = newHV() ; /* Save the fh -> callback mapping */ hv_store(Mapping, &fh, sizeof(fh), callback, 0) ; /* Register with the C Library */ asynch_read(fh, asynch_read_if) ; } where C<asynch_read_if> could look like this static void asynch_read_if(fh, buffer) int fh ; char * buffer ; { dSP ; SV ** sv ; /* Get the callback associated with fh */ sv = hv_fetch(Mapping, &fh , sizeof(fh), FALSE) ; if (sv == (SV**)NULL) croak("Internal error...\n") ; PUSHMARK(sp) ; XPUSHs(sv_2mortal(newSViv(fh))) ; XPUSHs(sv_2mortal(newSVpv(buffer, 0))) ; PUTBACK ; /* Call the Perl sub */ perl_call_sv(*sv, G_DISCARD) ; } So the Perl interface would look like this # Register the Perl callback asynch_read(fh, \&callback1) ; sub callback1 { my($handle, $buffer) = @_ ; ... } The mapping between the C callback and Perl is stored in the global hash C<Mapping> this time. Using a hash has the advantage that it allows an unlimited number of callbacks to be registered. What if the interface provided by the C callback doesn't contain a parameter which allows the file handle to Perl sub mapping? Say in the asynchronous i/o package, the callback function only gets passed the C<buffer> parameter like this void callback(buffer) char * buffer ; { ... } Without the file handle there is no straightforward way to map from the C callback to the Perl sub. In this case a possible way around this problem is to pre-define a series of C functions to act as the interface to Perl, thus #define MAX_CB 3 typedef void (*FnMap)() ; static void fn1() ; static void fn2() ; static void fn3() ; FnMap Functions[MAX_CB] = { fn1, fn2, fn3 } ; SV * PerlSubs [MAX_CB] = { NULL, NULL, NULL } ; static void Pcb(index, buffer) int index ; char * buffer ; { dSP ; PUSHMARK(sp) ; XPUSHs(sv_2mortal(newSVpv(buffer, 0))) ; PUTBACK ; /* Call the Perl sub */ perl_call_sv(PerlSubs[index], G_DISCARD) ; } static void fn1(buffer) char * buffer ; { Pcb(0, buffer) ; } static void fn2(buffer) char * buffer ; { Pcb(1, buffer) ; } static void fn3(buffer) char * buffer ; { Pcb(2, buffer) ; } void asynch_read(fh, callback) int fh SV * callback CODE: { int index ; /* Find an empty entry */ for (index = 0 ; index < MAX_CB ; ++index) { if (PerlSubs[index] == NULL) break ; } if (index < MAX_CB) { /* Remember the Perl sub */ PerlSubs[index] = callback ; asynch_read(fh, Functions[index]) ; } else croak ("Too many callback functions registered\n") ; } In this case the functions C<fn1>, C<fn2> and C<fn3> are used to remember the Perl sub to be called. Each of the functions holds a separate hard-wired index which is used in the function C<Pcb> to access the C<PerlSubs> array and actually call the Perl sub. The obvious disadvantage of this technique is that there is a hard-wired limit (in this case 3) to the number of callbacks that can exist simultaneously. The only way to increase the limit is to modify the code to add more functions and then re-compile. None the less, as long as the number of functions is chosen with some care, it is still a workable solution and in some cases is the only one available. To summarize, here are a number of possible methods for you to consider for storing the mapping between C and the Perl callback =over 5 =item 1. Ignore the problem - Only allow 1 callback For a lot of situations, like interfacing to a error handler, this may be a perfectly adequate solution. =item 2. Create a sequence of callbacks - hard wired limit If it is impossible to tell from the parameters passed back from the C callback what the context is, then you need to create a sequence of C callback interface functions, and store pointers to each in an array. =item 3. Use a parameter to map to the Perl callback Use a hash to store the mapping between C and Perl. =back =head2 Alternate Stack Manipulation Although I have only made use of the C<POP*> macros to access values returned from Perl subs, it is also possible to bypass these macros and read the stack using the C<ST> macro (See L<perlapi> for a full description of the C<ST> macro). Most of the time the C<POP*> macros should be adequate, the main problem with them is that they force you to process the returned values in sequence. This may not be the most suitable way to process the values in some cases. Using the C<ST> macro will allow random access to the stack. The code below is example 4 recoded to use C<ST> instead of C<POP*>. static void call_AddSubtract2(a, b) int a ; int b ; { dSP ; I32 ax ; int count ; ENTER ; SAVETMPS; PUSHMARK(sp) ; XPUSHs(sv_2mortal(newSViv(a))); XPUSHs(sv_2mortal(newSViv(b))); PUTBACK ; count = perl_call_pv("AddSubtract", G_ARRAY); SPAGAIN ; sp -= count ; ax = (sp - stack_base) + 1 ; if (count != 2) croak("Big trouble\n") ; printf ("%d - %d = %d\n", a, b, SViv(ST(0))) ; printf ("%d + %d = %d\n", a, b, SViv(ST(1))) ; PUTBACK ; FREETMPS ; LEAVE ; } Notes =over 5 =item 1. Notice that it was necessary to define the variable C<ax>. This is because the C<ST> macro expects it to exist. If we were in an XSUB it would not be necessary to define C<ax> as it is already defined for you. =item 2. The code SPAGAIN ; sp -= count ; ax = (sp - stack_base) + 1 ; sets the stack up so that we can use the C<ST> macro. =item 3. Remember that as with example 4, the returned values are still indexed in reverse order. So C<ST(0)> refers to the last value and C<ST(count-1)> refers to the first. =back =head1 SEE ALSO L<perlapi>, L<perlguts>, L<perlembed> =head1 AUTHOR Paul Marquess <pmarquess@bfsec.bt.co.uk> Special thanks to the following people who assisted in the creation of the document. Jeff Okamoto, Tim Bunce, Nick Gianniotis. =head1 DATE Version 0.5, 6th January 1995