Volume Number: 14 (1998)
Issue Number: 5
Column Tag: Programmer's Challenge

May 98 - Programmer's Challenge

by Bob Boonstra, Westford, MA

EggLob

What could he mean by "EggLob", you ask. Well nothing, except that it is an anagram of "Boggle®, the Parker Brothers game that is the inspiration for this month's Challenge. Boggle is played with 16 six-sided letter cubes and a 4x4 tray into which the cubes are shaken. The object is win as many points as possible by joining letters horizontally, vertically, or diagonally to form words. Longer words win more points. This month your Challenge is to solve a game like Boggle, with a few twists.

The prototype for the code you should write is:

#if defined (__cplusplus)
extern "C" {
#endif

long Boggle(
  long dimension,          /* puzzle is square, of size dimension, 4x4 
                              to 7x7 */
  char puzzle[],           /* letter at (row,col) is in
                              puzzle[col+dimension*row] */
  long dictSize,           /* number of words in dictionary */
  const char *dictionary[],/* legal words to use */
  char *wordsFound[],      /* return pointers to the words you found */
  void *privStorage        /* 20MB of storage for your use */
);

#if defined (__cplusplus)
}
#endif

Our first twist on the standard Boggle game is that the puzzle dimension will vary from 4x4 up to 7x7, inclusive. Your Boggle routine will provided the puzzle, a dimension x dimension array of characters. The second twist is that you are allowed to cheat, up to a point, anyway. You can cheat by picking up any of the cubes and exchanging it with any other cube. You can do this as many times as you like. You can exchange cubes, however, you cannot turn a cube so that a different letter is on top. (I've reduced your temptation to cheat in this fashion by not telling you what is on the other five faces of a cube.) Once you have rearranged the puzzle to your liking, you should find as many words as possible from the dictionary of dictSize words, copy the pointers to the words you find from the dictionary into the wordsFound array, and return the number of words you found. Your rearranged puzzle should replace the original puzzle and be returned to the calling routine.

Words are formed from letters that are adjacent vertically, horizontally, or diagonally. No letter from the puzzle may be used more than once within a single word. Words within other words are legal and scored separately. For purposes of matching against the dictionary and for scoring, the letter "Q" represents the two letters "QU". The dictionary may vary in size and content from puzzle to puzzle.

Words are awarded points based on length. No points are scored for words of fewer than 3 letters. Words of 3 or 4 letters earn 1 point, words of 5 letters earn 2 points, words of 6 letters earn 3 points, words of 7 letters earn 5 points, and words of n letters (n>7) earn 5 + 6*(n-7) points. Points are deducted based on how long your Boggle routine executes; 1 point will be deducted for every 20ms that your code executes on my 8500/200. Negative point totals are possible. Entries that do not return after a reasonable time (e.g., 5 minutes) will be terminated and assigned the associated negative score. Entries that claim to have found words that cannot legally be formed using the rearranged puzzle will similarly penalized.

The Challenge winner will be the entry that accumulates the largest point total in a sequence of puzzles.

The privStorage parameter will point to 20MB of preallocated storage for your use. The privStorage will be initialized to zero prior to the first call to Boggle. The same privStorage value will be provided on each subsequent call, and the contents of your private storage will persist from one call to the next.

This will be a native PowerPC Challenge, using the latest CodeWarrior environment. Solutions may be coded in C, C++, or Pascal. Thanks to Ernst Munter for suggesting this Boggle variant.

Three Months Ago Winner

One more time we congratulate Ernst Munter (Kanata, Ontario) for submitting the winning Challenge entry, this time in the February Image Decoder Challenge. The Challenge was to read a GIF image from a file, decode it, and return an offscreen GWorld containing the image. Ernst's code was about 5% faster than the second place entry by Patrick Varilly, and beat Patrick's entry in all but one of the test cases I used.

A couple of keys to the extra speed in Ernst's code. First, notice that almost all of the code is part of methods in the GIF class, and that all of those methods are inlined. This wouldn't be a good general technique for all C++ code, but it provided an edge in this application. Second, look closely at some of the loop constructs. For example, the portion of the PutMore method dealing with the common case, where all of the decoded pixels map into the current image line, compiles as follows with full speed optimizations:

   char* newDest=dest+len;
   ...
   dest=newDest;   
00000014: 7C050378 mr    r5,r0
   do {
    *--dest=stp->value;
00000018: 88040006 lbz   r0,6(r4)
0000001C: 9C05FFFF stbu   r0,-1(r5)
    stp=stp->prefix;
00000020: 80840000 lwz   r4,0(r4)
   } while (stp);
00000024: 28040000 cmplwi  r4,$0000
00000028: 4082FFF0 bne   *-16   ; $00000018

The assignment via the predecremented pointer dest compiles to a single Store Byte With Update (stbu) instruction, making this short loop considerably tighter than other possible constructs. Very nice.

Patrick commented in his code that close to half of his execution time is spent in fread. I remember having the same problem at least once when I competed in the Challenge where I resorted to writing a streamlined version of fread. None of the entries to this month's Challenge tried that technique, though.

Eleven different GIFs were used to test the solutions to this Challenge. The GIFs ranged in size up to 1100x800 pixels, and up to 500KB in length. The table below lists the total execution time in milliseconds for five selected test cases and the time for all test cases combined. It also lists code size, data size, and the programming language used for each entry. The number in parentheses after the entrant's name is the total number of Challenge points earned in all Challenges to date prior to this one (Table 1).

Top Contestants

Here are the Top Contestants for the Programmer's Challenge, including everyone who has accumulated more than 10 points during the past two years. The numbers below include points awarded over the 24 most recent contests, including points earned by this month's entrants.

Munter, Ernst (220 points)
Boring, Randy (73)
Cooper, Greg (61)
Lewis, Peter (51)
Mallett, Jeff (50)
Nicolle, Ludovic (44)
Rieken, Willeke (27)
Antoniewicz, Andy (24)
Gregg, Xan (24)
Murphy, ACC (24)
Day, Mark (20)
Higgins, Charles (20)
Hostetter, Mat (20)
Studer, Thomas (20)
Hart, Alan (14)
O'Connor, Turlough (14)

There are three ways to earn points: (1) scoring in the top 5 of any Challenge, (2) being the first person to find a bug in a published winning solution or, (3) being the first person to suggest a Challenge that I use. The points you can win are:

1st place 20 points
2nd place 10 points
3rd place 7 points
4th place 4 points
5th place 2 points
finding bug 2 points
suggesting Challenge 2 points

Here is Ernst's winning solution to the Image Decoder Challenge:

/*
 An implementation of a decoder of GIF images for the PowerPC Mac.

 Input: a file containing a single image;
     the file is already opened by the calling program

 Output: an off-screen graphics world which contains the image.

 Algorithm
 ---------
 The algorithm follows closely the description in the GIF spec.
 GIF89a extensions are recognized but their data blocks are
 skipped over and not processed.

 Implementation Notes
 --------------------
 The file is read, one large block at a time, into a buffer.
 File and image headers are processed according to the GIF
 spec. This includes constructing a Mac color table from
 either the global or a local GIF color table.

 The LZW decoder draws variable size codes from the buffer,
 maintains a code dictionary, and writes decoded pixel values
 directly into the GWorld pixel map.

 Decoding a single compressed code word yields a string of
 pixels in reverse order. The conventional method  would be
 to push the pixels on a stack, and then pop them off as
 they are written to sequential pixel addresses in the pixel map.

 I use this method if the pixel string goes past the end of
 the current row, in order to deal with extra bytes (rowBytes
 may be greater than imageWidth), and to handle interlaced row
 sequences correctly. In the normal case however, the pixel
 values are copied directly from the code dictionary to the
 pixel map. To facilitate this, I store the length of each
 code string in the dictionary record.
     
*/

#include <stdlib.h>
#include <string.h>
#include "ImageDecoder.h"

enum {kBlockSize=16384,
   kMaxCode=4095,
   kImageSeparator=0x2C,
   kExtensionIntroducer=0x21};

DictEntry 
struct DictEntry {
// Each dictionary entry is the start of a linked list
//     of length one or more.
// The stackedValue location is unrelated to the dictionary
//    entry; it is just a convenient spare location
//    to build the pixel stack, in memory that is likely
//    already in cache when it is needed.
 DictEntry*   prefix; // chains to previous code
 short     length; // length of chain
 unsigned char value; // the pixel value
 unsigned char  stackedValue;

 void Reset(int ch){
  prefix=0;
  length=1;
  value=ch;
 }

 void Init(DictEntry* px,int ch) {
  prefix=px;
  length=1+px->length;
  value=ch;
 }
};

GIF
struct GIF {
 FILE*      f;                    // the GIF file
 Rect      screenRect;              // size of graphics world
 GWorldPtr   osWorld;             // the graphics world
 CTabHandle   globalCTH;            // a global color table
 int       logScreenWidth;          // GIF logical screen
 int      logScreenHeight;
 int      bkGndColor;              // GIF specified bkGnd
 int      interlace;              // to track interlace passes
 char*      pixelBase;            // start of GWorld pixMap
 char*      imageBase;            // start of image in pixMap
 int      rowBytes;              // width of pixMap
 int      imageWidth;              // imageWidth <= rowBytes
 int       readPtr;                // index into file block
 DictEntry   d[4096];              // the code dictionary
 unsigned char block[kBlockSize];  // the file block

GIF::GIF
 GIF(FILE* fx) {
  f=fx;
  fread(block,1,kBlockSize,f);// start by loading 1 block
  readPtr=0;
 }

GIF::CheckSignature
 bool CheckSignature() {
// the file signature must be either GIF87a or GIF89a
  if (0x47494638 != *((long*)block))
   return false;
  int version=*((short*)(block+4));
  if ((version != 0x3761) && (version != 0x3961))
   return false;
  readPtr+=6;
  return true;
 }

GIF::SkipBytes
 void SkipBytes(int n2skip) {
// Skip over n2skip bytes in file block
  readPtr+=n2skip;
  while (readPtr>=kBlockSize) {
// We cannot skip past the end of the block without
//     loading the next block
   readPtr-=kBlockSize;
   fread(block,1,kBlockSize,f);
  }
 }

GIF::Read1Byte
 unsigned int Read1Byte(int & readPtr) {
// Returns the next byte from the file block.
// If we get to the end of the block, we load another.
  const unsigned int rc=block[readPtr++];
  if (readPtr>=kBlockSize) {
   readPtr=0;
   fread(block,1,kBlockSize,f);
  }
  return rc;
 }

GIF::ReadShort
 unsigned int ReadShort() {
// GIF files use little endian integers.
// This function reads two bytes and returns a 16-bit int.
  const unsigned int loByte=Read1Byte(readPtr);
  const unsigned int hiByte=Read1Byte(readPtr);
  return hiByte<<8 | loByte;
 }

GIF::MakeCTab
 CTabHandle MakeCTab(int size) {
// Creates a QuickDraw color table from a GIF color table
  CTabHandle CTH=(CTabHandle)NewHandle(
    sizeof(ColorTable)+
    sizeof(ColorSpec)*(size-1));
  CTabPtr CT=*CTH;
  CT->ctFlags=0;
  CT->ctSize=size-1;
  ColorSpec* cs=CT->ctTable;
   for (int cIndex=0;cIndex<size;cIndex++) {
   cs->value=cIndex;
// Expand 8-bit colors to 16-bit colors
   cs->rgb.red=257 * Read1Byte(readPtr);
   cs->rgb.green=257 * Read1Byte(readPtr);
   cs->rgb.blue=257 * Read1Byte(readPtr);
   cs++;
   }
   return CTH;
 }

GIF::ProcessHeader
 GWorldPtr ProcessHeader() {
// Returns a new offscreen gWorld.
// Returns 0 in all cases of error

  if (!CheckSignature()) return 0;

// Define logical screen size
  logScreenWidth=ReadShort();
  logScreenHeight=ReadShort();
  SetRect(&screenRect,0,0,logScreenWidth,logScreenHeight);

// Read flags
  const int flags=Read1Byte(readPtr);
  const int hasGlobalCT=flags & 0x80;
  bkGndColor=Read1Byte(readPtr);
  readPtr++;  // pixel aspect ratio is not needed

  if (hasGlobalCT) {
   const int CTsize=1L<<((flags&7)+1);
   globalCTH=MakeCTab(CTsize);
   CTabChanged(globalCTH);
  } else globalCTH=0;

// Try to get a new 8-bit offscreen gWorld
  const OSErr err =
  NewGWorld(&osWorld,8,&screenRect,globalCTH,0,0);

// Quickdraw will not free this handle for us!
  if (globalCTH) DisposeHandle((char**)globalCTH);

  if (err) return 0;
  return osWorld;
 }

GIF::ProcessExtension
 void ProcessExtension() {
// Just read and ignore the extension label
  Read1Byte(readPtr);
// and skip over any extension data blocks
  do {
   const int bsize=Read1Byte(readPtr);
   if (bsize==0) break;
   SkipBytes(bsize);
  } while(1);
 }

GIF::ProcessImage
 void ProcessImage() {
// Read the image header to determine image dimensions
//    which could be different from the logical screen
  const int imageLeft=ReadShort();
  const int imageTop=ReadShort();
  imageBase=pixelBase+imageTop*rowBytes+imageLeft;
  imageWidth=ReadShort();
  const int imageHeight=ReadShort();
  const int flags=Read1Byte(readPtr);
  const int hasLocalCT=flags & 0x80;

// Determine interlace geometry
  int skipBytes;
  int stride;
  if (flags & 0x40) {
   interlace=8;
   stride=8*rowBytes;
  } else {
   interlace=1;
   stride=rowBytes;
  }
  skipBytes=stride-imageWidth;

// Draw background color as required by the GIF spec
// (only needed if the image does not fill GIF logScreen)
  if ((globalCTH)
   && ((logScreenHeight>imageHeight)
   || (logScreenWidth>imageWidth))) {
   memset(pixelBase,bkGndColor,logScreenHeight*rowBytes);
  }
  
// The image might have a local color table which replaces
// the global color table. If so, we must update the gWorld.  
  if (hasLocalCT) {
   int CTsize=1L<<((flags&7)+1);
   CTabHandle localCTH=MakeCTab(CTsize);
//  5  9  13  17  21  25  29  33  37  41  45  49  53  57
   UpdateGWorld(&osWorld,8,&screenRect,localCTH,0,0);
   CTabChanged(localCTH);
   DisposeHandle((char**)localCTH);
  }

  DecodeImage(stride,skipBytes,imageHeight);
  
 }

GIF::DecodeImage
 void DecodeImage(int stride,int skipBytes,const int imageHeight) {

// Set up the pointers for writing the pixel map
  char* pixelStore=imageBase;
  char* endOfRow=imageBase+imageWidth;
  const char* endOfImage=endOfRow+(imageHeight-1)*rowBytes;

// Set up initial code parameters and working variables
  int minCodeSize=Read1Byte(readPtr);
  const int clearCode=1<<minCodeSize;
  for (int i=0;i<clearCode;i++) {
   d[i].Reset(i);
  }
  int codeSize=minCodeSize+1;
  unsigned int bitMask=(1L<<codeSize)-1;  
  unsigned int bitBuffer=0;
  int nextCode=clearCode+1;
  int oldCode=0;
  int bitsLoaded=0;
  bool dictFull=false;
  int bsize;
  int localReadPtr=readPtr;

// Read the file data, one GIF data block at a time
  while (0<(bsize=Read1Byte(localReadPtr))) {    
   int ch;

// Read the data of one GIF data block, and convert to pixels
   do {

// Read the next code
    while (bitsLoaded<codeSize) {        
     unsigned int nextByte=Read1Byte(localReadPtr);
     bitBuffer |= nextByte<<bitsLoaded;
     bitsLoaded+=8;
     bsize--;
     if (bsize<=0) break;
    }
    if (bitsLoaded<codeSize) continue;//end of block
    unsigned int code=bitMask & bitBuffer;
    bitsLoaded -= codeSize;
    bitBuffer >>= codeSize;          

    if (code==clearCode) {          

// Reset codeSize, and start all over
     nextCode=clearCode+1;
     codeSize=minCodeSize+1;  
     bitMask=(1L<<codeSize)-1;
     oldCode=0;  
     dictFull=false;
     } else
   
     if (code==clearCode+1) {        
   
// We are done.   
     return;

    } else {                  
  
// Decode pixel string
     if (code<nextCode) {          

// Code available in dictionary
      if (code<clearCode) {          
      // root code represents a single pixel
       ch=code;
       pixelStore=Put1(ch,stride,pixelStore,
        endOfRow,endOfImage,skipBytes);
      } else {                
      // a string of more than 1 pixel
       pixelStore=PutMore(code,stride,pixelStore,
         endOfRow,endOfImage,skipBytes,ch);
      }
      // Add next code to dictionary
      d[nextCode].Init(d+oldCode,ch);
     } else {                

// Anticipated code, we know it will be "nextCode"
      int dummy;
      // Add to dictionary first, then decode
      if (!dictFull) {          
      // Must not change dictionary if already full
       d[nextCode].Init(d+oldCode,ch);
      }
      pixelStore=PutMore(code,stride,pixelStore,
         endOfRow,endOfImage,skipBytes,dummy);
     }
     oldCode=code;
// Always just increment nextCode except when the dictionary
// is full.
// Anytime nextCode exceeds the current bitMask, the codeSize
// must increase by 1, and the bitMask stretched by 1 bit
     if (nextCode<kMaxCode) {        
      nextCode++;
      if (nextCode>bitMask) {      
       bitMask |= nextCode;
       codeSize=codeSize+1;
      }
     }  else dictFull=true;
    }
   } while(bsize);
  }
// Note: if we wanted to continue processing after decoding
//     just one picture, we would need to restore the
//     global readPtr before returning from this function.  
 }

GIF::NextPass
 char* NextPass(int & skipBytes,int & stride) {

// Defines parameters and the starting row for the
// next interlace pass. Does not get called with non-
// interlaced images, but provides robustness for them
// in case the GIF file is corrupted: it prevents writing
// past the end of the pixel map.
  stride=interlace*rowBytes;
  if (interlace>1) interlace/=2;
  skipBytes=stride-imageWidth;
  return imageBase+(rowBytes*interlace);
 }

GIF::PutMore
 char* PutMore(
     const int code,int & stride,char* dest,
     char* &endOfRow,const char* endOfImage,
     int & skipBytes,int & firstChar) {
       
// Decodes code and puts several pixels into the pixel map
//     and returns the updated pixel write pointer       
  DictEntry* stp=d+code;
  int len=stp->length;  
  char* newDest=dest+len;
  if (newDest<=endOfRow) {

// Everything fits in the same line.
// We just put the pixels directly into the pixMap,
// starting at the end of the string.
   dest=newDest;   
   do {
    *--dest=stp->value;
    stp=stp->prefix;
   } while (stp);
   firstChar=*dest;
   // newDest points to the next pixel to be painted
   return newDest;
  } else {
// We may have to handle one or more line breaks.
// Pixels past the end of the current line are pushed
// on a stack, then the pixels in the current line
// are painted; finally the pixels in the stack
// are popped off in reverse order, skipping past
// extra-bytes at line ends.
// If we reach the end of the image before the stack is
// empty, we must be in interlace mode and call a newPass()
// to find the next row to write to.
   DictEntry* stack=d+len;
   DictEntry* endStack=stack;
   int overflow=newDest-endOfRow;
   for (int k=0;k<overflow;k++) { // stack pixels in reverse
    firstChar=stp->value;
    (--stack)->stackedValue=firstChar;
    stp=stp->prefix;
   }

   if (stp) { // paint pixels for the current line
    dest=endOfRow;
    do {
     firstChar=stp->value;
     *--dest=firstChar;
     stp=stp->prefix;
    } while (stp);
   }

   do { // paint pixels from the stack, going forward
    dest=endOfRow+skipBytes;
    if (dest>=endOfImage) {
     dest=NextPass(skipBytes,stride);
    }
    endOfRow=dest+imageWidth;
    do {
     *dest++=(stack++)->stackedValue;
    } while ((stack<endStack)&&(dest<endOfRow));
   } while (stack<endStack);
   // dest points to the next pixel to be painted
   return dest;
  }
 }

GIF::Put1
 char* Put1(
     const int code,int & stride,char* dest,
     char* &endOfRow,const char* endOfImage,
     int &skipBytes) {
// Puts 1 pixel into pixMap
// and returns the updated pixel write pointer     
  if (dest>=endOfRow) {
   dest+=skipBytes;    
   if (dest>=endOfImage) {
    dest=NextPass(skipBytes,stride);  
   }
   endOfRow=dest+imageWidth;
  }
  *dest++=code;
  return dest;
 }
};

ReadImage
GWorldPtr ReadImage(FILE* inputFile) {
// The externally declared function

// Initialize a GIF structure
 GIF* gif=new GIF(inputFile);
 if (0==gif) return 0;
// Check signature, and create the off screen world
 GWorldPtr osWorld=gif->ProcessHeader();
 if (osWorld) {
// Using the geometry of the os-world ...
  PixMapHandle pmHdl=GetGWorldPixMap(osWorld);
  if(LockPixels(pmHdl)) {
   gif->pixelBase=GetPixBaseAddr(pmHdl);
   gif->rowBytes=(**pmHdl).rowBytes & 0x3FFF;
   do {
// Start reading the GIF data and look for the image separator
    int separator=gif->Read1Byte(gif->readPtr);
     if (separator==kImageSeparator) {
     gif->ProcessImage();// This is what it's about
     break;
    } else if (separator==kExtensionIntroducer) {
     gif->ProcessExtension();// these are ignored
    } else break;
   } while(1);

   UnlockPixels(pmHdl);
  }
 }

// Clean up temporary memory allocations
 delete gif;

 return osWorld;
}

SPREAD THE WORD:
Slashdot
Digg
Del.icio.us
Reddit
Newsvine

Nov. 20:	Take Control of Syncing Data in Sow Leopard' released
Nov. 19:	Cocktail 4.5 (Leopard Edition) released
Nov. 19:	macProVideo offers new Cubase tutorials
Nov. 18:	S Stardom anounces Safe Capsule, a companion piece for Apple's
Nov. 17:	Ableton releases Max for Live
Nov. 17:	Ableton releases Max for Live
Nov. 17:	Ableton releases Max for Live
Nov. 17:	Ableton releases Max for Live
Nov. 17:	Ableton releases Max for Live
Nov. 17:	Ableton releases Max for Live
Nov. 17:	Ableton releases Max for Live
Nov. 17:	Ableton releases Max for Live
Nov. 17:	Ableton releases Max for Live
Nov. 17:	Ableton releases Max for Live

MAC TECH

document.write(document.title);

May 98 - Programmer's Challenge

EggLob

Three Months Ago Winner

Top Contestants