home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
Frostbyte's 1980s DOS Shareware Collection
/
floppyshareware.zip
/
floppyshareware
/
LGB
/
HYPER.ZIP
/
FILE30
< prev
next >
Wrap
Text File
|
1988-02-01
|
5KB
|
93 lines
How do you create hypertext systems?
====================================
Having already built several very large hypertext system (multi-megabyte),
I've developed both techniques and tools to rapidly create hypertext
systems <FILE34 LINKING>. Here's the steps we go through to convert 150
pages of text into a comprehensive hypertext system <FILE32 SPLITTING>:
STEP 1 OCR (Optical Character Read) the text. We have machines that
read both typewritten and typeset text. See <FILE31 OCR>.
STEP 2 Clean up the text -- run spelling and grammar checkers, then
proof the text to correct OCR errors
STEP 3 Split the text into files -- Using a word processor, we add
codes, a file name, and a file descriptor at each new idea unit
in the text (generally every 1-4 paragraphs). Our SPLITTER
program uses these codes to split the original files into smaller
ASCII files. The 150 pages might become 500 ASCII files.
STEP 4 Build the master cross-reference index to the ASCII files. Using
our MARKER program <FILE64 TOOLS>, we mark the key words and
phrases in each file (and add synonyms if necessary). We
highlight perhaps 3-4 words/phrases per file, then assemble
these word lists into a master cross-reference file.
STEP 5 Put the cross-references into each ASCII file. We load the
word/phrase cross-reference list into HOUDINI, then use this
network to identify files that share common words and phrases.
This information is added to each ASCII file. <FILE33 LINKS>
STEP 6 Integrity check. We use another program to make sure the ASCII
networks are valid (no isolated files, no dead-end links, etc.)
STEP 7 We use both a decision-tree builder and HOUDINI to build the
knowledge hierarchies/networks that lead to the ASCII files.
Although it appears to be complex and time consuming, the overall hypertext
construction speed really depends on the user's knowledge of the subject area.
However, we've had considerable experience <FILE53 TALENTS> in building
systems for others. These systems typically range 2-4 megabytes in size,
contain 1,000-plus files, and 2,000-3,000 hypertext links.
Specifically, these hypertext systems have organized large amounts of
procedural law arising out of special hearings. We converted all the
minutes from many technical hearings into a comprehensive index by topic,
issue, commentary, opinion, and consequences.
As for performance, the systems we've built provide users, in an average
of less than 10 keystrokes, with immediate references to any ruling, topic,
or issue affecting certain areas of their profession. <FILE62 VALUE>
As for production standards, we believe the systems we've built are larger
than any reported in the hypertext literature to date. In addition, while
hypertext literature speaks of a year of effort to build modest systems
(200 nodes), in practice we do that in a day.
That leads to the question of...how do we do it?
Well, we use a number of software tools to build such systems -- splitters,
markers, reference-builders, hierarchical and network linkers, integrity
checkers, compactors, and installer/verifiers.
While we see these tools as essential to any hypertext efforts, we find no
mention of such software techniques in the literature of hypertext, nor by
companies or academia with hypertext software, nor from persons/reviewers
claiming an understanding of hypertext. <FILE74 LIMITATIONS OF EXISTING
CONCEPTS>
From all that, I think that you can draw your own conclusions. Just
remember the tale of the six blind men and their efforts to desribe an
elephant. Hypertext is much the same as most visions of the process do not
match the methods or software needed to accomplish the task.
Hypertext construction is a new field...and I'd say that users of MaxThink
and HOUDINI <FILE26 INFORMATION> understand more of the principles of
hypertext <FILE17 DESIGN> than anyone else. Why?
┌───────────────────────────────────────────────────────────┐
│ Good hypertext depends on abilities to clearly │
│ categorize information. │
└───────────────────────────────────────────────────────────┘
That is something every MaxThink and HOUDINI user already knows and
something users who are unfamiliar with hierarchical and network processors
scarcely understand.
Neil Larson 1/16/88 FILE30
44 Rincon Rd., Kensington, CA 94707
Copyright MaxThink 1988 -- Call 415-428-0104 for permission to reprint