- Newsgroups: comp.os.vms
- Path: sparky!uunet!spool.mu.edu!howland.reston.ans.net!sol.ctr.columbia.edu!usenet.ucs.indiana.edu!fractal!mdchaney
- From: mdchaney@fractal.ucs.indiana.edu (M Darrin Chaney)
- Subject: Re: Tool to reorganize/transform ASCII files
- Message-ID: <C1Ktxs.JF7@usenet.ucs.indiana.edu>
- Sender: news@usenet.ucs.indiana.edu (USENET News System)
- Nntp-Posting-Host: fractal.ucs.indiana.edu
- Organization: Indiana University, Bloomington
- References: <1993Jan28.142833.7117@bmw.mayo.edu>
- Date: Thu, 28 Jan 1993 18:27:27 GMT
- Lines: 42
-
- In article <1993Jan28.142833.7117@bmw.mayo.edu> fisk@mayo.edu writes:
- >We are looking for a tool that would allow us to take an ASCII input file,
- >and reorganize/transform it to another format.
- >
- >The task at hand is taking ASCII data dumps from several different
- >medical centers that have similar but different formats and "normalizing" them
- >to one common data format that I can import into a database. I would
- >essentially have to take apart the ASCII input records, reorganize the data,
- >possibly do some transformations on the data (e.g., changing all the diagnosis
- >codes to a standard list), and write the data out to my common format. I
- >will then take this common format and import it into my database.
- >
- >The "perfect" tool would allow me to have a configuration file to drive the
- >entire process (since I have to do this for several different formats).
- >
- >Does anybody know if such a beast exists? Does this sound like something
- >that awk/gawk would be capable of handling?
- >
- >Thanks for your help and suggestions! Tom.
-
- Well, I was going to suggest gawk, but you've heard about it already. I'll
- suggest it anyway.
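-
To give a rough idea of how awk could handle the config-driven part, here is a minimal sketch. Everything specific in it is made up for illustration: the file names, the pipe-delimited input layout, the column map format, and the diagnosis codes (HT, DM, D001, D002). The wrapper is a Unix-style shell script; on VMS the gawk invocation would differ, but the awk program itself should carry over.

```shell
#!/bin/sh
# Hypothetical example: all file names, layouts, and codes are invented.
tmp=$(mktemp -d)

# Per-site config: output field name -> input column number.
cat > "$tmp/siteA.cfg" <<'EOF'
id 2
date 3
dx 1
EOF

# Translation table: site-local diagnosis code -> standard code.
cat > "$tmp/dxmap.txt" <<'EOF'
HT D001
DM D002
EOF

# Sample dump from one site, pipe-delimited.
cat > "$tmp/siteA.dat" <<'EOF'
HT|1001|1993-01-12
DM|1002|1993-01-15
XX|1003|1993-01-20
EOF

# normalize CONFIG DXMAP DATA
# Reads the column map and code table first, then rewrites each data
# record as "id,date,standard-dx". Codes missing from the table are
# written as UNKNOWN rather than dropped.
normalize() {
  awk -v OFS=',' '
    mode == "cfg" { col[$1] = $2; next }   # remember which column holds what
    mode == "map" { std[$1] = $2; next }   # remember code translations
    mode == "data" {
      dx = $(col["dx"])
      if (dx in std) dx = std[dx]; else dx = "UNKNOWN"
      print $(col["id"]), $(col["date"]), dx
    }
  ' mode=cfg "$1" mode=map "$2" mode=data FS='|' "$3"
}

normalize "$tmp/siteA.cfg" "$tmp/dxmap.txt" "$tmp/siteA.dat"
```

With the sample files above this prints:

```
1001,1993-01-12,D001
1002,1993-01-15,D002
1003,1993-01-20,UNKNOWN
```

Supporting another medical center would then mostly mean writing a new column map (and changing the field separator if its dump uses a different one), while the awk program stays the same.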
-
- While I don't have the gawk source available, you can ftp to my machine
- (fractal.ucs.indiana.edu) and grab some sample awk programs there. I also
- have a zipcode database, and the awk programs to fix it up. You can also
- find the gawk manual in TeX or PostScript format there.
-
- >-------------------------------------------------------------------------------
- >Thomas B. Fisk +----------------------------+ Internet: fisk@mayo.edu
- >Mayo Clinic | If you don't know where | Voice: (507) 255-4341
- >200 First Street SW | you're going you'll never | FAX: (507) 255-5484
- >Mail Stop 2D-337 STM | get there. |
- >Rochester, MN 55905 +----------------------------+
- >-------------------------------------------------------------------------------
-
- Darrin
- --
- M Darrin Chaney, Senior Database Programmer, University Computing Services, IU
- mdchaney@indiana.edu 1000 E 17th St. Work: (812)855-5492
- mdchaney@iubacs.bitnet Bloomington, IN 47408 Home: (812)333-6311
- "I want- I need- to live, to see it all..."
-