- Path: senator-bedfellow.mit.edu!bloom-beacon.mit.edu!spool.mu.edu!sdd.hp.com!swrinde!cs.utexas.edu!uunet!mcsun!sun4nl!cwi.nl!guido
- From: guido@cwi.nl (Guido van Rossum)
- Newsgroups: alt.binaries.sounds.misc,alt.binaries.sounds.d,comp.dsp,comp.answers,news.answers
- Subject: Changes to: FAQ: Audio File Formats
- Message-ID: <audio-diff_743091234@charon.cwi.nl>
- Date: 19 Jul 93 14:14:28 GMT
- Expires: 16 Aug 93 14:13:54 GMT
- Sender: news@cwi.nl
- Reply-To: guido@cwi.nl
- Followup-To: alt.binaries.sounds.d,comp.dsp
- Lines: 333
- Approved: news-answers-request@MIT.Edu
- Supersedes: <audio-diff_740136129@charon.cwi.nl>
- Xref: senator-bedfellow.mit.edu alt.binaries.sounds.misc:8710 alt.binaries.sounds.d:3429 comp.dsp:6707 comp.answers:1342 news.answers:10511
-
- Archive-name: audio-fmts/diff
- Submitted-by: Guido van Rossum <guido@cwi.nl>
- Version: 3.04
- Last-modified: 19-Jul-1993
-
- *** 1.3 1993/06/15 09:22:23
- --- audio.part1 1993/07/19 14:08:13
- ***************
- *** 8,11 ****
- Submitted-by: Guido van Rossum <guido@cwi.nl>
- ! Version: 3.03
- ! Last-modified: 15-Jun-1993
-
- --- 8,11 ----
- Submitted-by: Guido van Rossum <guido@cwi.nl>
- ! Version: 3.04
- ! Last-modified: 19-Jul-1993
-
- ***************
- *** 121,123 ****
- like a tiny floating point number; however, their dynamic range is
- ! that of 14 bit linear data. Source for converting to/from U-LAW
- (written by Jef Poskanzer) is distributed as part of the SOX package
- --- 121,123 ----
- like a tiny floating point number; however, their dynamic range is
- ! that of 12 bit linear data. Source for converting to/from U-LAW
- (written by Jef Poskanzer) is distributed as part of the SOX package
- ***************
- *** 216,223 ****
- Public standards for voice compression are slowly gaining popularity,
- ! e.g. CCITT G.721 and G.723 (ADPCM at 32 and 24 kbits/sec). (ADPCM ==
- ! Adaptive Delta Pulse Code Modulation.) Free source code for a *fast*
- ! 32 kbits/sec ADPCM algorithm is available by ftp from ftp.cwi.nl as
- ! /pub/adpcm.shar. (** NOTE: if you are using v1.0, you should get
- ! v1.1, released 17-Dec-1992, which fixes a serious bug -- the quality
- ! of v1.1 is claimed to be better than uLAW **)
-
- --- 216,229 ----
- Public standards for voice compression are slowly gaining popularity,
- ! e.g. CCITT G.721 (ADPCM at 32 kbits/sec) and G.723 (ADPCM at 24 and 40
- ! kbits/sec). (ADPCM == Adaptive Delta Pulse Code Modulation.) Sun
- ! Microsystems has placed the source code of a portable implementation of
- ! these algorithms (as well as G.711, which defines A-LAW and U-LAW) in
- ! the public domain (needless to say, their proprietary implementation
- ! distributed in binary form with Solaris is better :-). One place to
- ! ftp this source code from is ftp.cwi.nl:/pub/audio/ccitt-adpcm.tar.Z.
- ! Source for another 32 kbits/sec ADPCM implementation, assumed to be
- ! compatible with Intel's DVI audio format, can be ftp'ed from
- ! ftp.cwi.nl:/pub/audio/adpcm.shar. (** NOTE: if you are using v1.0,
- ! you should get v1.1, released 17-Dec-1992, which fixes a serious bug
- ! -- the quality of v1.1 is claimed to be better than U-LAW **)
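The bit rates quoted above follow from the number of bits spent on each sample. The sketch below is only an illustration of that arithmetic; it assumes the usual 8 kHz telephony sample rate, and the function name is made up for this example.

```python
# Illustration only: bit rate = bits per sample * samples per second.
# The 8000 Hz default is an assumption (telephony-rate audio).

def adpcm_bit_rate(bits_per_sample, sample_rate=8000):
    """Return the bit rate in bits/sec for a fixed-size code per sample."""
    return bits_per_sample * sample_rate

# 8-bit U-LAW at 8 kHz, for comparison
ULAW_RATE = adpcm_bit_rate(8)

# 4 bits/sample gives the 32 kbits/sec rate; 3 and 5 bits/sample
# give the 24 and 40 kbits/sec rates.
RATES = {bits: adpcm_bit_rate(bits) for bits in (3, 4, 5)}

if __name__ == "__main__":
    for bits, rate in sorted(RATES.items()):
        print("%d bits/sample: %d bits/sec (%.1f%% of U-LAW)"
              % (bits, rate, 100.0 * rate / ULAW_RATE))
```

So 32 kbits/sec ADPCM needs half the bandwidth of 8-bit U-LAW at the same sample rate.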
-
- ***************
- *** 232,233 ****
- --- 238,245 ----
-
- + Tony Robinson <ajr@eng.cam.ac.uk> has written a good, FAST lossless
- + compression program for lots of different audio formats (particularly
- + good for WAV and MOD files). The software is available by anonymous
- + ftp from svr-ftp.eng.cam.ac.uk [129.169.24.20], directory misc, file
- + shorten-1.08.tar.Z.
- +
- (Note that U-LAW and silence detection can also be considered
- ***************
- *** 313,315 ****
- --- 325,329 ----
- DEC 3000/300-500 U-LAW 8k 1
- + DEC 5000/20-25 U-LAW 8k 1
- Tandy 1000/*L* 8 22k 3
- + Tandy 2500 8 22k 3
- HP9000/705,710,425e U,A-LAW,16 8k 1
- ***************
- *** 338,339 ****
- --- 352,359 ----
-
- + Notes for the DECstation 5000/20-25: You need either the XMedia tools
- + from DEC ($$$$) or the AudioFile package (which works nicely) from
- + crl.dec.com (see below). The audio device is "/dev/bba". You cannot
- + send ".au" files directly to the device; the XMedia/AF software
- + provides an "audioserver" which must be run to play/record sounds.
- +
- The SGI Personal IRIS 4D/30 and 4D/35 have the same capabilities as
- ***************
- *** 654,657 ****
-
- ! The Tandy 1000 uses a (proprietary?) compressed format. There is a
- ! PD Mac to Tandy conversion program called CONVERT.
-
- --- 674,680 ----
-
- ! The Tandy 1000 uses a (proprietary?) compressed format. There is a PD
- ! Mac to Tandy conversion program called CONVERT. Leonard Erickson
- ! <leonard@qiclab.scn.rain.com> writes: There is a WAV driver from Tandy
- ! if people ask. There also appears to be a program that purports to
- ! convert other formats to Tandy, but I haven't tested this one yet.
-
- *** 1.3 1993/06/15 09:22:23
- --- audio.part2 1993/07/19 14:08:58
- ***************
- *** 8,11 ****
- Submitted-by: Guido van Rossum <guido@cwi.nl>
- ! Version: 3.03
- ! Last-modified: 15-Jun-1993
-
- --- 8,11 ----
- Submitted-by: Guido van Rossum <guido@cwi.nl>
- ! Version: 3.04
- ! Last-modified: 19-Jul-1993
-
- ***************
- *** 518,543 ****
-
- ! From: Joe Campbell N3JBC jpcampb@afterlife.ncsc.mil 74040.305@compuserve.com
-
- ! The U.S. DoD's Federal-Standard-1016 4800 bps code excited linear prediction
- ! voice coder version 3.2 (CELP 3.2) Fortran and C simulation source codes are
- ! now available for worldwide distribution at no charge (on DOS diskettes,
- ! but configured to compile on Sun SPARC stations) from:
- !
- ! Bob Fenichel
- ! National Communications System
- ! Washington, D.C. 20305
- ! 1-703-692-2124
- ! 1-703-746-4960 (fax)
- !
- ! In addition to the source codes, example input and processed speech files
- ! are included along with a technical information bulletin to assist in
- ! implementation of FS-1016 CELP. (An anonymous ftp site is being considered
- ! for future releases.)
- !
- ! Copies of the FS-1016 document are available for $2.50 each from:
- !
- ! GSA Rm 6654
- ! 7th & D St SW
- ! Washington, D.C. 20407
- ! 1-202-708-9205
-
- --- 518,552 ----
-
- ! From: jpcampb@afterlife.ncsc.mil (Joe Campbell)
-
- ! The U.S. DoD's Federal-Standard-1016 based 4800 bps code excited linear
- ! prediction voice coder version 3.2 (CELP 3.2) Fortran and C simulation
- ! source codes are available for worldwide distribution (on DOS
- ! diskettes, but configured to compile on Sun SPARC stations) from NTIS
- ! and DTIC. Example input and processed speech files are included. A
- ! Technical Information Bulletin (TIB), "Details to Assist in
- ! Implementation of Federal Standard 1016 CELP," and the official
- ! standard, "Federal Standard 1016, Telecommunications: Analog to
- ! Digital Conversion of Radio Voice by 4,800 bit/second Code Excited
- ! Linear Prediction (CELP)," are also available.
- !
- ! This is available through the National Technical Information Service:
- !
- ! NTIS
- ! U.S. Department of Commerce
- ! 5285 Port Royal Road
- ! Springfield, VA 22161
- ! USA
- ! (703) 487-4650
- !
- ! The "AD" ordering number for the CELP software is AD M000 118
- ! (US$ 90.00) and for the TIB it's AD A256 629 (US$ 17.50). The LPC-10
- ! standard, described below, is FIPS Pub 137 (US$ 12.50). There is a
- ! $3.00 shipping charge on all U.S. orders. The telephone number for
- ! their automated system is 703-487-4650, or 703-487-4600 if you'd prefer
- ! to talk with a real person.
- !
- ! (U.S. DoD personnel and contractors can receive the package from the
- ! Defense Technical Information Center: DTIC, Building 5, Cameron
- ! Station, Alexandria, VA 22304-6145. Their telephone number is
- ! 703-274-7633.)
-
- ***************
- *** 559,563 ****
-
- - For U.S. FED-STD-1016 (4800 bps CELP) _realtime_ DSP code
- - and information about products using this code, contact:
-
- John DellaMorte
- --- 568,601 ----
-
-
- + The U.S. DoD's Federal-Standard-1015/NATO-STANAG-4198 based 2400 bps
- + linear prediction coder (LPC-10) was republished as a Federal
- + Information Processing Standards Publication 137 (FIPS Pub 137).
- + It is described in:
- +
- + Thomas E. Tremain, "The Government Standard Linear Predictive Coding
- + Algorithm: LPC-10," Speech Technology Magazine, April 1982, p. 40-49.
- +
- + There is also a section about FS-1015 in the book:
- + Panos E. Papamichalis, Practical Approaches to Speech Coding,
- + Prentice-Hall, 1987.
- +
- + The voicing classifier used in the enhanced LPC-10 (LPC-10e) is described in:
- + Campbell, Joseph P., Jr. and T. E. Tremain, "Voiced/Unvoiced Classification
- + of Speech with Applications to the U.S. Government LPC-10E Algorithm,"
- + Proceedings of the IEEE International Conference on Acoustics, Speech, and
- + Signal Processing, 1986, p. 473-6.
- +
- + Copies of the official standard
- + "Federal Standard 1016, Telecommunications: Analog to Digital Conversion
- + of Radio Voice by 4,800 bit/second Code Excited Linear Prediction (CELP)"
- + are available for US$ 5.00 each from:
- +
- + GSA Federal Supply Service Bureau
- + Specification Section, Suite 8100
- + 470 E. L'Enfant Place, S.W.
- + Washington, DC 20407
- + (202)755-0325
- +
- + Realtime DSP code for FS-1015 and FS-1016 is sold by:
- +
- John DellaMorte
- ***************
- *** 566,567 ****
- --- 604,606 ----
- Bedford, MA 01730
- + USA
- 1-617-275-3733
- ***************
- *** 570,582 ****
-
- ! DSP Software Engineering's code can run on a DSP Research's Tiger 30 board
- ! (a PC board with a TMS320C3x and analog interface suited to development work)
- ! or on Intellibit's AE2000 TMS320C31 based 3" by 2.5" card.
- !
- ! DSP Research                  Intellibit
- ! 1095 E. Duane Ave.            P.O. Box 9785
- ! Sunnyvale, CA 94086           McLean, VA 22102-0785
- ! (408)773-1042                 (703)442-4781
- ! (408)736-3451 (fax)           (703)442-4784 (fax)
-
- ! From: tobiasr@monolith.lrmsc.loral.com (Richard Tobias )
-
- --- 609,621 ----
-
- ! DSP Software Engineering's FS-1016 code can run on DSP Research's Tiger 30
- ! (a PC board with a TMS320C3x and analog interface suited to development work).
- !
- ! DSP Research
- ! 1095 E. Duane Ave.
- ! Sunnyvale, CA 94086
- ! USA
- ! (408)773-1042
- ! (408)736-3451 (fax)
-
- ! From: tobiasr@monolith.lrmsc.loral.com (Richard Tobias)
-
- ***************
- *** 593,623 ****
-
- ! Newsgroups: comp.dsp
- ! From: bae@hplsdrn.col.hp.com (Bruce Erickson)
- ! Subject: Re: FTP site for CELP audio compression source?
- !
- ! In comp.dsp (subj: FTP site for CELP audio compression source?), Joe
- ! Campbell writes:
- ! > I would like to mention that a document, that is a vital part of the CELP
- ! > release package, is not available in electronic form. Therefore, I urge
- ! > anyone who is seriously interested in this coder to obtain this document:
- ! >
- ! > Details to Assist in Implementation of Federal Standard 1016 CELP.
- ! > National Communications System, Office of Technology & Standards, 1992.
- ! > Technical Information Bulletin 92-1.
- ! [Available for free from Bob Fenichel above --GvR]
- !
- ! I would also like to mention that when Bob gave me permission to put the
- ! CELP disks on wsmr-simutel he asked for people who fetch them to let
- ! him know that they have them.
- !
- ! So if you grab the sources -- from whatever source -- please give him
- ! a call or send him USmail.
- !
- ! I am still waiting for wsmr-simutel to let me know how to upload the CELP
- ! disks -- I will be sure to post here & elsewhere when I upload them!
- !
- ! - Bruce Erickson
- ! bae@col.hp.com
-
- ! From: Cole Erskine <cole@analogical.com> [paraphrased]
-
- --- 632,636 ----
-
- ! From: Cole Erskine <cole@analogical.com>
-
- ! [paraphrased]
-
- ***************
- *** 681,682 ****
- --- 694,696 ----
- 07 End repeat 0 (NONE)
- + 08 Extended 4 ***
-
- ***************
- *** 688,690 ****
- --- 702,712 ----
-
- + ***Extended Info Format:
- + ---------------------
- + 00-01 Time Constant: Mono:   65536 - (256000000/sample_rate)
- +                      Stereo: 65536 - (256000000/(2*sample_rate))
- + 02 Pack
- + 03 Mode: 0 = mono
- + 1 = stereo
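As a rough sketch of how the Extended Info fields might be computed: the code below applies the time constant formula with 256000000 in the numerator for both mono and stereo. The function names, the use of integer (floor) division, and the little-endian byte order are assumptions made for this example, not something stated in the format description.

```python
import struct

# Sketch only. Assumptions: floor division in the time constant
# formula, and a little-endian 16-bit time constant in bytes 00-01.

def voc_time_constant(sample_rate, channels=1):
    """Time constant for the Extended block (1 = mono, 2 = stereo)."""
    if channels not in (1, 2):
        raise ValueError("channels must be 1 or 2")
    tc = 65536 - 256000000 // (channels * sample_rate)
    if not 0 <= tc <= 0xFFFF:
        raise ValueError("sample rate out of range for a 16-bit field")
    return tc

def voc_sample_rate(time_constant, channels=1):
    """Invert the formula; the result can be off by rounding."""
    return 256000000 // ((65536 - time_constant) * channels)

def pack_extended_info(sample_rate, pack=0, stereo=False):
    """Build the 4 data bytes: time constant, pack, mode."""
    channels = 2 if stereo else 1
    tc = voc_time_constant(sample_rate, channels)
    return struct.pack("<HBB", tc, pack, 1 if stereo else 0)

if __name__ == "__main__":
    for rate in (8000, 11025, 22050):
        print(rate, voc_time_constant(rate), voc_time_constant(rate, 2))
```

Because the time constant is an integer, converting it back to a sample rate does not always give the exact rate you started with.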
-
- +
- Marker# -- Driver keeps the most recent marker in a status byte
- ***************
- *** 771,772 ****
- --- 793,813 ----
- Copyright 1959, 1964, 1970, 1982
- +
- + A note on the resolution of U-LAW by Frank Klemm <pfk@rz.uni-jena.de>:
- +
- + 8 bit U-LAW has the same lowest magnitude as 12 bit linear, and 12 bit
- + U-LAW has the same lowest magnitude as 16 bit linear.
- +
- + Device/Coding        Resolution         Resolution
- +                      at maximum level   at low level
- + 8 bit linear          8                  8
- + 8 bit U-LAW           6                 12    (used for digital telephone)
- + 12 bit linear        12                 12
- + 12 bit U-LAW         10                 16    (used in DAT/Longplay)
- + 16 bit linear        16                 16
- +
- + Estimates for some analog equipment:
- + tape recorder (HiFi DIN)
- +                       8                  9    (no problem today)
- + tape recorder (semiprofessional)
- +                      10.5               13.5
-
-