home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
OS/2 Shareware BBS: 14 Text
/
14-Text.zip
/
arc_7e.zip
/
SPCHOUT.SAV
Wrap
Text File
|
1994-04-16
|
120KB
|
3,658 lines
This file uploaded by John Pedersen 76547,357.
Messages collected using Golden Commpass 2.1
Messages sorted with Recon 2.2
#: 1 S10/Thread Header
08-Feb-94 16:10:46
Sb: Speech quick interaction
Fm:
To:
TITLE: SPEECH QUICK INTERACTION
KEYWORDS: QUICK RESPONSE SPEED PROFILE
#: 705 S10/IBM Speech Products ∙5 ∙
08-Feb-94 16:10:46
Sb: Speech quick interaction
Fm: Dave Briccetti [Zif 72241,557
To: All
Hello.
I'm using ICSS for the first time, and I like it very much so far. I'm
looking for pointers to information that will allow me to tune my ICSS
applications for very quick response. For simple responses, such as "Yes" and
"No," I'd like ICSS to return to my application within 1/2 second after the
speaker stops talking. Is this done entirely by adjusting the system
parameters, or is there something else I need to do?.
Thanks, and keep up the good work.
Dave Briccetti
Dave Briccetti & Assoc.
OS/2 Consultant/Developer
Lafayette, CA
#: 725 S10/IBM Speech Products ∙5 ∙
09-Feb-94 16:39:51
Sb: #705-Speech quick interaction
Fm: Ora Williamson 71742,2161
To: Dave Briccetti [Zif 72241,557
>> For simple responses, such as "Yes" and "No," I'd like ICSS to return to my
application within 1/2 second after the speaker stops talking.
Dave: In the User Profile, adjust the Minimum Amount of Silence To Indicate
End Talking variable. Or, adjust it with the ICSSSetValue API call using the
UttEndSilDuration variable.
Hope this helps! Regards, Ora J. Williamson IBM Spoken Language Systems Lab
ICSS Technical Liason
#: 1 S10/Thread Header
27-Jan-94 23:56:29
Sb: Speech Products
Fm:
To:
TITLE: SPEECH PRODUCTS
KEYWORDS: ICSS IPDS
#: 575 S10/IBM Speech Products ∙5 ∙
27-Jan-94 23:56:29
Sb: Speech Products
Fm: R. Glenn Jones 73531,461
To: Ian Wright 73064,3434
Ian, I am a little confused about IBM's various speech products. Was the IBM
Dictation system written using ICSS? I have purchased the dicatation system
(have the software but not the card yet) and wonder now if I have bought the
right product.
Can you tell me what the differences are between the various products?
Thanks R. Glenn Jones (Sears Canada Inc)
#: 580 S10/IBM Speech Products ∙5 ∙
29-Jan-94 11:46:38
Sb: #575-Speech Products
Fm: Ian Wright (IBM) 73064,3434
To: R. Glenn Jones 73531,461
>>I am a little confused about IBM's various speech products
Well, me too. But here's what I have found.
IBM Personal Dictation System ------------------------- * Discrete speech
dictation * 20,000-word vocabulary (World News Today reported 32,000-word
vocab, 20 Nov 93) * 70 words a minute * Needs training * Continuous speech
commands (Solitaire's great fun) * Voice Action Editor to create personal
macros (e.g. to open e-mail when you say "Good morning") * User interface
navigation
IBM Speech Server Series -------------------- * Similar to IPDS in function *
Runs under AIX (host and client) or AIX host and OS/2 client * High-end
solution includes networking
IBM Continuous Speech Series ------------------------ * Speaker-independent *
Continuous speech * Toolkit for developing Windows, OS/2, and AIX
speech-enabled apps
VoiceType 2 ---------- * Supports DOS apps such as word processord, databases,
and spreadsheets
Voice Type Control for Windows -------------------------- * 64-word vocabulary
command and navigation system for Windows 3.1
For Information ------------ Call 1-800-TALK-2ME Ian Wright (IBM)
#: 598 S10/IBM Speech Products ∙5 ∙
01-Feb-94 14:00:27
Sb: #575-Speech Products
Fm: Ora Williamson 71742,2161
To: R. Glenn Jones 73531,461
>>I am a little confused about IBM's various speech products
Time to chime in...
IBM's 2 PRIMARY Speech products are IBM Personal Dictation System (IPDS) and
IBM Continuous Speech Series (ICSS).
IPDS is a large-vocabulary dictation system which allows you to dictate after
performing an 1 1/2 hour enrollment where you train your voice to the system.
IPDS supports discrete speech - there must be a slight pause between words.
IPDS is currently available for OS/2 and runs about $1000 for the software and
the speech accelerator card.
ICSS is a small-vocabulary grammar driven system which allows you to speak
pre-defined words and phrases continuously - without pauses between your
words. ICSS is speaker-independent - you don't have to train your voice to
the system. It is great for walk-up-and-use apps. ICSS ships with an SDK for
speech-enabling applications. The DLL supplies 12 'C' function calls. For
each application, you can load up to 128 of these grammars (1000 words max per
grammar) and dynamically switch between them, typically from window to window.
ICSS is currently available on OS/2, AIX and Windows. It supports sound cards
such as the SoundBlaster, Pro Audio Spectrum and IBM Audio Capture and
Playback Adaptor. ICSS costs $315 for a single development license (which
comes with a runtime license).
For information on ANY of IBM's Speech products, call 1-800-TALK2ME. To order
ICSS, call Indelible Blue at 1-800-776-8284.
Ora Williamson ICSS Technical Liason IBM Spoken Language Systems Lab
#: 604 S10/IBM Speech Products ∙5 ∙
01-Feb-94 16:47:42
Sb: #598-Speech Products
Fm: Scott Gurvey 70040,156
To: Ora Williamson 71742,2161
Ora...
Thanks for making the distinction between the speech products clear. Now I
think I've got it. I sure confused them in the first story I did last year. Do
you know if the Personal Dictation Adaptor can serve as the audio input device
for ICSS? SO far the only audio board I have for my PS/2 95 is the
Audiovation/A, and there is no OS/2 support for that.
Cheers,
Scott
#: 611 S10/IBM Speech Products ∙5 ∙
01-Feb-94 23:37:06
Sb: #604-Speech Products
Fm: Brandon Booth [IBM] 71042,3144
To: Scott Gurvey 70040,156
ICSS can use ACPA, Sound Blaster, and Pro Audio Spectrum cards for audio
input.
Brandon
ICSS Development
#: 622 S10/IBM Speech Products ∙5 ∙
02-Feb-94 13:23:16
Sb: #611-Speech Products
Fm: Scott Gurvey 70040,156
To: Brandon Booth [IBM] 71042,3144
>ICSS can use ACPA, Sound Blaster, and Pro Audio Spectrum cards for audio
input.
So.... Can the IPDS card emulate any of those? Can the Audiovation or
Windsurfer? Will ICSS work with either of the last two (the MWave cards) once
OS/2 support is available?
Does the idea of needing three IBM audio boards on a computer to support three
different IBM products strike anyone else as a little silly?
Cheers,
Scott
#: 677 S10/IBM Speech Products ∙5 ∙
06-Feb-94 20:52:35
Sb: #622-Speech Products
Fm: Brandon Booth [IBM] 71042,3144
To: Scott Gurvey 70040,156
Scott
We should be able to use any card supported by MMPM, and so if Windsurfer is
so supported, we should be able to use it. I really can't comment on our
relationship to IPDS except to say that the two products really do solve
different problems (and is perhaps not as optimal as one would like).
Brandon
ICSS Development
#: 700 S10/IBM Speech Products ∙5 ∙
08-Feb-94 14:35:56
Sb: #598-Speech Products
Fm: R. Glenn Jones 73531,461
To: Ora Williamson 71742,2161
Ora, The exchange rate must be a little higher than I thought. IBM Canada is
charging roughly $800 each for the software and adapter. I bought IPDS because
I intend to use it as a means of entering data and to a lesser extent to
control programs. Voice control for Windows was mentioned, is this written in
ICSS? R. Glenn Jones (Sears Canada Inc)
#: 716 S10/IBM Speech Products ∙5 ∙
09-Feb-94 10:15:31
Sb: #700-Speech Products
Fm: Ora Williamson 71742,2161
To: R. Glenn Jones 73531,461
>> IBM Canada is charging roughly $800 each for the software and adapter.
Glenn: The $800 is for the cost of IPDS, not ICSS, right? IPDS was not part
of the $79 special - only ICSS.
>> Voice control for Windows was mentioned, is this written in ICSS?
I don't understand the above question. Are you asking about ICSS for Windows,
or ICSS for OS/2?
Regards, Ora J. Williamson IBM Spoken Language Systems Lab ICSS Technical
Liason
#: 738 S10/IBM Speech Products ∙5 ∙
09-Feb-94 23:01:14
Sb: #716-Speech Products
Fm: R. Glenn Jones 73531,461
To: Ora Williamson 71742,2161
Ora, yes those figures were for IPDS.
I was asking how Voice Control for Windows fit into the picture. I was asking
if Voice Control for Windows was an application that IBM has written using
ICSS. R. Glenn Jones (Sears Canada Inc)
#: 1 S10/Thread Header
06-Feb-94 22:07:02
Sb: ICSS WIN Problem
Fm:
To:
TITLE: ICSS WIN PROBLEM
KEYWORDS: ICSS WINDOWS TEST BANDWIDTH
#: 679 S10/IBM Speech Products ∙5 ∙
06-Feb-94 22:07:02
Sb: ICSS WIN Problem
Fm: Dan Devine 76040,215
To: all
I have just installed ICSS for Windows. The installation went smoothly on my
Gateway 486-66 w/ 16M.
I'm getting two errors from what is probably the same problem. When I try to
run the test I get the
following error when I ACTIVATE the TESTING.CTX file:
LoadContext failed, rc:49, Recognition processing could not open the
associated CPDFILE.
The error appears when I try to activate the other ctx files as well. I also
get the second half of this
error (Recognition..CPDFILE) when I load the Stock Market demo.
I have a PAS16 card that functions properly. Any help would be appreciated.
DAN
#: 689 S10/IBM Speech Products ∙5 ∙
07-Feb-94 16:41:34
Sb: #679-ICSS WIN Problem
Fm: Ora Williamson 71742,2161
To: Dan Devine 76040,215
>> LoadContext failed, rc:49, Recognition processing could not open the
associated CPDFILE.
Dan:
Do me a favor and look for the file MEDIUM.CPD in the following directory:
\ICSSWIN\MODEL\US_ENG\MEDIUM.
If it is in there, then you may have a path problem in the AUTOEXEC, or your
Virtual Memory size is not large enough. Set Virtual Memory to Permanent and
around 12 - 15 Meg.
If you don't have the .CPD file, then I will send it to you.
Ora Williamson IBM Spoken Language Systems
#: 742 S10/IBM Speech Products ∙5 ∙
10-Feb-94 11:28:19
Sb: #689-ICSS WIN Problem
Fm: Dan Devine 76040,215
To: Ora Williamson 71742,2161
Ora,
Thanks for your help. The problem resulted because I had installed the
high banwidth models only, not the medium - which the included contexts had
been created with.
Thanks again,
Dan
#: 814 S10/IBM Speech Products ∙5 ∙
16-Feb-94 13:01:28
Sb: #742-ICSS WIN Problem
Fm: Ora Williamson 71742,2161
To: Dan Devine 76040,215
>>The problem resulted because I had installed the high banwidth models only,
not the medium
Dan: We are updating our documentation to deal with this anomoly. Thanks for
your patience!
Ora.
#: 1 S10/Thread Header
17-Feb-94 09:31:22
Sb: ICSS Prob Determination
Fm:
To:
TITLE: ICSS PROB DETERMINATION
KEYWORDS: LIST PROBLEMS SOLUTIONS
#: 832 S10/IBM Speech Products ∙5 ∙
17-Feb-94 09:31:22
Sb: ICSS Prob Determination
Fm: Ora Williamson 71742,2161
To: all
To ICSS Users:
I have uploaded a file in our section ICSSPD.1 for your perusal. This file is
a list of common problems that we have seen that you may be encountering.
Possible solutions are included. As the list grows, we will upload additional
hints/tips in sequential file order (ICSSPD.2, ...).
Hope this is helpfulRegards, Ora J. Williamson IBM Spoken Language Systems Lab
ICSS Technical Liasion
#: 1 S10/Thread Header
17-Feb-94 19:21:52
Sb: ICSS API for Phonemes
Fm:
To:
TITLE: ICSS API FOR PHONEMES
KEYWORDS: RETURN PHONEMES
#: 846 S10/IBM Speech Products ∙5 ∙
17-Feb-94 19:21:52
Sb: ICSS API for Phonemes
Fm: Tom Vreeland 76711,1075
To: Brandon Booth [IBM] 71042,3144
We are currently extending our MEDIAscript OS/2 myltimedia authoring tools to
support speech recognition with ICSS. This will enable developers to use our
tools to build Speech enabled multimedia applications without any C
programming.
One important API which is missing from the current support in V 1.0 is the
ability to get a phoneme string back in addition to the text string. This is a
key element in the strategies we would like to use to extend the capabilities
of ICSS. Does such an API exist internally? Could it be provided?
This would facilitate lexicon and dictionary construction, and support many
uses in linguistic research using this technology.
NOTE: We have also added pen support to our authoring interface and like the
idea that we can load a gesture set and sr context for a given application
element to support both elements of an enhanced mm interface.
#: 849 S10/IBM Speech Products ∙5 ∙
17-Feb-94 22:17:44
Sb: #846-ICSS API for Phonemes
Fm: Bob Cave 73232,2474
To: Tom Vreeland 76711,1075
Tom,
As far as I know, we have no internal API to get the phoneme string back along
with the recognized text. I will double check this and get back to you (may
not be until next week as I will be out of the office tomorrow).
Regards,
Bob Cave, ICSS Development Team
#: 892 S10/IBM Speech Products ∙5 ∙
19-Feb-94 21:23:41
Sb: #846-ICSS API for Phonemes
Fm: Brandon Booth [IBM] 71042,3144
To: Tom Vreeland 76711,1075
Hi Tom
The stuff you're describing with the multimedia authoring sounds cool. I'm
sorry we don't have any api to return the phoneme string with the reco'd text.
You could read our dictionary (icss_us.dic) to get this information. There
are no plans to add such an api to ICSS.
Brandon
ICSS Development - IBM
#: 1 S10/Thread Header
08-Feb-94 14:55:06
Sb: Other Language Lexicons
Fm:
To:
TITLE: OTHER LANGUAGE LEXICONS
KEYWORDS: LANGUAGES FOREIGN SPEAKER ACCENTS RECOGNITION
#: 702 S10/IBM Speech Products ∙5 ∙
08-Feb-94 14:55:06
Sb: Other Language Lexicons
Fm: James E. Osbourn 74150,2571
To: Brandon Booth 71042,3144
Are there ICSS dictionaries/lexicons for other languages than English? If
yes, which ones? are they available in the US?
James in Kansas City
#: 726 S10/IBM Speech Products ∙5 ∙
09-Feb-94 16:39:59
Sb: #702-Other Language Lexicons
Fm: Ora Williamson 71742,2161
To: James E. Osbourn 74150,2571
>>Are there ICSS dictionaries/lexicons for other languages than English?
James: No, there are not, but you can emulate foreign phrases using the
English phoneme set. Add the foreign words to an addendum dictionary and
create the associated phoneme strings with the U.S. English phonemes provided
with ICSS.
When doing this, it is best to create the phrases as 1 long word. For
example:
MUCHAS_GRACIAS M UW CH AA S G R AX S IY AA S
Hope this helps! Regards, Ora J. Williamson IBM Spoken Language Systems Lab
ICSS Technical Liason
#: 733 S10/IBM Speech Products ∙5 ∙
09-Feb-94 21:13:57
Sb: #702-Other Language Lexicons
Fm: Brandon Booth [IBM] 71042,3144
To: James E. Osbourn 74150,2571
James
I'm sorry, we currently only support American English.
Brandon
ICSS Development
#: 748 S10/IBM Speech Products ∙5 ∙
10-Feb-94 12:27:02
Sb: #733-Other Language Lexicons
Fm: Bob Hutchison 72254,1315
To: Brandon Booth [IBM] 71042,3144
Brandon,
Can you clarify this a bit. What part of America? The US has such a wide
range of accents that I cannot quite imagine what you mean. Certainly in
written english there are differences between American English and the rest of
us, but oral?
Bob
#: 829 S10/IBM Speech Products ∙5 ∙
16-Feb-94 20:03:18
Sb: #748-Other Language Lexicons
Fm: Ken Solkowitz 72103,2111
To: Bob Hutchison 72254,1315
Bob,
With ICSS' speaker independent aspects, it recognizes almost anyone in the
U.S. I've demonstrated at Comdex for the past two years and it recognized
everyone fine. It truly is speaker independent.
Regards,
Ken Solkowitz
IBM Continuous Speech Series
Bethesda, MD
301-803-3897
#: 833 S10/IBM Speech Products ∙5 ∙
17-Feb-94 10:36:58
Sb: #829-Other Language Lexicons
Fm: Bob Hutchison 72254,1315
To: Ken Solkowitz 72103,2111
Ken,
That's what I thought. Thanks for confirming.
Bob
#: 872 S10/IBM Speech Products ∙5 ∙
19-Feb-94 13:09:34
Sb: #829-Other Language Lexicons
Fm: Michael Burgun 100020,1604
To: Ken Solkowitz 72103,2111 (X)
Ken,
<<<Bob, With ICSS' speaker independent aspects, it recognizes almost anyone in
the U.S. I've demonstrated at Comdex for the past two years and it recognized
everyone fine. It truly is speaker independent. >>>
What about people without funny accents like us Australians?!!?
Michael
#: 874 S10/IBM Speech Products ∙5 ∙
19-Feb-94 15:13:52
Sb: #872-Other Language Lexicons
Fm: Ken Solkowitz 72103,2111
To: Michael Burgun 100020,1604
Michael,
Actually we have a harder time with UK English speakers than we do with anyone
else. We are working on this by recording UK English speakers. That folded
into our voice models will help. We usually recognize UK English fairly well.
I'll have to send a note to the IBMers in Australia using the product and ask
for their results.
Best Regards,
Ken Solkowitz
IBM Continuous Speech Series
Bethesda, MD 20817
301-803-3897
#: 904 S10/IBM Speech Products ∙5 ∙
20-Feb-94 07:20:35
Sb: #874-Other Language Lexicons
Fm: Michael Burgun 100020,1604
To: Ken Solkowitz 72103,2111
Ken,
I doubt you will have any trouble with Australians - I was just "stirring"
(Australian for teasing).
Michael
#: 882 S10/IBM Speech Products ∙5 ∙
19-Feb-94 21:22:51
Sb: #748-Other Language Lexicons
Fm: Brandon Booth [IBM] 71042,3144
To: Bob Hutchison 72254,1315
Hi Bob
I don't find American accents change that much (I've lived in New York State,
Maryland, and Oregon, and traveled to much of the rest of the country). I
don't know of any American accent that ICSS can't handle. At trade shows we
have seen ICSS work with various foreign accents.
Brandon
ICSS Development - IBM
#: 1 S10/Thread Header
16-Feb-94 16:27:26
Sb: #378-An Odd Application
Fm:
To:
TITLE: AN ODD APPLICATION
KEYWORDS: TIMING TIMESTAMP START
#: 819 S10/IBM Speech Products ∙5 ∙
16-Feb-94 16:27:26
Sb: #378-An Odd Application
Fm: Rufus Laggren 76346,1441
To: Brandon Booth [IBM] 71042,3144
>> The way ICSS works your program tells it when to start gathering
>> speech data and when ICSS returns with the reco'd text it also returns
>> timing data (utterance length, reco time). With this your program should
>> be able to coordinate it's input sources. What you will not get from ICSS
>> is timing information on a word by word basis.
Can ICSS emit msg's to the opsys at pre-specified times (such as "start of
sound" or "end of sound")? Or can it "timestamp" its sound *and* text stream
at predefined intervals?
I'm assuming (from complete ignorance) that it emits at least 2 outputs, the
text and sampled sound stream (?).
Rufus
#: 888 S10/IBM Speech Products ∙5 ∙
19-Feb-94 21:23:20
Sb: #819-An Odd Application
Fm: Brandon Booth [IBM] 71042,3144
To: Rufus Laggren 76346,1441
Rufus
ICSS does not currently post messages for Start of Talking, etc. We have had
this requirement from other customers and it is under consideration. And yes,
we do return the reco'd text, and you can get the sound stream back if you
want it.
Brandon
ICSS Development - IBM
#: 921 S10/IBM Speech Products ∙5 ∙
21-Feb-94 01:55:03
Sb: #888-An Odd Application
Fm: Rufus Laggren 76346,1441
To: Brandon Booth [IBM] 71042,3144
>> ICSS does not currently post messages for Start of Talking, etc. We
>> have had this requirement from other customers
Yes, it seems useful to capture a few "milestones" to maintain a timeline, if
it's not too expensive. Thnks for you quick note.
#: 1 S10/Thread Header
16-Feb-94 19:05:50
Sb: M-ACPA vs SoundBlaster
Fm:
To:
TITLE: M-ACPA VS SOUNDBLASTER
KEYWORDS: M-ACPA MWAVE M-WAVE SOUNDBLASTER SB SOUNDBOARDS
#: 826 S10/IBM Speech Products ∙5 ∙
16-Feb-94 19:05:50
Sb: M-ACPA vs SoundBlaster
Fm: Vernon Harvey 72120,2767
To: Brandon Booth (IBM) 71042,3144
Brandon:
I am currently using the ICSS software with the MMPM/2 and a SoundBlaster 16
ASP card. I have just discovered that the developer's toolkit only allows me
to write PM applications when using the "icssmmpm" device, rather than being
able to write simple text mode C programs. This is not a big deal, but slows
down prototyping a bit.
As a result, I am curious. What are the pros and cons of the SoundBlaster
versus the M-ACPA card. Are there any features on the SB card that I cannot
get on the M-ACPA card or vice versa. And most importantly, will there be any
difference in voice recognition by using one card or the other.
I have never seen the M-ACPA card advertised anywhere, so any pricing info and
features lists would be more than welcome.
Thanks again.
Vernon Harvey.
#: 890 S10/IBM Speech Products ∙5 ∙
19-Feb-94 21:23:28
Sb: #826-M-ACPA vs SoundBlaster
Fm: Brandon Booth [IBM] 71042,3144
To: Vernon Harvey 72120,2767
Hi Vernon
Well the basic difference between SB and ACPA is that SB's are very popular
and ACPA's nearly unknown. In fact IBM is converting over to another series
of cards based on MWAVE technology. We have not noticed any difference in
reco accuracy between the two cards (16 bit in the case of SB, the 8 bit cards
do reco worse).
Brandon
ICSS Development
#: 939 S10/IBM Speech Products ∙5 ∙
21-Feb-94 14:51:56
Sb: #890-M-ACPA vs SoundBlaster
Fm: Vernon Harvey 72120,2767
To: Brandon Booth [IBM] 71042,3144
Brandon:
Thanks for the info on the differences. I guess I will stick with the SB card
for now, and maybe get the M-Wave card when it is released for OS/2. Is the
M-Wave card the one that incorporates voice, data, fax and sound card in a
single board. I remember seeing something at the PACRIM show here, and it
sounded pretty impressive, except for the 9600 baud modem (Personally, I want
V.Fast, but I will settle for 14.4). The card I saw however, only had Windows
software, and they said that an OS/2 version was in the works. Any idea when
its due out?
Regards,
Vernon.
#: 953 S10/IBM Speech Products ∙5 ∙
21-Feb-94 20:24:41
Sb: #939-M-ACPA vs SoundBlaster
Fm: Brandon Booth [IBM] 71042,3144
To: Vernon Harvey 72120,2767
Vernon
I'm sorry, I don't know much about the M-Wave based cards. You might ask over
on the MMPM section of OS2DF1.
Brandon
ICSS Development
#: 1 S10/Thread Header
17-Feb-94 17:55:09
Sb: ICSS - H/W & S/W Quests
Fm:
To:
TITLE: ICSS - H/W & S/W QUESTS
KEYWORDS: TIMING TIMESTAMP START STORAGE DIGITIZE
#: 843 S10/IBM Speech Products ∙5 ∙
17-Feb-94 17:55:09
Sb: ICSS - H/W & S/W Quests
Fm: Marc S Grushcow 70751,1501
To: Brandon Booth 71042,3144
Brandon,
It looks like we might have a paying project that will let us use both OS/2
and ICSS. It will be an interesting way to get our feet wet. I have some
questions about the ICSS API and hardware...
1) Timing
I want to know when the speaker started an utterance. ICSSGetSpokenWords
returns AcquisitionTime and RecognitionTime. Assume that the function just
completed and I immediately get the system time. How good an estimate of
utterance start time is system time minus the sum of the acquisition and
recognition time?
2) Storage
Is it possible to store the digitized data as well as having it go to ICSS for
processing. Ideally, I'd like ICSS to give me the digitized data along with
the text, but I can't find any support for this in the manual. If it could do
this, I wouldn't have to save "silence", but I would happily settle for a
complete record of the digitized session (yes I know the sample rate is
22KHz).
3)Processor, memory and disk
We are looking at a 48xDX2 at 66MHz with 16 or 24MB RAM which meets the h/w
requirement. At 16MB, would we expect to see swapping? I don't expect tht
our app would be really big. (I know that that isn't a lot of help). In any
case, we will probably go SCSI for the disk (.5 or 1GB) and are thinking
about the Adaptec local bus controller (2742 I think). Any comments? Again,
we would like to be able to save the digitized data in parallel with speech
analysis.
4)Sound Cards
I gather that ICSS still doesn't support any DSP units. We will probably get
the Pro Audio Spectrum 16 unless there is a technical reason to go with the
M-ACPA card.
5)A Grammar Question
We will probably go with bigrams based on sample sentences and sentence
fragments. We have quite a lot of these. I was wondering about the BNF form
though. If I write a BNF that can handle...
Buy grapes, oranges, cheese, and bread and beer.
will it also handle...
Buy grapes, oranges ...<long pause>... cheesse, and bread and beer.
where <long pause> is longer than Minimum Silence Time?
Thanks in advance
Marc
#: 848 S10/IBM Speech Products ∙5 ∙
17-Feb-94 22:11:37
Sb: #843-ICSS - H/W & S/W Quests
Fm: Bob Cave 73232,2474
To: Marc S Grushcow 70751,1501
Marc,
Hi. I am sitting in for Brandon for a few days, and I think that I can answer
your questions:
1) Timing - the system time minus sum of acquisition time and reco time should
produce a pretty good estimate. How close does the application need to be?
2) Storing digitized data - No problem here. You can tell ICSS to record each
utterance while it performs recognition. Check the system parameter called
Record Session Name. You give it a path and file name, and ICSS records each
utterance in a separate file with .001, .002, etc. appended to the file name.
I have used this several times, it works great.
3) Hardware reqts - your configuration is quite similar to mine back at the
office. I do get swapping with 16 MB RAM, but I also tend to have a bunch of
other things running at the same time. SCSI disk with the Adaptec controller
sounds great...I wish I had one! If you want to save 22khz samples, go with
the 1GB drive.
4) Sound Cards - PAS 16 is fine, and you are correct--no DSP support.
5) Grammar - when the user is silent for a time greater than the current value
for end talking silence, ICSS will stop listening. You can increase this
value, but that will make all recognition response times longer.
I hope this helps.
Bob Cave, ICSS Development Team
#: 891 S10/IBM Speech Products ∙5 ∙
19-Feb-94 21:23:37
Sb: #843-ICSS - H/W & S/W Quests
Fm: Brandon Booth [IBM] 71042,3144
To: Marc S Grushcow 70751,1501
Hi Marc
On your first question, to be honest I'm not sure how good a number you will
get. Our acquistion time starts when we start acquiring data (hence the
name). The start of talking may occur well after that. We've had customers
ask for messages to be posted back on these kinds of events, and we are
evaluating this enhancement.
On saving the speech signal, look at the SetValue function and the
RecordSpeech and RecordSession settings. That will do what you want. And you
can use either the 22 kHz or 11 kHz sample rates.
16 meg main ram should not show any swapping, so the disk (which sounds quite
nice) shouldn't factor in (except of course if you are saving the sampled
data).
ICSS does not off load DSP processing, so the PAS card is a fine choice.
In any case if a user pauses longer than the end silence time, ICSS will think
she has ended the utterance and will begin to process the speech data. You
can play with this value to get a balance that makes sense to you.
Glad I can help.
Brandon
ICSS Development - IBM
#: 929 S10/IBM Speech Products ∙5 ∙
21-Feb-94 09:12:53
Sb: #891-ICSS - H/W & S/W Quests
Fm: Marc S Grushcow 70751,1501
To: Brandon Booth [IBM] 71042,3144
Brandon,
I think I liked Bob's answer about identifying the start of speech time better
than yours. I hate to start an argument, but who is right? This could be a
sticky problem for us.
Our project is dealing with multimodal dialogues - we are getting input from
both voice and gestures (pen, touch screen or mouse). We can save the "ink"
from the gestures and timestamp them for later analysis. It looks like speech
is going to be a problem.
Two things... First we want to know when the utterance occured (start time and
duration). Second, I think that we want to have the whole voice sessions
recorded, not just the slices where ICSS thinks that it found something. The
scientists involved will want to be able to playback the digitized voice and
see where ICSS found things at least until they are confident that it is
working.
??
Marc
#: 940 S10/IBM Speech Products ∙5 ∙
21-Feb-94 15:04:17
Sb: #929-ICSS - H/W & S/W Quests
Fm: Vernon Harvey 72120,2767
To: Marc S Grushcow 70751,1501
Hi Mark:
Your system sounds a bit similar to my home system on the H/W level. You
might want to check the availability of OS/2 2.1 drivers for the Adaptec VLB
controller. When I bought my system in December, Adaptec had the VLB
controller, but no driver support for OS/2. As a result, I went with the
BusLogic BT-445S (I think that's the right model), and it is working without
any problems. Adaptec was supposed to have drivers
available sometime in January.
Regards,
Vernon Harvey.
#: 951 S10/IBM Speech Products ∙5 ∙
21-Feb-94 20:24:33
Sb: #929-ICSS - H/W & S/W Quests
Fm: Brandon Booth [IBM] 71042,3144
To: Marc S Grushcow 70751,1501
Hi Marc
I'm sorry, but I believe I am correct about the timings returned. Let me make
a destinction between the entire audio data and the utterance data, which is
from the start of talking (when the speech signal rises above the threshold)
to the end of talking (when the speech signal falls below the threshold). In
the shipping products we return the acquistion time, which is the first of
these. Customers have requested being notified of the events in the second of
these. We are evaluating this request. Also, you can record all of the
audio data from the time the card is opened until the time it is closed, and
analyse this if you wish.
Brandon
ICSS Development
#: 983 S10/IBM Speech Products ∙5 ∙
22-Feb-94 21:13:53
Sb: #929-ICSS - H/W & S/W Quests
Fm: Bob Cave 73232,2474
To: Marc S Grushcow 70751,1501 (X)
Marc,
Sorry, but I must yield to Brandon on this one. He is much closer to that
part of the code than I am.
Bob
#: 1 S10/Thread Header
03-Feb-94 00:57:24
Sb: ICSS problem
Fm:
To:
TITLE: ICSS PROBLEM
KEYWORDS: WINDOWS INSTALLVIRTUAL MEMORY SWAPFILE BANDWIDTH MICROPHONE WIN32
#: 634 S10/IBM Speech Products ∙5 ∙
03-Feb-94 00:57:24
Sb: ICSS problem
Fm: Harry R. Crowe 72677,2714
To: sysop
I have just received the ICSS package for Windows v3.1 My install failed. The
process accepted all 9 disks, and then reported that I did not have Virtual
Memory activated. My environment is
MSDOS v6.2
PC = Clone 486DX2/66 with 20M installed RAM
Disks are C=170M and D=130M
Memory management is via native mode MS drivers
(himem.sys and emm386.exe)
Chosen installation area is D-drive with 40M free When I tried to access the
test program (Developer's Toolkit) the system responded by
NO load
NO action
RETURN to Program Manager Nothing else in the ICSS window worked properly.
The DOS path had been ammended and re-boot was performed. I then blew away the
installed programs on D: and attempted to re-install the system. This was not
smoothe. The same result occurred, and I have now given up (in disgust).....
Can you help me???????
#: 642 S10/IBM Speech Products ∙5 ∙
03-Feb-94 15:46:43
Sb: #634-ICSS problem
Fm: Ora Williamson 71742,2161
To: Harry R. Crowe 72677,2714
>>I have just received the ICSS package for Windows v3.1 My install failed.
Harry:
Did you restart Windows before trying anything?
In the Control Panel under 386 Enhanced, check your Virtual Memory size. Can
your swap file grow to 12 Meg or larger? If not, set it up as thus, and try,
try again.
Don't get upset! Once you are correctly configured, you'll be very happy!
Regards, Ora Williamson IBM Spoken Language Systems Lab
#: 671 S10/IBM Speech Products ∙5 ∙
05-Feb-94 22:15:02
Sb: #642-ICSS problem
Fm: Harry R. Crowe 72677,2714
To: Ora Williamson 71742,2161
Thank you for your response. You proposed two steps to start my path to
success
with ICSS:
1. Restart Windows
2. Generate a 12M swapfile
I honestly do not remember if I first restarted windows, but now any time I go
into the Developers Toolkit the screen blackens momentarily and then returns
to the Program Manager. I have not yet implemented the 12M Swap File. I will
do so now with my new 345M hard disk and see what happens - remember from my
first message: I have 20M (!) of installed RAM. Do I still need such a swap
file???
I may be checking back!
thanks again...
HC
#: 673 S10/IBM Speech Products ∙5 ∙
06-Feb-94 01:07:36
Sb: #642-ICSS problem
Fm: Harry R. Crowe 72677,2714
To: Ora Williamson 71742,2161
Ora:
This is one of those times when my extensive computer background is feeling
like a light shirt on a cold night... I have messaged you once since my first
call for help regarding ICSS for Windows. This email is before any reply to
my second message would be possible for you (I think). At your suggestion, I
created a >12M swap file. But I also re-installed the system to my C-drive
(instead of D). The install went more smoothely, and I can now bring up the
Developer's Toolkit. But I cannot DO anything. If I attempt to TEST the
system, I get various non-useful responses of the type that make one crazy.
The TechRef manual says to go to the DevToolKit and select Context from the
Actions pull-down menus.... There IS no Context Menu Item!! As a matter of
fact, the manual does not seem to correspond to any screens that I have
intially found! This is most frustrating. By sheer stubborn menu-hunting I
have found a Grammars/BNF/File/Context item that seems to locate the Testing
context. But then God help me if I try to do anything with it (such as
ACTIVATE it, as stated in the manual). HELP! I only have one life and do not
wish to spend it on ICSS-in-the-dark! Do I have the wrong manual for the
software I was sent???...
For reference, my environment is as follows. I have a 486dx2/66 with MSDOS
v6.2, which performs other multi-media operations fine. The sound card is a
Sound Blaster 16/ASP that DOES record in SB software - so I know the hardware
works... Any assistance would be greatly appreciated. I can call on a voice
line during working hours, but the machine is at home and you know how hard it
is to trouble shoot something you are not in front of!
Thanks,
HC
#: 682 S10/IBM Speech Products ∙5 ∙
06-Feb-94 22:38:14
Sb: #673-ICSS problem
Fm: Brandon Booth [IBM] 71042,3144
To: Harry R. Crowe 72677,2714
Harry
When Ora was speaking about testing ICSS, she was refering to the Test Context
program (wtstctx.exe). This can be invoked from the Development Environment
panel from the Speech pulldown (I think this is correct, I don't quite
remember). You need to load a compiled context that is shipped with ICSS.
These have extension .ctx. Please make sure you have the ICSSDIR environment
variable set to the install directory of ICSS (if its installed on the C:
drive than ICSSDIR should be set to C:\ICSSWIN). The Grammar pulldown is for
compiling contexts. Your machine config sound fine. Please make sure that
you can record and play back audio with your setup.
Brandon
ICSS Development
#: 688 S10/IBM Speech Products ∙5 ∙
07-Feb-94 16:35:46
Sb: #673-ICSS problem
Fm: Dan Devine 76040,215
To: Harry R. Crowe 72677,2714
Harry,
I was getting a similar problem when I tried to activate the testing context
(see my
other post). I spoke to tech support today and they said I was getting the
error
because I only loaded the High not the Medium. The included contexts were
created with the Medium so I need to install that as well. This was not
stated in
the manual either. Hope this helps.
Dan
#: 694 S10/IBM Speech Products ∙5 ∙
07-Feb-94 22:59:00
Sb: #688-ICSS problem
Fm: Brandon Booth [IBM] 71042,3144
To: Dan Devine 76040,215
Dan is quite correct. The ICSS contexts are input sample rate specific. We
have two input rates: High(22050 kHz) and Medium (11025 kHz). The contexts
that we ship are compiled in the Medium rate (bandwidth). You need to have
the Medium models installed in order to do reco with a Medium context.
Brandon
ICSS Development
#: 710 S10/IBM Speech Products ∙5 ∙
09-Feb-94 00:37:45
Sb: #694-ICSS problem
Fm: Harry R. Crowe 72677,2714
To: Brandon Booth [IBM] 71042,3144
BB:
Thank you for your response. With Ora's and your help, the system now seems
to be up and running! I re-installed with the Medium format and the
Test/Sample operations worked fine.... By the way, just what does the WIN32S
component provide? I took a removable (ICSS installed) disk from home to
work, and the DevToolKit would not come up claiming (in one of those fast,
dark screen blips before going back to the program manager that "This program
does not work in DOS mode"). Is this because I had not done an INSTALL on the
work computer? Should I do an incremental install of the WIN32S component to
funtion on both? This is a matter of curiosity, but I WOULD like to know!
Also, while perusing the HELP documents, I noticed a query regarding other
processors. Is the MathCo absolutely necessary (I have a 486dx2/66 at home,
so I am OK there)?? Will a 386/40 work by itself? And if not would it with a
MathCo?
Thanks for your input...
HC
#: 734 S10/IBM Speech Products ∙5 ∙
09-Feb-94 21:16:18
Sb: #710-ICSS problem
Fm: Brandon Booth [IBM] 71042,3144
To: Harry R. Crowe 72677,2714
Hi Harry
Glad you got running. Ora is an amazing help to a lot of people (she's a lot
more people oriented than I; I'm more of a like to program in a cave sort). We
use the Microsoft Win32s support to provide linear memory (as opposed to
segmented memory normal to Windows 3.1), and this is mostly because we ported
from OS/2. ICSS will not work unless Win32s support is installed (although
some components will come up, as they don't use Win32s). Part of ICSS heavily
uses floating point operations and the program is computationally intensive
enough that a 486 DX 33 mHz machine is the smallest machine I'd recommend. A
486 DX2 66 mHz is a very nice machine for ICSS (I develop on one at work - a
clone actually). Did I answer all of your questions?
Brandon
ICSS Development
#: 747 S10/IBM Speech Products ∙5 ∙
10-Feb-94 12:27:00
Sb: #734-ICSS problem
Fm: Bob Hutchison 72254,1315
To: Brandon Booth [IBM] 71042,3144
Brandon,
I purchased ICSS for windows unaware that it required win32s. Now, this
wouldn't normally be a problem, but for some reason win32s will not run with
my windows drivers (S3) -- really annoying, but not your problem.
Why do you use win32s? If it is for 32bit support you can get that with
Watcom's compiler and you would be able to run on just about any windows
machine.
Bob
#: 881 S10/IBM Speech Products ∙5 ∙
19-Feb-94 21:22:47
Sb: #747-ICSS problem
Fm: Brandon Booth [IBM] 71042,3144
To: Bob Hutchison 72254,1315
Bob
We use Win32s for the flat memory addressing as we ported from our OS/2 2.1
version. We felt it was the best solution.
Brandon
ICSS Development - IBM
#: 932 S10/IBM Speech Products ∙5 ∙
21-Feb-94 10:49:58
Sb: #881-ICSS problem
Fm: Bob Hutchison 72254,1315
To: Brandon Booth [IBM] 71042,3144
Brandon,
I guess the cost to me is that I cannot develop for OS/2 and Windows at the
same time. Unless you know something I don't about Win32s under OS/2.
Bob
#: 952 S10/IBM Speech Products ∙5 ∙
21-Feb-94 20:24:38
Sb: #932-ICSS problem
Fm: Brandon Booth [IBM] 71042,3144
To: Bob Hutchison 72254,1315
Bob
If it helps, developing ICSS for all of the operating systems that we do is as
much (or more) of a headache. I'm sorry we can help.
Brandon
ICSS Development
#: 961 S10/IBM Speech Products ∙5 ∙
22-Feb-94 01:03:03
Sb: #952-ICSS problem
Fm: Richard Frank 73511,3406
To: Brandon Booth [IBM] 71042,3144
Brandon,
Is a Sony F-VX500 mic good enough to use? I have no frame of reference yet.
RIchard
#: 989 S10/IBM Speech Products ∙5 ∙
22-Feb-94 23:34:20
Sb: #961-ICSS problem
Fm: Brandon Booth [IBM] 71042,3144
To: Richard Frank 73511,3406 (X)
Richard
I don't know. We've used the mics listed in the tech ref (Sennheiser 410,
518, 431; Conneaut Audio Devices 89; Electro-Voice 857; Radio Shack 33-1037,
33-1067). If you have good results, let us know.
Brandon
ICSS Development
#: 1008 S10/IBM Speech Products ∙5 ∙
23-Feb-94 15:50:25
Sb: #952-ICSS problem
Fm: Bob Hutchison 72254,1315
To: Brandon Booth [IBM] 71042,3144
Brandon,
I'm trying not to criticise too much :-) I appreciate the work you are faced
with.
Bob
#: 711 S10/IBM Speech Products ∙5 ∙
09-Feb-94 00:37:47
Sb: #688-ICSS problem
Fm: Harry R. Crowe 72677,2714
To: Dan Devine 76040,215
Dan:
Thanks for your input. I was in the same boat: "High Bandwidth only"... I
then installed the Medium and all was fine! This area is exploration for me,
but what are people (e.g. you) doing with this?? It is obviously something
that would be served by a better version of OLE!? Like Interactive Object
Embedding? The program development route is pretty long, with many
corners!...
Anyway thanks for the response...
HC
#: 669 S10/IBM Speech Products ∙5 ∙
05-Feb-94 21:58:46
Sb: ICSS problem
Fm: Terry Jones 73131,3137
To: sysop
I appear to have a couple of problems with the Windows development kit I
bought and hope you can help.
When I was installing the software everything went fine up to disk 9, the
WIN32S diskette. When it started up it looked at the A: drive despite all the
others were loaded on the B: drive, and it wasn't possible to load it. How do
I load this?
The second problem is that when I followed instructions on testing the ICSS
installation I got the error
"StartConversation failed rc:15, Invalid ADCIF Number has been
specified"
after selecting TEST CONTEXT in SPEECH. I tried the stock exchange demo and I
got the following error:
"Invalid ADCIF Number has been specified"
Any idea what is causing this, and how to solve it?
My system is a 486 clone, I am using a SoundBlaster board and am using QEMM.
Hope you can help,
Terry Jones
(703) 832-7802
#: 681 S10/IBM Speech Products ∙5 ∙
06-Feb-94 22:38:08
Sb: #669-ICSS problem
Fm: Brandon Booth [IBM] 71042,3144
To: Terry Jones 73131,3137
Terry
You have to have the Win32s support installed for ICSS to work. I don't know
why the Win32s disk was looking for the A drive. What you should do is call
our bug report number (800 - 627 - 8822) and describe your problem to Dan (the
guy who answers that line - he's good at it). The ADCIF message means that it
(ICSS) thinks you have the wrong card number (this is set in one of the
profiles - see the info in the tech ref on profiles). If you have a single
audio card installed, I believe 0 is the card number to use. However, I
would distrust any error messages until you have Win32s correctly installed.
Brandon
ICSS Development
#: 1 S10/Thread Header
28-Feb-94 09:34:24
Sb: ICSS questions
Fm:
To:
TITLE: ICSS QUESTIONS
KEYWORDS: VISUAL BASIC VBX CONFIDENCE CERTAINTY SPEED BANDWIDTH
#: 1094 S10/IBM Speech Products ∙5!∙
28-Feb-94 09:34:24
Sb: ICSS questions
Fm: Stephen Ng 70302,1510
To: all
I've been playing with ICSS/Windows and have been quite impressed with it. I
have a couple of comments/questions/suggestions:
1. I got ICSS working with Visual Basic; it wasn't too bad except I had to
write 2 small C routines to do type conversions, and I also had to use the
MessageBlaster VBX to handle the ICSS reply messages. Of course what would
really be nice here would be a VBX--would anyone else be interested in such a
creature?
2. I hooked up ICSS to my telephone, and have been getting decent results. I
have a little restaurant locator app running on a Dialogic board, and you can
select from among 20 different cuisines by speaking the name of the cuisine
(French, Italian, etc.) (617-643-EATS in case you want to try it.) It seems
to have the most trouble recognizing single syllable words (it always wants to
interpret "French" as "Greek"). The radio shack gadget for getting a
mic-level output from the phone seems pretty cheap and may be adding some
distortion. I also need to fool with the sound card mixer settings to make
sure volume is optimal. Nevertheless, what about adding some kind of
certainty rating to the API so I can see how good the match is? And also a
way to get the next-closest-guess so I can ask the user about the choices?
3. I know I'm really pushing ICSS, since it's designed to work with
high-quality microphones directly into a sound card, and not the telephone
handsets over phone lines. But I'm trying to make things easier by keeping
the grammars trivial, and the vocabularies small. Is it possible to adjust
sound model for typical telephone handset acoustics by using a graphic
equalizer or doing some signal processing on the sound? Can I adjust the data
tables ICSS uses? Or, since it seems to have a propensity to think that most
single-syllable cuisines are "greek" I can bias the bigram table so that
"greek" is less likely? (I can upload the .bnf/.ctx/.wav files if it would
help.)
4. I read that ICSS may use the first utterance to adjust for the particular
mike. Does that mean the first utterance since the ICSS was started? Or per
conversation? My application uses a single conversation for multiple phone
calls--should I use different conversations for different calls so it can
readjust for different phones?
5. Perhaps ICSS has a hard time distinguishing the vowels in "French/Greek".
So would it help if I changed my grammar such that the user had to say
"please" after the cuisine name? I.e. "French, please?" ICSS would know that
the second word was "please", and perhaps that could help it figure out that
the vowel in "french" was not the same sound???
6. The 1000-word "limit"--does the performance degrade linearly or
exponentially as we approach this limit? Another way to ask the question is,
what if I got a machine twice as fast as my 486/66--could I recognize 2000
words as quickly as 1000 words now?
Thanks for your help,
--Steve
#: 1106 S10/IBM Speech Products ∙5!∙
28-Feb-94 21:05:44
Sb: #1094-ICSS questions
Fm: Ken Solkowitz 72103,2111
To: Stephen Ng 70302,1510
Stephen -
I am going to bring your post into work and have Bob take a look at it.
Knowing Bob, he'll take care of this tonight making tomorrow's discussion a
mute point.
Regards,
Ken Solkowitz
IBM Continuous Speech Series
Bethesda, MD 20817
301-803-3897
#: 1111 S10/IBM Speech Products ∙5!∙
28-Feb-94 23:41:42
Sb: #1094-ICSS questions
Fm: Bob Cave 73232,2474
To: Stephen Ng 70302,1510
Stephen,
That is quite a post! I can answer some of your questions now, others will
require some consultation "back at the ranch".
>1)...the ICSS reply messages. Of course what would really be nice here would
>be a VBX--would anyone else be interested in such a creature?
Given some of the discussion in this topic over the past couple of days, I
think the answer is definitely "yes".
2) We are investigating the notion of adding some type of "confidence"
indicator to the response for a recognition. I am not sure about returning
the next best response...I will look into it.
>3. I know I'm really pushing ICSS, since it's designed to work with
high-quality >microphones directly into a sound card, and not the telephone
handsets over >phone lines.
This is true: you are pushing ICSS quite hard, possibly beyond its
capabilities. What is your hardware configuration? How are you getting voice
data into ICSS? Are you splitting the phone line, with one end going into the
Dialogic card and the other end into a sound card? The difficulties with
using ICSS over the phone come from the reduced bandwidth present in the sound
samples. Our speech models are tuned for microphone input bandwidth. I don't
think an equalizer will help much. The models ("data tables") cannot be
modified by an application. In general, the longer and more distinct your
words are, the better recognition you will get.
>4. I read that ICSS may use the first utterance to adjust for the particular
mike.
This is true, and the adjustment is per StartConversation call, I believe.
Using a new conversation for each call would add overhead to your calls. You
can minimize this by making sure that there is always at least one context
loaded. The initial LoadContext call causes a large file called a Common
Probability Distribution file to be loaded, and this can take several seconds.
Once it is loaded, all contexts use it and subsequent LoadContext calls are
quite fast.
>5. Perhaps ICSS has a hard time distinguishing the vowels in "French/Greek".
>So would it help if I changed my grammar such that the user had to say
"please" >after the cuisine name?
I do not think this will buy you anything, but I will check it out.
>6. The 1000-word "limit"--does the performance degrade linearly or
exponentially >as we approach this limit?
Performance for ICSS recognition is affected more by the branching factor
(number of choices for the next word at any given point within a grammar)
within a context than the number of different words. In any case, a faster
machine will definitely produce faster recognition, but not necessarily more
words.
I hope this helps. Please post back if you have any more questions.
Bob Cave, ICSS Development
#: 1155 S10/IBM Speech Products ∙5!∙
02-Mar-94 10:58:44
Sb: #1111-ICSS questions
Fm: Stephen Ng 70302,1510
To: Bob Cave 73232,2474
Bob,
Thanks for your very helpful response! I look forward to any more info you
may have about my questions; meanwhile here are some responses to some of your
questions:
You write:
"Given some of the discussion in this topic over the past couple of days, I
think the answer is definitely "yes"."
I will look into developing a VBX interface to ICSS and possibly distributing
it as shareware. (Would anyone care to volunteer how much this would be worth
to them? "IBM should provide it for free" is a valid, though not encouraging
to me, response.)
You write:
"What is your hardware configuration? How are you getting voice data into
ICSS? Are you splitting the phone line, with one end going into the Dialogic
card and the other end into a sound card? The difficulties with using ICSS
over the phone come from the reduced bandwidth present in the sound samples.
"
I'm running a 486/66DX2 with 16 meg under Windows 3.1, using a Pro Audio
Studio 16, but I'm certainly open to using upgrading my hardware if it will
help. Yes, I'm splitting the phone line just as you say. I realize this is a
difficult problem, but I am hopeful that ICSS will provide at least as good a
solution as some of the dedicated hardware voice recognition solutions
designed for just telephones (speaker-independent, discrete speech, small
vocabulary).
You write:
"In general, the longer and more distinct your words are, the better
recognition you will get."
That has definitely been my experience; my biggest problem is with recognizing
a single syllable word.
You write:
">4. I read that ICSS may use the first utterance to adjust for the particular
mike.
This is true, and the adjustment is per StartConversation call, I believe."
Interesting, so perhaps at the start of each phone call, the I should do a
StartConversation, and ask the user to say "hello"?
You write:
"Performance for ICSS recognition is affected more by the branching factor
(number of choices for the next word at any given point within a grammar)
within a context than the number of different words. In any case, a faster
machine will definitely produce faster recognition, but not necessarily more
words."
So I should keep the grammar perplexity down if ICSS gets too slow. But what
if I have a trivial grammar? I don't understand your comment that a faster
machine will not buy me more words....the manual says that 1000 words is not a
hard limit, but that I simply won't like the performance as I approach 1000
words. Doesn't that mean that I can go over 1000 with a really really fast
machine?
Thanks,
Steve
#: 1163 S10/IBM Speech Products ∙5!∙
02-Mar-94 17:18:52
Sb: #1155-ICSS questions
Fm: Vernon Harvey 72120,2767
To: Stephen Ng 70302,1510
Stephen:
I seem to recall when I was playing with the Dialogic voice card, that there
was an interface bus which allows you to take the audio signal directly off
the card and feed it to another card. If your audio card has an input
connector, maybe you could patch them together to provide a cleaner audio
connection, possibly with less signal loss.
Regards,
Vernon.
#: 1172 S10/IBM Speech Products ∙5!∙
02-Mar-94 21:48:04
Sb: #1163-ICSS questions
Fm: Bob Cave 73232,2474
To: Vernon Harvey 72120,2767
Vernon,
The interface bus used by the Dialogic cards is proprietary, although that is
opening up with the new Signal Computing Systems Archtecture (SCSA) standards.
One can, however, use a Dialogic matrix switch card to provide input to the
sound card through its audio input jack. (I wish I had though to mention this
in my previous reply to Steve <g>).
Thanks,
Bob Cave, ICSS Developoment
#: 1182 S10/IBM Speech Products ∙5!∙
03-Mar-94 13:44:11
Sb: #1155-ICSS questions
Fm: Vernon Harvey 72120,2767
To: Stephen Ng 70302,1510 (X)
Stephen:
I did a quick look through my technical info on the Dialogic card last night.
If you have a D/4X series card, there is a 20 pin bus connector which allows
you to get audio from the card for any of the 4 channels. I found the info in
the appendix, and it may be of some use to connect to card to your sound card
for speech recognition. The connector is referred to as a P3 audio connection
bus in my Dialogic manual. I don't think that you need the Dialogic Matrix
switch in order to connect the two cards, however, you may want to check with
Dialogic tech support to be sure.
Regards,
Vernon.
#: 1333 S10/IBM Speech Products ∙5!∙
11-Mar-94 11:03:39
Sb: #1182-ICSS questions
Fm: Stephen Ng 70302,1510
To: Vernon Harvey 72120,2767
Vernon,
Thanks for the tip! What publication are you looking at? I've looked my D/4x
Hardware reference book, but it seems to just refer me to other docs. I'll
try to get the info from Dialogic directly.... I would like to try hooking up
to the bus directly to see if I can get a cleaner signal. (The main issue
appears to be making sure my sound card doesn't get fried when the phone
rings.)
--Steve
#: 1344 S10/IBM Speech Products ∙5!∙
11-Mar-94 18:06:11
Sb: #1333-ICSS questions
Fm: Vernon Harvey 72120,2767
To: Stephen Ng 70302,1510
Stephen:
Sorry for the delay in replying, I have not been online lately (Busy cruising
around in my new car!). The manual which I found this information in was the
Dialog/4X User's Guide, version 2.30, which appears to date back to about
1988. It was in one of the appendices. I have come to the point where I
never throw out the old manuals, as it seems that there are always ommissions
in the future releases. Particularily when the company decides that it can
make more money selling the information.
Glad to have been able to help.
Regards,
Vernon Harvey
Rydex Industries Corporation.
#: 1242 S10/IBM Speech Products ∙5!∙
06-Mar-94 18:20:14
Sb: #1155-ICSS questions
Fm: Rufus Laggren 76346,1441
To: Stephen Ng 70302,1510
>... Shareware VBX for ICSS... value ??
IMHO:
Workable -> $30+
Very good -> $50+
Just a guess, I'm afraid, and biased to the low-medium end. I.e., if the base
package becomes available for about $200 or less (hopefully), it allows
"impulse purchase" experimentation by many developers and a VBX priced as
above would fit that market. However, I confess to usually being low in my $$
estimates.
Would suggest that you include a nag screen (not too gross), and configure
your release to allow some material enhancement to be provide through
registration. E.g., distribute a fully usable package, but provide optimized
dll's or some small useful enhancements with registration... I judge others
by myself, and I think that such a "reminder" is needed in shareware, and that
although it's a departure from the pure free test traditionally provided, it's
justified and fair and ultimately acceptable.
I'd like to shareware continue to produce fine products, and I think that many
people which otherwise would just use the product forever free would consider
registering to gain a some small-but-not-purely-symbolic enhancement if they
indeed find the base product fairly useful.
Hope you find it do-able. Good luck.
#: 1243 S10/IBM Speech Products ∙5!∙
06-Mar-94 18:20:21
Sb: #1111-ICSS questions
Fm: Rufus Laggren 76346,1441
To: Bob Cave 73232,2474
>> The difficulties with using ICSS over the phone come from the reduced
>> bandwidth present in the sound samples. Our speech models are tuned for
>> microphone input bandwidth.
Is it possible to utilize different "speech models"? Perhaps one optimized
for telephony? Is it feasible to create a speech model through a training
session using the actual equipment with a specific limited speech/text stream
which gets recorded and then corrected or "set" into the speech model? Note
opportunity for additonal products here <g> (various speeck models,
"mastering" software to create speech models...).
>> Performance for ICSS recognition is affected more by the branching
>> factor (number of choices for the next word at any given point within a
>> grammar) within a context than the number of different words.
Hmm. Accruacy depends on the "number of choices for the next word..", but not
the "..number of different words"? ..... ??????
Can I *preset* the branching profile for the next incoming speech stream *and*
preset the available vocalbulary selection such that amoung very limited
choices the hit-rate or confidence level rises substantially? Or is this
already an automatic part of the ICSS functionality?
No rush. When you get a chance. Thanks.
#: 1245 S10/IBM Speech Products ∙5!∙
06-Mar-94 21:07:13
Sb: #1243-ICSS questions
Fm: Bob Cave 73232,2474
To: Rufus Laggren 76346,1441
Rufus,
>>Is it possible to utilize different "speech models"?
>>Perhaps one optimized for telephony?
Yes, this is possible, and I agree with regard to the additional product
opportunities. Many of our customers (and potential customers) have expressed
a requirement for a version of ICSS which works in a telephony environment.
>>Hmm. Accruacy depends on the "number of choices for the next word..",
>>but not the "..number of different words"? ..... ??????
When I said "performance" in the quote you included, I was referring to
recognition speed, not recognition accuracy. I should have been more
explicit.
>> Can I *preset* the branching profile for the next incoming speech stream
>>*and* preset the available vocalbulary selection such that amoung very
>> limited choices the hit-rate or confidence level rises substantially? Or
is
>>this already an automatic part of the ICSS functionality?
You can have multiple vocabularies (or "contexts" in ICSS terminology) loaded
into memory at any given time. Each time you perform recognition, you can
select any one of the loaded contexts. You will get the best accuracy by
defining small contexts with words which are distinct from one another rather
than putting all possible responses into a single context. Does this answer
the question?
Regards,
Bob Cave, ICSS Development
#: 1259 S10/IBM Speech Products ∙5!∙
07-Mar-94 23:39:08
Sb: #1245-ICSS questions
Fm: Rufus Laggren 76346,1441
To: Bob Cave 73232,2474
>> You will get the best accuracy by defining small contexts with words
>> which are distinct from one another rather than putting all possible
>> responses into a single context
Yes, does answer my question, point on. Thanks. I was wondering what would
improve the "greek" interpretation of "French" over the phone. Seems by
defining small context for a particular expected response, even 1 sylable
words would be recognized accurately.
Incidentally, how *does* one build a speech model?
Thanks for you quick note. Rufus
#: 1284 S10/IBM Speech Products ∙5!∙
08-Mar-94 21:52:44
Sb: #1259-ICSS questions
Fm: Bob Cave 73232,2474
To: Rufus Laggren 76346,1441
Rufus,
>> Incidentally, how *does* one build a speech model?
The truth is, I know very little about that part of the process. I know that
we start with several thousand recorded utterances and transcripts, and the
whole thing ties up a high-end Risc System/6000 for 24 hours or so. Beyond
that, it may as well be magic <g>.
Perhaps Brandon knows more than I do on this one?
Regards,
Bob Cave, ICSS Development
#: 1187 S10/IBM Speech Products ∙5!∙
03-Mar-94 17:05:11
Sb: #1170-ICSS questions
Fm: Stephen Ng 70302,1510
To: Bob Cave 73232,2474
Bob,
Of course what's frustrating/tantalizing about this is that it seems to work
perfectly for a while, then it has periods of not matching at all. I've
started recording the sessions as you've suggested.
I uploaded two files "Barbe00.wav" and "desse00.wav" which are me saying
"French" and ICSS matching "barbecue" and "dessert". Can you tell me if there
is any mangling I can do to these files to make them match better?
Thanks,
Steve
#: 1197 S10/IBM Speech Products ∙5!∙
03-Mar-94 22:29:45
Sb: #1187-ICSS questions
Fm: Bob Cave 73232,2474
To: Stephen Ng 70302,1510
Steve,
I have downloaded the files and will take them to work and listen to them
tomorrow. It would also be helpful if you could upload your BNF file and tell
me what parameters you are using when you create the context. This will allow
me to see if I can duplicate the results you are getting.
Thanks,
Bob Cave, ICSS Development
#: 1199 S10/IBM Speech Products ∙5!∙
04-Mar-94 11:52:07
Sb: #1197-ICSS questions
Fm: Stephen Ng 70302,1510
To: Bob Cave 73232,2474
Bob,
I really appreciate your taking the time to look at this, especially since I'm
really pushing ICSS to its limits.
With the fax I received from Vernon, I hooked up the line in on the Pro Audio
Studio card directly to the Dialogic card this morning. I *think* I'm getting
cleaner audio; I'll send you a sample file by mail (I can't upload files onto
the conference).
I'll be doing some testing over the next few days to try to get some idea
about whether this improves things.
Yesterday I tried a quick test of the "<cuisine>, please" and "please,
<cuisine>, please" idea but it didn't seem to help things. Starting a new
conversation for each phone call didn't seem to improve the accuracy much
either.
Thanks again,
--Steve
#: 1244 S10/IBM Speech Products ∙5!∙
06-Mar-94 20:48:08
Sb: #1199-ICSS questions
Fm: Bob Cave 73232,2474
To: Stephen Ng 70302,1510
Steve,
I got your file and will look at it on Monday.
The other files you sent had a considerable amount of distortion in the
background (an audible hum). When I brought up the files under the Test
Context function, and looked at the waveform, it was clear that there was a
problem. The distortion was well above the default thresholds, and ICSS would
have attempted to perform recognition on it.
Regards,
Bob Cave, ICSS Development
#: 1200 S10/IBM Speech Products ∙5!∙
04-Mar-94 11:52:09
Sb: #1192-ICSS questions
Fm: Stephen Ng 70302,1510
To: Vernon Harvey 72120,2767
Vernon,
Thanks for the fax from the manual page; it's exactly what I needed. (Makes
me wonder why I paid 30% more for my board to get it direct from Dialogic when
their tech support wasn't that helpful--why so desperate for my 45 bucks?)
I hooked up pin 1 (line 1) and pin 2 (line 2) of P3 to the left and right
signal lines, and pin 20 to the ground line. Seems to work fine. But ICSS
appears to respond to only one side...no big deal.
Dialogic told me it's about 3db less than tip and ring coming straight off the
phone line, and that they weren't sure if the 12 volts (or whatever) that you
get when the phone rings would come in through these lines ("probably not, but
find a friend who is handy with a voltmeter"). Well, I tested it myself,
there is no signal on those lines until a connection is made.
I'll be testing today to see if this actually improves my ICSS performance.
Thanks again,
Steve
#: 1207 S10/IBM Speech Products ∙5!∙
04-Mar-94 16:49:34
Sb: #1200-ICSS questions
Fm: Vernon Harvey 72120,2767
To: Stephen Ng 70302,1510 (X)
Stephen:
If I understand you correctly, it sounds like you are trying to connect a mono
telephone output to a stereo input on your sound card. The AUD1 is audio
output for telephone line 1, while AUD2 is for your second phone line.
Remember, the Dialogic card supports up to 4 incoming telephone lines on a
single card. If possible, see if you can connect to the microphone input on
your sound card, as that should be a mono input (Unless you have left and
right mike inputs).
Regards,
Vernon.
#: 1250 S10/IBM Speech Products ∙5!∙
07-Mar-94 10:25:15
Sb: #1207-ICSS questions
Fm: Stephen Ng 70302,1510
To: Vernon Harvey 72120,2767
Vernon,
My last message wasn't too clear.
I have a phone simulator on line 1 of the dialogic card, and my actual phone
on line 2. While testing, I want the sound card to take input from line 1,
and during production to take input from line 2. So I thought I could hook up
line 1 to the left channel, and line 2 to the right. This works fine as far
as the sound card is concerned, but ICSS doesn't seem to recognize input from
one of the channels (the right channel?). No big deal, as it just means I
need to plug and unplug phone some phone lines when going from testing to
production. For real multi-line support, I'll need an audio mixer or an AMX
card.....
--Steve
#: 1256 S10/IBM Speech Products ∙5!∙
07-Mar-94 21:37:21
Sb: #1250-ICSS questions
Fm: Brandon Booth [IBM] 71042,3144
To: Stephen Ng 70302,1510
Steve
ICSS only deals with mono. It would be very confused with stereo input. Or
did I misunderstand your post?
Brandon
ICSS Development
#: 1258 S10/IBM Speech Products ∙5!∙
07-Mar-94 22:52:23
Sb: #1250-ICSS questions
Fm: Bob Cave 73232,2474
To: Stephen Ng 70302,1510
Steve,
I had a look at the new files you sent today. They are definitely better
sounding than the previous set, but I did not have much luck using them for
recognition with the BNF you provided (although it worked fine when I used the
microphone). When I looked at the waveforms under Test Context, there was
still a fair amount of noise showing up. You may want to consider raising the
Start Talking and End Talking thresholds, since the noise does not seem that
loud.
Regards,
Bob Cave, ICSS Development
#: 1309 S10/IBM Speech Products ∙5!∙
09-Mar-94 23:31:43
Sb: #1258-ICSS questions
Fm: Stephen Ng 70302,1510
To: Bob Cave 73232,2474
Bob,
Thanks, I will try raising the thresholds and see if that improves things.
Brandon: Did my subsequent message clear up the stereo input comment I had?
I basically am using the left channel for one phone line, and the right phone
line for another phone line, but I never have both lines active at once
(certainly it would be understandable for this to confuse ICSS). However ICSS
doesn't seem to hear one of the channels. This is a minor point for me(it may
even be a glitch with my setup).
--Steve
#: 1 S10/Thread Header
14-Mar-94 09:50:38
Sb: ICSS Spelling Letters
Fm:
To:
TITLE: ICSS SPELLING LETTERS
KEYWORDS: ALPHABET
#: 1362 S10/IBM Speech Products ∙5!∙
14-Mar-94 09:50:38
Sb: ICSS Spelling Letters
Fm: Office Automation 100012,3267
To: All
Hello,
I am using ICSS under OS/2 2.1 with a IBM PC 486 66Mhz with 16M of ram.
I need to be able to spell names to ICSS but I have big problems because
the accuracy is very very bad. Is it normal and is there a solution to solve
this problem?
Ian GAMBINI, AMADEUS Company
#: 1366 S10/IBM Speech Products ∙5!∙
14-Mar-94 14:06:02
Sb: #1362-ICSS Spelling Letters
Fm: Vernon Harvey 72120,2767
To: Office Automation 100012,3267
Ian:
How are you doing the spelling? Are you using single phoneme based spelling
characters, such as "ay", "bee" "see", etc. If so, you will probably have
recognition problems as ICSS seems to work best with multi-syllable
recognition (probably because there is more data to recognize). You may want
to try a military based spelling operation such as "FOXTROT ROMEO ALPHA NANCY
CHARLIE INDIGO EDWARD" to spell "Francis" (I don't think I have the correct
words in the example, but I think you see what I am getting at). As I recall,
I think there was a dictionary or BNF file in one of the ICSS directories
which contains the police car vocabulary. That demo used the same sort of
logic to enter license plate numbers, so you could probably scalp the word
list from there.
Regards,
Vernon Harvey.
#: 1379 S10/IBM Speech Products ∙5!∙
15-Mar-94 04:46:21
Sb: #1366-ICSS Spelling Letters
Fm: Denis Lacroix 100012,3270
To: Vernon Harvey 72120,2767
First, thank you for your answer.
Is there really no other solution to spell letters, because in this case you
have to learn all the correspondance between letters and words ( A-ALPHA,
B-BRAVO,...).
You can't ask everybody to learn this syntax just for spelling.
I am using a DX 66, do you think it will be really better if I use a Pentium ?
Regards,
Ian GAMBINI, AMADEUS Company
#: 1393 S10/IBM Speech Products ∙5!∙
15-Mar-94 20:52:43
Sb: #1379-ICSS Spelling Letters
Fm: Bob Cave 73232,2474
To: Denis Lacroix 100012,3270
Basically, what Vernon said was correct. Recognizing individual letters for
the purpose of spelling names is extremely difficult. There is simply not
enough information available for ICSS to reliably distinguish between the
choices. The military alphabet was created to solve this problem of speech
recognition between humans. It can be used with ICSS also, and will produce
far better results than saying the letters by themselves. I recognize the
impracticality of having everyone memorize the military alphabet for the
purpose of talking to a computer.
If there is a very limited set of valid combinations of letters, they can be
added to a context as words (ABC = EY B IY S IY in ICSS phonemes).
Regards,
Bob Cave, ICSS Development
#: 1469 S10/IBM Speech Products ∙5!∙
19-Mar-94 12:30:04
Sb: #1379-ICSS Spelling Letters
Fm: Rufus Laggren 76346,1441
To: Denis Lacroix 100012,3270
>> Is there really no other solution to spell letters, because in this
>> case you
FWI. (I'm no expert). You have 1000 words to play with for a context (and you
can switch contexts instantly). So you are not limited to the words used by
police or military. If you know which letters (or numbers, etc) *must* occur
next, you can limit the selection further, as is suggested elsewhere here.
I don't know if this will help, but you if you favor the letter sound very
much, you may also try *pre*-fixing the actual sound by saying "letter" so the
machine hear "letter-ay", for example. Not sure this will help or not.
Think how much your understanding of what *you* hear depends on the *context*,
rather than on the sound stream. You're a lot more complex than ICSS, and I
think you would have trouble gathering data w/out the context.
Best luck, Rufus
#: 1374 S10/IBM Speech Products ∙5!∙
14-Mar-94 23:28:51
Sb: #1362-ICSS Spelling Letters
Fm: Brandon Booth [IBM] 71042,3144
To: Office Automation 100012,3267
Ian
All speech recognizers that I know about have trouble recognizing single
phoneme utterances. This is also true of people! The problem stems from the
fact that there is no data about preceeding and succeeding phones (as it is a
single phone), and so there is no data to distinguish confusible phones (e.g.,
'b' and 'p'). As a subsequent post mentions using the military "foxtrot,
tango" style (or any equivalent) works much better.
Brandon
ICSS Development
#: 1380 S10/IBM Speech Products ∙5!∙
15-Mar-94 05:01:52
Sb: #1374-ICSS Spelling Letters
Fm: Denis Lacroix 100012,3270
To: Brandon Booth [IBM] 71042,3144
First, thank you for your answer.
Is there really no other solution to spell letters, because in this case you
have to learn all the correspondance between letters and words ( A-ALPHA,
B-BRAVO,...).
You can't ask everybody to learn this syntax just for spelling.
I am using a DX 66, do you think it will be really better if I use a Pentium ?
Regards,
Ian GAMBINI, AMADEUS Company
#: 1398 S10/IBM Speech Products ∙5!∙
15-Mar-94 23:08:38
Sb: #1380-ICSS Spelling Letters
Fm: Brandon Booth [IBM] 71042,3144
To: Denis Lacroix 100012,3270
Denis
I'm sorry, but a fast machine isn't the answer (at least in this case!). The
problem stems from the length of the utterance.
Brandon
ICSS Development
#: 1 S10/Thread Header
24-Mar-94 11:00:06
Sb: ICSS Accuracy
Fm:
To:
TITLE: ICSS ACCURACY
KEYWORDS: CONFIDENCE CERTAINTY RECOGNITION
#: 1534 S10/IBM Speech Products ∙5!∙
24-Mar-94 11:00:06
Sb: ICSS Accuracy
Fm: Office Automation 100012,3267
To: All 72120,2767
Hello,
I have a big problem with the ICSS recognition, because I have always an
answer. Is there no solution to know if the recognition is good enough to be
accepted. For example, if I say "AVAILABILITY TO TOKYO", sometimes the ICSS
answer is "MOVE UP" !!! For my project, it's impossible to do something
really good with this problem.
I have also another question. Is it possible to say "I WOULD LIKE AN
AVAILABILITY FROM..." and to ignore all the speech before the word
AVAILABILITY ?
If not, is it possible to ask ICSS which phonemes have been recognized?
Ian GAMBINI, AMADEUS Company
#: 1547 S10/IBM Speech Products ∙5!∙
24-Mar-94 21:36:39
Sb: #1534-ICSS Accuracy
Fm: Bob Cave 73232,2474
To: Office Automation 100012,3267
Ian,
The current version of ICSS does not return a confidence factor. It is up to
the application to parse the response to determine if it is usable. We are
looking into ways to enhance the product in this area.
>> Is it possible to say "I WOULD LIKE AN AVAILABILITY FROM..." and to
>> ignore all the speech before the word AVAILABILITY ?
No. ICSS will use all of the speech input for recognition. It does not
perform word spotting.
>> If not, is it possible to ask ICSS which phonemes have been recognized?
ICSS can not directly return the phonemes that were recognized. Your best bet
would be to look up the recognized words in the dictionary to get the
associated phonemes.
Regards,
Bob Cave, ICSS Development, IBM
#: 1612 S10/IBM Speech Products ∙5!∙
28-Mar-94 23:20:50
Sb: #1534-ICSS Accuracy
Fm: Brandon Booth [IBM] 71042,3144
To: Office Automation 100012,3267
Ian
Our recognition is quite good if you have your system configured properly. You
should record a speech file with your mic and sound card setup and listen to
it, looking for excessive noise. If you continue to have a problem, you may
want to call our tech support line 1-800-627-8363.
Brandon
ICSS Development
#: 1 S10/Thread Header
30-Mar-94 06:16:29
Sb: Headset
Fm:
To:
TITLE: HEADSET
KEYWORDS:
#: 1640 S10/IBM Speech Products ∙5!∙
30-Mar-94 06:16:29
Sb: Headset
Fm: Tim M. Heagarty 71541,1432
To: All
Folks,
I am looking for a headset that could be used with the ICSS system and a SB-16
card. Anyone out there have one that they like? You could save me a lot of
looking and spec. checking. I want an operator to be able to sit and speak to
their system and hear the answer without disrupting all others around them.
It should cut down on background noise too.
Thanks for your assistance.
Tim Heagarty
#: 1655 S10/IBM Speech Products ∙5!∙
30-Mar-94 23:12:20
Sb: #1640-Headset
Fm: Brandon Booth [IBM] 71042,3144
To: Tim M. Heagarty 71541,1432
Tim
The headset that we are familiar with is the Sennheiser 410. This works quite
well, but is a probably a little expensive.
Brandon
ICSS Development
#: 1 S10/Thread Header
21-Mar-94 06:47:06
Sb: #1480-ICCS
Fm:
To:
TITLE: ICCS
KEYWORDS: THRESHOLD NOISE NOISY ENVIRONMENT
#: 1482 S10/IBM Speech Products ∙5!∙
21-Mar-94 06:47:06
Sb: #1480-ICCS
Fm: Ken Solkowitz 72103,2111
To: Richard Frank 73511,3406
Richard -
Do you know how to adjust the thresholds when you're in a noisy environment?
At comdex we use 1500-1750. Here in the office we use around 750. I will check
on the Airborne shipment. The mic was originally for Norm but how about if I
allow you to keep it and use it to promote ICSS the best that you can. Don't
publicize that I gave you the mic. But do publicize if we've been responsive
and helpful. If you need anything else, glossies, fact sheets, demos sent to
someone, please contact me.
Regards,
Ken Solkowitz
IBM Continuous Speech Series
Bethesda, MD 20817
301-803-3897
Fax: 301-803-3167
#: 1486 S10/IBM Speech Products ∙5!∙
21-Mar-94 11:14:51
Sb: #1482-ICCS
Fm: Richard Frank 73511,3406
To: Ken Solkowitz 72103,2111
> Do you know how to adjust the thresholds when you're in a noisy
> environment? At comdex we use 1500-1750. Here in the office we use
> around 750.
I went up to 2000 on the thresholds, but no joy. We used the mixer to
decrease the mic input and increase the speaker output and it worked fairly
well.
> I will check on the Airborne shipment. The mic was originally for
> Norm but how about if I allow you to keep it and use it to promote
> ICSS the best that you can.
I bought my own mic, but Norm really should have one. He can reach alot of
people through his show.
> If you need anything else, glossies, fact sheets, demos sent to
> someone, please contact me.
Once I sort through the left-over literature I'll let you know. If you have
some literature on the IPDS I would like to have it.
Richard Frank
#: 1 S10/Thread Header
24-Mar-94 00:00:47
Sb: ICSS not working
Fm:
To:
TITLE: ICSS NOT WORKING
KEYWORDS: ADCIF SOUNDBLASTER SB
#: 1525 S10/IBM Speech Products ∙5!∙
24-Mar-94 00:00:47
Sb: ICSS not working
Fm: Brian Gorrell 70127,1712
To: Sysop
I just received my copy of the ICSS Developers Toolkit today. During the test
program I get the following message when I press recognize: A failure was
detected in the ISTG, ISTG restart has been attempted. I have changed the
configuration parameters to Identification Number "1" and Input Source Name
"icssmmpm". I loaded the TESTING context. My configuration is a Pentium
66MHz machine with a Soundblaster 16 SCSI-2 and its default mike (600 ohms
input impedence and sensitivity of 10 to 200mV) running on OS/2 2.1, MMPM/2,
and the service packs for both. I verified that the mike is working by
recording some speech with the Digital Audio Since I am running the service
pack, I UNPACKed the MEMSH.DLL from the original OS/2 2.1 GA CD. UNPACK
reported that it did not need to be unpacked so it copied it as is. The file
size is slightly smaller than the one that came with the CSD. It looks
correct to me. What can I do to resolve this? Thanks. BTW, I did not find
any mention of this error message in the manual, but since I just got it maybe
I overlooked it.
#: 1546 S10/IBM Speech Products ∙5!∙
24-Mar-94 21:28:13
Sb: #1525-ICSS not working
Fm: Bob Cave 73232,2474
To: Brian Gorrell 70127,1712
Brian,
The symptom you have is the one that occurs with the CSD version of MEMSH.DLL.
Have you tried recording a file with the ICSS utility for recording files
(under the Speech pull-down)? I am sitting here at home without my tech
reference, and I cannot remember if MMPM wants the ACDIFNumber for the first
card to be 0 or 1. Try setting it to 0.
Regards,
Bob Cave, ICSS Development
#: 1551 S10/IBM Speech Products ∙5!∙
24-Mar-94 23:09:18
Sb: #1546-ICSS not working
Fm: Christopher R. Peck 74214,3567
To: Bob Cave 73232,2474
I received the exact same message (I check the ACDIF # with the Digital Audio
App under MMPM like the readme said to & it's 1 for my SB16). I copied the
MEMSH.DLL file from the original OS/2 2.1 CD & still got the same error (I
went out & bought a new mike to be sure it wasn't that). I then tried the
"Stock Market" & that seemed to work ok (it missed alot of the numbers & said
- but it did pick up what i said). Any suggestions???
#: 1570 S10/IBM Speech Products ∙5!∙
26-Mar-94 02:03:01
Sb: #1546-ICSS not working
Fm: Brian Gorrell 70127,1712
To: Bob Cave 73232,2474
Thank you for your response, but I still am having the same problem. I double
checked the MEMSH.DLL file. It is 13968 bytes dated 5-07-93 at 12:18pm. I
copied the Service Pack version to a backup directory before putting the GA
version in the \MMOS2\DLL directory. The SP version is 14848 bytes dated
2-08-94 at 7:19pm. So this can not be the problem. As for the ACDIFNumber,
trying it at 0 made no difference. I believe 1 is correct though. I also
tried your other suggestion of recording a file. The recording went okay and
I then went into the test program and tried to use the recorded file as the
input. It went through without an error this time, but it recognized "Testing
testing testing, one two three" as "Zero zero six two zero six one two three
six." I then tried loading the RL context and used the sample WAV file
included and it correctly recognized "Where is the nearest french restaurant."
What else can I do to resolve this? Thanks.
#: 1608 S10/IBM Speech Products ∙5!∙
28-Mar-94 21:54:08
Sb: #1570-ICSS not working
Fm: Bob Cave 73232,2474
To: Brian Gorrell 70127,1712
Brian,
Did you record the your sample file at 11 khz ("medium" setting in the record
function, I believe)? The testing context and the RL context are delivered
for medium bandwidth only.
I will need to check out my setup back at the office to confirm the MEMSH.DLL
sizes, and will post back tomorrow.
Regards,
Bob Cave, ICSS Development, IBM
#: 1635 S10/IBM Speech Products ∙5!∙
29-Mar-94 22:54:43
Sb: #1608-ICSS not working
Fm: Bob Cave 73232,2474
To: Bob Cave 73232,2474
I installed the OS/2 2.1 Service Pack and the MMPM/2 Service Pack. The
service pack for MMPM definitely kills ICSS. I tried using the GA version of
MEMSH.DLL to no avail, ICSS was still broken. I was able to fix ICSS by using
the SB16 drivers (SB16D2.SYS) from the GA version, however, and they work with
either version of MEMSH.DLL. Using the CSD version of SB16D2.SYS seems to
cause the problem with ICSS. The GA version of the driver is dated 5/7/93,
and the size is 41956 bytes.
I have downloaded new drivers from the Creative Labs forum here on CIS, and I
will try those tomorrow (although I think they may be the same as the drivers
that came in the MMPM CSD).
Stay tuned...I should have more info tomorrow evening.
Bob Cave, ICSS Development, IBM
#: 1657 S10/IBM Speech Products ∙5!∙
30-Mar-94 23:12:26
Sb: #1635-ICSS not working
Fm: Brandon Booth [IBM] 71042,3144
To: Bob Cave 73232,2474
Bob Cave did have good luck with the Sound Blaster drivers he downloaded from
CompuServ. He'll have to give the details, as I forgot the paper I printed
off at work.
Brandon
ICSS Development
#: 1687 S10/IBM Speech Products ∙5!∙
01-Apr-94 18:41:54
Sb: #1657-ICSS not working
Fm: Bob Cave 73232,2474
To: Brandon Booth [IBM] 71042,3144
Here is what I found regarding MMPM and ICSS:
For best results, use the GA version of MEMSH.DLL with either the GA
SB16D2.SYS, or the latest version available in the Creative Labs section here
on CIS (sb_os2.exe in library 3, I think). I was not able to get ICSS to work
with the SP versions of MEMSH.DLL and SB16D2.SYS.
I hope this clears things up.
Bob Cave, ICSS Development, IBM
#: 1706 S10/IBM Speech Products ∙5!∙
02-Apr-94 13:35:27
Sb: #1687-ICSS not working
Fm: Brandon Booth [IBM] 71042,3144
To: Bob Cave 73232,2474
Thanks for looking into that, Bob.
Brandon
ICSS Development
#: 1614 S10/IBM Speech Products ∙5!∙
28-Mar-94 23:20:54
Sb: #1546-ICSS not working
Fm: Brandon Booth [IBM] 71042,3144
To: Bob Cave 73232,2474
Bob
For MMPM the ADCIFNumber (Analog to Digital Conversion InterFace Number,
horrible acronym!) of 0 means use the current default, if you have one card
this should be the same as using 1.
Brandon
ICSS Development
#: 1613 S10/IBM Speech Products ∙5!∙
28-Mar-94 23:20:52
Sb: #1525-ICSS not working
Fm: Brandon Booth [IBM] 71042,3144
To: Brian Gorrell 70127,1712
Brian
The MMPM folks (over on OS2DF1 forum) told us that it was enough to keep the
original memsh.dll, so I'm not sure what the problem is. You might try using
the GA MMPM. Do make sure to install the medium models as the supplied
contexts (e.g., the testing context) are compiled in medium.
Brandon
ICSS Development
#: 1712 S10/IBM Speech Products ∙5!∙
02-Apr-94 16:00:42
Sb: #1613-ICSS not working
Fm: Brian Gorrell 70127,1712
To: Brandon Booth [IBM] 71042,3144
>> Brian The MMPM folks (over on OS2DF1 forum) told us that it was enough
>> to keep the original memsh.dll, so I'm not sure what the problem is.
>> You might try using the GA MMPM. Do make sure to install the medium
>> models as the supplied contexts (e.g., the testing context) are
>> compiled in medium. Brandon ICSS Development
Brandon
Going back to the GA MMPM fixed my problems, however, I would like to see a
fix for this so that I can use the CSD version of MMPM. I need it to fix some
SoundBlaster 16 problems with the original GA MMPM. Only replacing the
MEMSH.DLL as has been suggested will not fix the problem with ICSS. Thanks.
#: 1748 S10/IBM Speech Products ∙5!∙
04-Apr-94 21:13:59
Sb: #1712-ICSS not working
Fm: Brandon Booth [IBM] 71042,3144
To: Brian Gorrell 70127,1712
Brian
My Co-worker Bob Cave has had some success using the latest Sound Blaster
drivers available from the Creative Labs section of CompuServ (Go BLASTER I
think), with the corrective service version of MMPM. You might want to try
that.
Brandon
ICSS Development
There is 1 Reply.
#: 1750 S10/IBM Speech Products ∙5!∙
04-Apr-94 21:55:22
Sb: #1748-ICSS not working
Fm: Bob Cave 73232,2474
To: Brandon Booth [IBM] 71042,3144
Brandon,
To clarify -- I was not able to get the SP version of MEMSH.DLL to work with
any
SB16 drivers (I know, I told you I did, but when I did further testing, it
turned out to still be broken...worked for two recos, then hung...my
apologies). I was able to use the SP version of MMPM with the GA MEMSH.DLL,
however, with either the GA SB16 drivers or the new drivers from CIS.
It seems that this is a "your mileage may vary" situation. Some folks are
able to get things working by only replacing the DLL. This was not the case
for me, and a couple of other people. This problem has been reported with
ACPA cards also. The MMPM folks are working hard to find out what is wrong.
Bob Cave, ICSS Development, IBM
#: 1837 S10/IBM Speech Products ∙5!∙
09-Apr-94 12:26:58
Sb: #1750-ICSS not working
Fm: Brandon Booth [IBM] 71042,3144
To: Bob Cave 73232,2474
Thanks Bob,
Steve C. is continuing to look at the situation for us. The problem has
turned out to be more difficult than we first thought, and it is uncertain
when it will be resolved.
Brandon
ICSS Development
#: 1712 S10/IBM Speech Products ∙5!∙
02-Apr-94 16:00:42
Sb: #1613-ICSS not working
Fm: Brian Gorrell 70127,1712
To: Brandon Booth [IBM] 71042,3144
>> Brian The MMPM folks (over on OS2DF1 forum) told us that it was enough
>> to keep the original memsh.dll, so I'm not sure what the problem is.
>> You might try using the GA MMPM. Do make sure to install the medium
>> models as the supplied contexts (e.g., the testing context) are
>> compiled in medium. Brandon ICSS Development
Brandon
Going back to the GA MMPM fixed my problems, however, I would like to see a
fix for this so that I can use the CSD version of MMPM. I need it to fix some
SoundBlaster 16 problems with the original GA MMPM. Only replacing the
MEMSH.DLL as has been suggested will not fix the problem with ICSS. Thanks.
#: 1 S10/Thread Header
11-Feb-94 13:55:33
Sb: Minor ICSS/2 problem.
Fm:
To:
TITLE: MINOR ICSS/2 PROBLEM.
KEYWORDS: CSD
#: 769 S10/IBM Speech Products ∙5 ∙
11-Feb-94 13:55:33
Sb: Minor ICSS/2 problem.
Fm: Vernon Harvey 72120,2767
To: Brandon Booth 71042,3144
Brandon:
Well, I just received and installed my ICSS for OS/2, however, I seem to have
a minor problem. I am using a SoundBlaster 16 ASP, with a cheap micro-phone
for now (still waiting for my Sennheiser) on a 80486DX 50MHz VLB system which
has 16M of memory. The ICSS software installed fine, and the Stock Market
demo works (recognition is a bit weak due to the low quality microphone).
The TSTCTXPM test program does not seem to work. I can start the program,
load the "TESTING" context, and change the settings to accept input from the
"icssmmpm" device. When I press the recognize button, and start speaking
immediately, I get a "Spoke too soon" message like I am supposed to get. If I
wait for the "Start speaking" prompt and then start to speak, no waveform
information information is displayed, and a few seconds after I finish
speaking, I get the message "A failure was detected in the ISTG; ISTG restart
has been attempted".
Any ideas?
Regards,
Vernon Harvey.
#: 774 S10/IBM Speech Products ∙5 ∙
11-Feb-94 22:30:35
Sb: #769-Minor ICSS/2 problem.
Fm: H. V. Belton 75470,3670
To: Vernon Harvey 72120,2767
Vernon,
Check that :
?:\MMOS2\DSP - Where "?" is the Drive where MMPM is installed
was added to your LIBPATH in config.sys
#: 798 S10/IBM Speech Products ∙5 ∙
15-Feb-94 18:58:14
Sb: #774-Minor ICSS/2 problem.
Fm: Vernon Harvey 72120,2767
To: H. V. Belton 75470,3670
Hello H.V.
I have a "C:\MMOS2\DLL" in the LIBPATH statement, but no "DSP". I tried
adding "C:\MMOS2\DSP" as you suggested, but still no luck on the TSTCTXPM.EXE.
Any other suggestions are more than welcome.
Regards,
Vernon.
#: 827 S10/IBM Speech Products ∙5 ∙
16-Feb-94 19:39:50
Sb: #769-Minor ICSS/2 problem.
Fm: Bob Cave 73232,2474
To: Vernon Harvey 72120,2767
Vernon,
We have noticed this symptom when using the latest SB 16 ASP device drivers
provided with MMPM/2 (part of a corrective service package). Are you using
these drivers? If so, your best bet may be to go back to the previous level.
We are currently working on a solution.
If you are not using new drivers, post back here and I will see what else I
can find out.
Bob Cave, ICSS Development (sitting in for Brandon on the net for a few days)
#: 862 S10/IBM Speech Products ∙5 ∙
18-Feb-94 17:35:47
Sb: #827-Minor ICSS/2 problem.
Fm: Vernon Harvey 72120,2767
To: Bob Cave 73232,2474
Bob:
Yes, I am using the MMPM/2 CSD. I will switch back and see it solves the
problem.
Regards,
Vernon.
#: 885 S10/IBM Speech Products ∙5 ∙
19-Feb-94 21:23:06
Sb: #769-Minor ICSS/2 problem.
Fm: Brandon Booth [IBM] 71042,3144
To: Vernon Harvey 72120,2767
Hi Vernon
We do have a known problem with the OS/2 2.1 CSD version. Are you using that
version? I'm not sure how available this version is. It seems strange that
the stock demo would work, but not test context as they exercise the same code
path. Do you have PATH and LIBPATH set correctly? Do you have the environment
variables (ICSSDIR particularly) set correctly. I kind of grasping at straws
here. It would be really great if you could write a small demo program and
see how that works. If you want to register a bug (or for phone tech support)
call our level 1 support at 1-800-627-8363.
Brandon
ICSS Development - IBM
#: 938 S10/IBM Speech Products ∙5 ∙
21-Feb-94 14:45:43
Sb: #885-Minor ICSS/2 problem.
Fm: Vernon Harvey 72120,2767
To: Brandon Booth [IBM] 71042,3144
Brandon:
Bob Cave was kind enough to step in and mention the problem with the CSD for
the MMPM/2 software. I am currently using the CSD version, so I will revert
back to the original MMPM/2 and see if that resolves the problem.
Regards,
Vernon.
#: 946 S10/IBM Speech Products ∙5 ∙
21-Feb-94 18:31:56
Sb: #938-Minor ICSS/2 problem.
Fm: Vernon Harvey 72120,2767
To: Vernon Harvey 72120,2767
Brandon:
With regards to your suggestion of writing a small demo program, I am also in
the process of doing just that. I originally tried to compile and use the
RECO.C test program, but dicovered that apparently I must write PM
applications when using ICSS via a SoundBlaster card. I guess RECO.C is only
valid if using the ACPA card.
If you know of a ways that I can write non-PM applications to work with the SB
card, please let me know, as I am not yet a PM pro.
Regards,
Vernon Harvey.
#: 954 S10/IBM Speech Products ∙5 ∙
21-Feb-94 20:55:42
Sb: #946-Minor ICSS/2 problem.
Fm: Brandon Booth [IBM] 71042,3144
To: Vernon Harvey 72120,2767
Vernon
If you are going to use the ICSS MMPM support (as for Sound Blaster), you have
to write a PM application (the decision was forced on us). Learning PM isn't
really that hard and I recommend giving it a shot.
Brandon
ICSS Development
#: 1125 S10/IBM Speech Products ∙5!∙
01-Mar-94 16:10:36
Sb: #885-Minor ICSS/2 problem.
Fm: Vernon Harvey 72120,2767
To: Brandon Booth [IBM] 71042,3144
Brandon:
Now that the OS/2 2.1 Service Pack is out, I was considering installing it on
my system. You mentioned that the MMPM/2 CSD had a problem when using ICSS
with a SB16 ASP sound card. Does this problem still exist with the OS/2 2.1
Service Pack? I would rather move forward than back to the initial release of
the MMPM/2 software if I can.
Regards,
Vernon Harvey.
#: 1136 S10/IBM Speech Products ∙5!∙
01-Mar-94 21:47:28
Sb: #1125-Minor ICSS/2 problem.
Fm: Bob Cave 73232,2474
To: Vernon Harvey 72120,2767
Vernon,
We will be checking this out over the next couple of days. I am not sure if
the problems were reported on a beta of the MMPM/2 CSD or on the released
version.
Bob Cave, ICSS Development
#: 1140 S10/IBM Speech Products ∙5!∙
01-Mar-94 22:25:07
Sb: #1125-Minor ICSS/2 problem.
Fm: Brandon Booth [IBM] 71042,3144
To: Vernon Harvey 72120,2767
Vernon
My understanding is that the OS/2 2.1 service pack changes in MMPM don't
coexist well with the current version of ICSS for OS/2. I'm sorry, but we
don't have any fixes for this problem right now.
Brandon
ICSS Development
#: 1162 S10/IBM Speech Products ∙5!∙
02-Mar-94 17:09:24
Sb: #1140-Minor ICSS/2 problem.
Fm: Vernon Harvey 72120,2767
To: Brandon Booth [IBM] 71042,3144
Brandon:
1: Is there a fix in the works yet?
2: Can I still install the Service pack and leave the MMPM/2 at its
current level (Ie: Initial release before all the problems with
MMPM/2 CSD and Service Pack software updates?
3: Is the problem in ICSS or in the MMPM/2 software?
4: If a fix is in the works, any idea when it will be available?
5: If I go to an IBM ACPA card, can I use the new MMPM/2?
6: Should I report this as a problem to IBM support? Will it help
to speed up the fix.
I am starting to get into ICSS pretty heavily, but don't want to hit a wall
waiting for the service pack or ICSS fixes if I can get around it. I actually
got my first ICSS PM program working a few nights ago, and am now getting to
really understand the APIs. It really is pretty slick.
I have noticed a few problems with recognition, but I think it may be related
to a problem with MMPM/2. I have noticed that when I record audio passages
using the ICSS record or the MMPM/2 Digital audio, I have static in the
recording (It may be in the playback, I'm not sure yet. I am going to take a
look on the Multimedia forum for OS/2 to see if there is a resolution.
Regards,
Vernon.
#: 1168 S10/IBM Speech Products ∙5!∙
02-Mar-94 19:16:48
Sb: #1162-Minor ICSS/2 problem.
Fm: Ken Solkowitz 72103,2111
To: Vernon Harvey 72120,2767
Vernon -
I might be over stepping my bounds here and I'll hand off to Brandon as the
"authority" but I think you should call IBM and initiate a problem report.
Basically, they released the MMPM code and it broke existing product code and
potentially your product. That will help.
Regards,
Ken Solkowitz
IBM Continuous Speech Series
Bethesda, MD 20817
301-803-3897
#: 1194 S10/IBM Speech Products ∙5!∙
03-Mar-94 20:50:31
Sb: #1162-Minor ICSS/2 problem.
Fm: Brandon Booth [IBM] 71042,3144
To: Vernon Harvey 72120,2767
Vernon
I'm sorry, there is no fix for the problem ICSS and the MMPM service pack
currently available. I don't know if it makes sense to install the OS/2
update and not install if for MMPM (that is I don't what interaction there
is). You might ask that question in the OS2DF1 MMPM section. I don't know if
it is ICSS or MMPM, but we worked well together before MMPM changed, so I (of
course) suspect MMPM :-). I you use an ACPA card you do not have to use the
icssmmpm input source dll. You can use the icssacpa input source dll. This
later dll does not use MMPM at all (but is limited to ACPA cards only),
thereby bypassing the problem. Reporting the problem to to IBM support might
actually help light the right fires (its screwy how things work).
On your static problem, are you using an 8-bit card right now? That can be a
source of added noise. Static will definitely interfere with recognition.
Placing the audio card away from other cards and the system fan might help.
Brandon
ICSS Development
#: 1206 S10/IBM Speech Products ∙5!∙
04-Mar-94 16:32:28
Sb: #1194-Minor ICSS/2 problem.
Fm: Vernon Harvey 72120,2767
To: Brandon Booth [IBM] 71042,3144
Brandon:
Thanks for your info in the ICSS and MMPM/2 problems. I guess I will submit a
problem report to IBM to get that ball rolling. I think in the mean time I
might go buy an ACPA card to ensure that I have something working.
With regards to the SoundBlaster card, it is a 16 bit card in a 16 bit slot,
so I assume that is not the problem. I will try re-arranging my card to see
if that eliminates the problem with the static. It is sitting in between my
VLB SCSI and Diamond Stealth card right now, so maybe there is a problem.
Regards,
Vernon.
#: 1229 S10/IBM Speech Products ∙5!∙
05-Mar-94 14:11:01
Sb: #1206-Minor ICSS/2 problem.
Fm: Brandon Booth [IBM] 71042,3144
To: Vernon Harvey 72120,2767
Hi Vernon
Moving cards around is one thing I can suggest. We have seen problems in the
past with placing the audio card near the system fan or other cards which
generate RF (radio frequency) noise. For instance token ring cards can
generate a substantial hum. Naturally, we don't know all of the situations
which cause problems. I should say we have used the Sound Blaster 16 card in
our lab with good results.
Brandon
ICSS Development
#: 1850 S10/IBM Speech Products ∙5!∙
10-Apr-94 00:42:56
Sb: Minor ICSS/2 problem.
Fm: Bruce Hartwell 72056,1064
To: Brandon Booth [IBM] 71042,3144
I am running into the same error message described in this thread:
"A Failer Has Been Detected In The ISTG, ISTG Restart Has Been Attempted".
I am running ICSS for OS/2 2.1 on an IBM clone 486 33Mhz with 20MB of RAM and
a Soundblaster Pro ( old 8 bit version ). I have not upgraded to the service
pack yet but
I am using the lastest Soundblaster drivers.
I couldn't tell from reading through the messages if a fix had been found or
not. Has
a fix or workaround been found? If not, does this problem create any major
side-effects
in applications created using the API?
Thanks in advance for the info.
#: 1855 S10/IBM Speech Products ∙5!∙
10-Apr-94 18:35:11
Sb: #1850-Minor ICSS/2 problem.
Fm: Brandon Booth [IBM] 71042,3144
To: Bruce Hartwell 72056,1064
Bruce
The problem that has been described is using ICSS for OS/2 with the OS/2 2.1
CSD version of MMPM/2. Some people have been able to work around it by using
the original version of MEMSH.DLL. Others have not gotten this to work. We
do not have a fix for this problem, and it is currently under investigation.
The problem tends to interfere with recognition. Is this happening to you?
Brandon
ICSS Development
#: 1856 S10/IBM Speech Products ∙5!∙
10-Apr-94 21:56:10
Sb: Minor ICSS/2 problem.
Fm: Bruce Hartwell 72056,1064
To: Brandon Booth [IBM] 71042,3144
Brandon:
The sample app that comes with ICSS ( stockmarket ) seems to work (
recognition is
weak, but that is likely due to my low quality board and mic ). I haven't
gotten into
the API enough yet to really test overall performance.
I am in the intial investigation phase of a project that requires fairly
consistent
voice recognition on an OS/2 box. I was concerned when I read through several
messages describing errors.
Any idea when a fix might be forthcoming? In your opinion, would this be a
good product to use in developing an educational system that required strong
voice recognition?
Thanks for your help!
#: 1977 S10/IBM Speech Products ∙5!∙
14-Apr-94 21:27:39
Sb: #1856-Minor ICSS/2 problem.
Fm: Brandon Booth [IBM] 71042,3144
To: Bruce Hartwell 72056,1064
Bruce
Our ICSS for OS/2 works very well on the original OS/2 2.1 with MMPM, if you
use a reasonable card and mic. A 16 bit card (such as a Creative Labs or
Media Vision 16 bit card) is a better choice than a 8 bit card (because we
have to scale the 8 bit samples to 16; there is more noise in the result and
less information). We give a list of mics that we've used successfully in the
tech ref. In evaluating your system, I recommend using the Test Context
program tstctxpm.exe. It will allow you to interact directly with a context,
and it will display the waveform. This will allow you to see how much noise
there is in your system. We also have a record/playback feature. You should
record and play some files and listen to the results. If you hear lots of
noise, this will be a problem for speech recognition.
As for the problem with the MMPM CSD, it is being worked on by the MMPM group
and our group together. I do not have a date for when the problem will be
found (these things sometimes take time).
Brandon
ICSS Development
#: 1 S10/Thread Header
15-Mar-94 21:00:17
Sb: ICSS & MMPM/2 CSD Fix
Fm:
To:
TITLE: ICSS & MMPM/2 CSD FIX
KEYWORDS: CSD SB16 CREATIVE
#: 1394 S10/IBM Speech Products ∙5!∙
15-Mar-94 21:00:17
Sb: ICSS & MMPM/2 CSD Fix
Fm: Bob Cave 73232,2474
To: all
Hello all.
Thanks to the folks down in the MMPM development group, we now have a
work-around for the problems you have encountered when using ICSS with the new
MMPM/2 CSD. Before installing the CSD, copy x:\MMOS2\DLL\MEMSH.DLL to another
directory. After installing, copy MEMSH.DLL back over the copy included with
the service pack. According to the MMPM developers, there should be no
negative ramifications of using the G.A. version of MEMSH.DLL with the CSD.
Regards,
Bob Cave, ICSS Development
#: 1397 S10/IBM Speech Products ∙5!∙
15-Mar-94 22:56:38
Sb: #1394-ICSS & MMPM/2 CSD Fix
Fm: Steve Weeks 70541,646
To: Bob Cave 73232,2474
> Before installing the CSD, ........
Thanks Bob, I know about 6 people that will need to know this and will pass it
on.
Steve Weeks
#: 1426 S10/IBM Speech Products ∙5!∙
16-Mar-94 18:19:55
Sb: #1394-ICSS & MMPM/2 CSD Fix
Fm: AL COLE 71552,2526
To: Bob Cave 73232,2474
I've order ICSS am on back order, but I have already installed the OS/2 2.1
CSD. What should I do when I install ICSS?
Thanks Al Cole
#: 1454 S10/IBM Speech Products ∙5!∙
17-Mar-94 21:35:04
Sb: #1426-ICSS & MMPM/2 CSD Fix
Fm: Brandon Booth [IBM] 71042,3144
To: AL COLE 71552,2526
Hi Al
If you are using the OS/2 2.1 CSD, you need to save x:\mmos2\dll\memsh.dll
from your GA (pre CSD) installation. After the CSD install, overwrite the csd
dll with the earlier version. This information is from the MMPM folks.
Brandon
ICSS Development
#: 1456 S10/IBM Speech Products ∙5!∙
17-Mar-94 22:31:22
Sb: #1426-ICSS & MMPM/2 CSD Fix
Fm: Bob Cave 73232,2474
To: AL COLE 71552,2526
This refers specifically to the MMPM portion of the CSD. If you have already
installed it, you should (I think) be able to extract the original GA version
of MEMSH.DLL from your MMPM install disks or CD. Can anyone confirm this?
Regards,
Bob Cave, IBM
ICSS Development
#: 1544 S10/IBM Speech Products ∙5!∙
24-Mar-94 21:04:50
Sb: #1456-ICSS & MMPM/2 CSD Fix
Fm: Christopher R. Peck 74214,3567
To: Bob Cave 73232,2474
Perhaps you could place memsh.dll from the GA mmpm disk on here & your ftp
sites
for downloading? Or simple instructions to get it (do I use unpack?)...
#: 1545 S10/IBM Speech Products ∙5!∙
24-Mar-94 21:16:45
Sb: #1544-ICSS & MMPM/2 CSD Fix
Fm: Bob Cave 73232,2474
To: Christopher R. Peck 74214,3567
Christopher,
>>Perhaps you could place memsh.dll from the GA mmpm disk on here & your ftp
>>sites for downloading? Or simple instructions to get it (do I use
unpack?)...
I will look into this...it seems like a good idea. At the least I will try to
get some instructions for getting the GA version off of the diskettes. It may
take a couple of days, since I will be out of the office tomorrow.
Bob Cave, ICSS Development
#: 1552 S10/IBM Speech Products ∙5!∙
24-Mar-94 23:12:24
Sb: #1545-ICSS & MMPM/2 CSD Fix
Fm: Christopher R. Peck 74214,3567
To: Bob Cave 73232,2474
Geez - do you live on here? I just posted that question...
I have the CD version of OS/2 2.1 & the file didn't appear to be in packed
format on there - but the test program still doesn't work correctly. The
Stock
program seems to be fine.
Can I get away with the smaller C compiler (the $99 kit) for doing
applications
or do I need the full C/Set & OS/2 Toolkit?
-thanks (I hope)
Chris
#: 1606 S10/IBM Speech Products ∙5!∙
28-Mar-94 21:38:58
Sb: #1552-ICSS & MMPM/2 CSD Fix
Fm: Bob Cave 73232,2474
To: Christopher R. Peck 74214,3567
Christopher,
Timing is everything, I guess...
As far as I know, you should be able to use the smaller compiler. ICSS does
not require the toolkit.
Your problem with the Test Context program is strange. I will check out one
of the systems at the office and see what I can find out.
Regards,
Bob Cave, ICSS Development, IBM
#: 1610 S10/IBM Speech Products ∙5!∙
28-Mar-94 23:20:45
Sb: #1552-ICSS & MMPM/2 CSD Fix
Fm: Brandon Booth [IBM] 71042,3144
To: Christopher R. Peck 74214,3567
Christopher
Just a bit of advice, you're going to really wish you had the OS/2 toolkit if
your doing OS/2 programming. The compiler alone isn't going to cut it.
Brandon
#: 1631 S10/IBM Speech Products ∙5!∙
29-Mar-94 16:41:30
Sb: #1610-ICSS & MMPM/2 CSD Fix
Fm: Erwin Kiss 72202,666
To: Brandon Booth [IBM] 71042,3144
Brandon,
I recently bought ICSS Toolkit and I am impressed with its
capabilities. Even though I am only using an 8-bit Sound Blaster Pro card and
a $35 Radio Shack microphone, I still get pretty good results. My question is
about the choice of compilers. I use Borland's C++ for OS/2, do you know if
the ICSS API can be called from Borland? Also, does the IBM OS/2 Toolkit work
with Borland? And finally, how much does the Toolkit cost, and where can I
purchase it?
You have a great product, and I can't wait until I have enough time to speech
enable my simple home-made text adventure games.
#: 1653 S10/IBM Speech Products ∙5!∙
30-Mar-94 23:01:45
Sb: #1631-ICSS & MMPM/2 CSD Fix
Fm: Brandon Booth [IBM] 71042,3144
To: Erwin Kiss 72202,666
Erwin
I believe you can use the Borland compiler. I think there are couple of
tricks to know. I'll ask at work and post the results. I'm sure the OS/2
toolkit works with the Borland compiler. I'm sorry, I don't know how much it
costs. I think you can buy it from Indelible Blue: 1-800-776-8284.
Brandon
ICSS Development
#: 1554 S10/IBM Speech Products ∙5!∙
24-Mar-94 23:59:14
Sb: #1545-ICSS & MMPM/2 CSD Fix
Fm: Christopher R. Peck 74214,3567
To: Bob Cave 73232,2474
I just install mmpm/2 from my OS/2 2.1 CD on top of the MMPM/2 csd, the test
application works just fine now. I had tried just replacing mapsh.dll - but -
that didn't fix it (maybe it's the SB16 drivers). If I have time this weekend
I'll try installing Creative Labs new MMPM drivers & see what happens.
#: 1607 S10/IBM Speech Products ∙5!∙
28-Mar-94 21:40:35
Sb: #1554-ICSS & MMPM/2 CSD Fix
Fm: Bob Cave 73232,2474
To: Christopher R. Peck 74214,3567
Chris,
I saw this note after posting my reply to your previous note. Let me know how
it goes with the new drivers from Creative Labs.
Regards,
Bob Cave, ICSS Development, IBM
#: 1611 S10/IBM Speech Products ∙5!∙
28-Mar-94 23:20:47
Sb: #1554-ICSS & MMPM/2 CSD Fix
Fm: Brandon Booth [IBM] 71042,3144
To: Christopher R. Peck 74214,3567
Christopher
The MMPM folks told us that just using the original memsh.dll with the csd
mmpm would work, so that's what we're telling people. If it's not working for
you, we need to know.
Brandon
ICSS Development
#: 1793 S10/IBM Speech Products ∙5!∙
06-Apr-94 17:58:46
Sb: #1611-ICSS & MMPM/2 CSD Fix
Fm: Vernon Harvey 72120,2767
To: Brandon Booth [IBM] 71042,3144
Brandon:
I just tried the suggested work around for the MMPM/2 CSD, and seem to be
semi-operational. I can use the record and playback operations on the context
test program, but the test context operation is still giving me the ISTG
failed message. Any suggestions other than blasting the MMPM/2 and restoring
the original version?
#: 1795 S10/IBM Speech Products ∙5!∙
06-Apr-94 21:53:15
Sb: #1793-ICSS & MMPM/2 CSD Fix
Fm: Bob Cave 73232,2474
To: Vernon Harvey 72120,2767
Vernon,
Which audio adapter are you using? If it is the SB16, I suggest also trying
the GA drivers (SB16D2.SYS) or downloading the latest version from the
Creative Labs section (sb_os2.exe in library 3). I was able to get both of
these drivers to work with ICSS and the GA version of MEMSH.DLL.
Regards,
Bob Cave, ICSS Development, IBM
#: 1806 S10/IBM Speech Products ∙5!∙
07-Apr-94 13:57:36
Sb: #1795-ICSS & MMPM/2 CSD Fix
Fm: Vernon Harvey 72120,2767
To: Bob Cave 73232,2474
Bob:
Thanks for the suggestion. Shortly after I posted that last message, I came
read another message from either yourself or Brandon which mentioned just
that. I have obtained the new SB16 drivers from the Creative Labs Forum, and
will be installing tonight.
Regards,
Vernon Harvey.
#: 1838 S10/IBM Speech Products ∙5!∙
09-Apr-94 12:27:02
Sb: #1793-ICSS & MMPM/2 CSD Fix
Fm: Brandon Booth [IBM] 71042,3144
To: Vernon Harvey 72120,2767
Vernon
The one thing I can definitely say is that ICSS works very well on the
original OS/2 2.1 with original MMPM/2. It seems likely that the combination
of the OS/2 2.1 CSD with the original MMPM/2 should work (but is in some ways
not a great solution). The other possibility of just using the original
MEMSH.DLL with the CSD MMPM has apparently worked in some cases, but not all
(obviously). I am sorry that is all I know right now. Detective work on the
problem is going on right now in our lab and in the MMPM lab. I do not know
when the problem will be solved.
Brandon
ICSS Development
#: 1882 S10/IBM Speech Products ∙5!∙
11-Apr-94 18:15:58
Sb: #1838-ICSS & MMPM/2 CSD Fix
Fm: Vernon Harvey 72120,2767
To: Brandon Booth [IBM] 71042,3144
Brandon:
I finally got around to installing the Creative Labs SB card updates and
everything now appears to be working properly.
Thanks for your assistance.
Regards,
Vernon Harvey
#: 1979 S10/IBM Speech Products ∙5!∙
14-Apr-94 21:27:42
Sb: #1882-ICSS & MMPM/2 CSD Fix
Fm: Brandon Booth [IBM] 71042,3144
To: Vernon Harvey 72120,2767
Hi Vernon
I'm glad your up and running.
Brandon
ICSS Development
#: 1 S10/Thread Header
11-Apr-94 08:21:28
Sb: Big Processing Time
Fm:
To:
TITLE: BIG PROCESSING TIME
KEYWORDS: SPEED GRAMMAR
#: 1859 S10/IBM Speech Products ∙5!∙
11-Apr-94 08:21:28
Sb: Big Processing Time
Fm: Office Automation 100012,3267
To: All 72120,2767
Hello,
I have a problem with the processing time using ICSS.
I need a quite big grammar and to have good accuracy, I have to compile it
with the High Bandwith Model and with a very small value for the Beam
parameter (0.0000001).
When I say "AVAILABILITY TO NEW-YORK", I have to wait about 3 or 4 seconds for
the response (after saying it).
Is it normal to wait so much ?
I use a 486 DX 66 with 32 Megabytes.
Regards,
Ian GAMBINI, Amadeus Company
#: 1883 S10/IBM Speech Products ∙5!∙
11-Apr-94 18:22:03
Sb: #1859-Big Processing Time
Fm: Vernon Harvey 72120,2767
To: Office Automation 100012,3267
Ian:
Check your End Silence duration parameter. This parameter tells the ICSS
software how much silence to wait for before assuming that the user has
stopped speaking. It probably needs to be a trimmed down a bit.
Regards,
Vernon Harvey
#: 1892 S10/IBM Speech Products ∙5!∙
12-Apr-94 07:47:52
Sb: #1883-Big Processing Time
Fm: Office Automation 100012,3267
To: Vernon Harvey 72120,2767
Harvey:
First, thank you for your answer, but the End Silence duration parameter was
set to 0.8 seconds (the default value). I tryed to take 0.4 seconds for this
parameter but my problem is the same (less 0.4 seconds). I have to wait about
3 seconds for a small sentence and up to 10 or more if I say something longer.
Is it normal to have to wait so much ?
Regards,
Ian GAMBINI, Amadeus Company
#: 1908 S10/IBM Speech Products ∙5!∙
12-Apr-94 20:40:28
Sb: #1859-Big Processing Time
Fm: Bob Cave 73232,2474
To: Office Automation 100012,3267
Ian,
>>> I need a quite big grammar
Could you be a little more specific as to how large the grammar is? The
combination of a large grammar (more choices at any point in the search) and
a high bandwidth context will have a negative impact on how long it takes to
complete a recognition. Is there any way you can break up your context into
several smaller ones? This might even allow you to get away with using the
medium bandwidth, which will also help performance.
Regards,
Bob Cave
ICSS Development, IBM
#: 1916 S10/IBM Speech Products ∙5!∙
13-Apr-94 09:30:32
Sb: #1859-Big Processing Time
Fm: Ora Williamson 71742,2161
To: Office Automation 100012,3267
>>I have a problem with the processing time using ICSS.
Ian: I am from the ICSS development lab. It sounds like something is up with
your grammar. Are you compiling Finite State or Word Pair? How nested
are your productions? Would you mind faxing us a copy of your grammar, or
sending us the .BNF and .DIC, so we can test it, tweak it, improve it, and
send it back to you?
The fax number is 301-803-3167. Send it to my attention.
Regards, Ora Williamson ICSS Technical Liason IBM Spoken Language Systems Lab
#: 1980 S10/IBM Speech Products ∙5!∙
14-Apr-94 21:58:42
Sb: #1859-Big Processing Time
Fm: Brandon Booth [IBM] 71042,3144
To: Office Automation 100012,3267
Ian
You would be better off with a group of smaller contexts, if possible. You'll
get better accuracy and better speed. As has been noted you can play with the
beam width (which is a speed vs. accuracy tradeoff), and the end silence
duration (where you have to be careful of people who speak slowly). If you
can use medium bandwidth, that is faster. Also you need to pay attention to
the structure of the bnf. It is possible to have faster and slower bnf's
which produce the same results.
Brandon
ICSS Development
#: 1 S10/Thread Header
12-Apr-94 20:45:54
Sb: Installing ICSS from B:
Fm:
To:
TITLE: INSTALLING ICSS FROM B:
KEYWORDS:
#: 1909 S10/IBM Speech Products ∙5!∙
12-Apr-94 20:45:54
Sb: Installing ICSS from B:
Fm: Bob Cave 73232,2474
To: all
Hi all, I couldn't find the original note on this topic, but I would like to
post the information here anyway...
There was a post last week from a user who ran into problems installing ICSS
for Windows from the B: drive. There is a bug at the end of the ICSS install,
when it asks for the Win32s diskette. You can clear this message and complete
the install. Then, put the Win32s diskette back in the B: drive and invoke
the setup program directly. The diskette is the standard Win32s distribution
diskette, and has its own setup program. The ICSS install program simply
invokes it for you as a last step.
Any questions, please post.
Regards,
Bob Cave
ICSS Development, IBM